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Preface 


Welcome to College Physics, an OpenStax resource. This textbook was 
written to increase student access to high-quality learning materials, 
maintaining highest standards of academic rigor at little to no cost. 


About OpenStax 


OpenStax is a nonprofit based at Rice University, and it’s our mission to 
improve student access to education. Our first openly licensed college 
textbook was published in 2012, and our library has since scaled to over 20 
books for college and AP courses used by hundreds of thousands of 
students. Our adaptive learning technology, designed to improve learning 
outcomes through personalized educational paths, is being piloted in 
college courses throughout the country. Through our partnerships with 
philanthropic foundations and our alliance with other educational resource 
organizations, OpenStax is breaking down the most common barriers to 
learning and empowering students and instructors to succeed. 


About OpenStax Resources 


Customization 


College Physics is licensed under a Creative Commons Attribution 4.0 
International (CC BY) license, which means that you can distribute, remix, 
and build upon the content, as long as you provide attribution to OpenStax 
and its content contributors. 


Because our books are openly licensed, you are free to use the entire book 
or pick and choose the sections that are most relevant to the needs of your 
course. Feel free to remix the content by assigning your students certain 
chapters and sections in your syllabus, in the order that you prefer. You can 
even provide a direct link in your syllabus to the sections in the web view of 
your book. 


Instructors also have the option of creating a customized version of their 
OpenStax book. The custom version can be made available to students in 
low-cost print or digital form through their campus bookstore. Visit your 
book page on openstax.org for more information. 


Errata 


All OpenStax textbooks undergo a rigorous review process. However, like 
any professional-grade textbook, errors sometimes occur. Since our books 
are web based, we can make updates periodically when deemed 
pedagogically necessary. If you have a correction to suggest, submit it 
through the link on your book page on openstax.org. Subject matter experts 
review all errata suggestions. OpenStax is committed to remaining 
transparent about all updates, so you will also find a list of past errata 
changes on your book page on openstax.org. 


Format 


You can access this textbook for free in web view or PDF through 
openstax.org, and in low-cost print and iBooks editions. 


About College Physics 


College Physics meets standard scope and sequence requirements for a two- 
semester introductory algebra-based physics course. The text is grounded in 
real-world examples to help students grasp fundamental physics concepts. It 
requires knowledge of algebra and some trigonometry, but not calculus. 
College Physics includes learning objectives, concept questions, links to 
labs and simulations, and ample practice opportunities for traditional 
physics application problems. 


Coverage and Scope 


College Physics is organized such that topics are introduced conceptually 
with a steady progression to precise definitions and analytical applications. 
The analytical aspect (problem solving) is tied back to the conceptual 
before moving on to another topic. Each introductory chapter, for example, 
opens with an engaging photograph relevant to the subject of the chapter 
and interesting applications that are easy for most students to visualize. 
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Concepts and Calculations 


The ability to calculate does not guarantee conceptual understanding. In 
order to unify conceptual, analytical, and calculation skills within the 
learning process, we have integrated Strategies and Discussions throughout 
the text. 


Modern Perspective 


The chapters on modern physics are more complete than many other texts 
on the market, with an entire chapter devoted to medical applications of 
nuclear physics and another to particle physics. The final chapter of the 
text, “Frontiers of Physics,” is devoted to the most exciting endeavors in 
physics. It ends with a module titled “Some Questions We Know to Ask.” 


Key Features 


Modularity 


This textbook is organized as a collection of modules that can be rearranged 
and modified to suit the needs of a particular professor or class. That being 
said, modules often contain references to content in other modules, as most 
topics in physics cannot be discussed in isolation. 


Learning Objectives 


Every module begins with a set of learning objectives. These objectives are 
designed to guide the instructor in deciding what content to include or 
assign, and to guide the student with respect to what he or she can expect to 
learn. After completing the module and end-of-module exercises, students 
should be able to demonstrate mastery of the learning objectives. 


Call-Outs 


Key definitions, concepts, and equations are called out with a special design 
treatment. Call-outs are designed to catch readers’ attention, to make it clear 
that a specific term, concept, or equation is particularly important, and to 
provide easy reference for a student reviewing content. 


Key Terms 


Key terms are in bold and are followed by a definition in context. 
Definitions of key terms are also listed in the Glossary, which appears at the 
end of the module. 


Worked Examples 


Worked examples have four distinct parts to promote both analytical and 
conceptual skills. Worked examples are introduced in words, always using 
some application that should be of interest. This is followed by a Strategy 
section that emphasizes the concepts involved and how solving the problem 


relates to those concepts. This is followed by the mathematical Solution and 
Discussion. 


Many worked examples contain multiple-part problems to help the students 
learn how to approach normal situations, in which problems tend to have 
multiple parts. Finally, worked examples employ the techniques of the 
problem-solving strategies so that students can see how those strategies 
succeed in practice as well as in theory. 


Problem-Solving Strategies 


Problem-solving strategies are first presented in a special section and 
subsequently appear at crucial points in the text where students can benefit 
most from them. Problem-solving strategies have a logical structure that is 
reinforced in the worked examples and supported in certain places by line 
drawings that illustrate various steps. 


Misconception Alerts 


Students come to physics with preconceptions from everyday experiences 
and from previous courses. Some of these preconceptions are 
misconceptions, and many are very common among students and the 
general public. Some are inadvertently picked up through 
misunderstandings of lectures and texts. The Misconception Alerts feature 
is designed to point these out and correct them explicitly. 


Take-Home Investigations 


Take Home Investigations provide the opportunity for students to apply or 
explore what they have learned with a hands-on activity. 


Things Great and Small 


In these special topic essays, macroscopic phenomena (such as air pressure) 
are explained with submicroscopic phenomena (such as atoms bouncing off 
walls). These essays support the modern perspective by describing aspects 
of modern physics before they are formally treated in later chapters. 
Connections are also made between apparently disparate phenomena. 


Simulations 


Where applicable, students are directed to the interactive PHeT physics 
simulations developed by the University of Colorado. There they can 
further explore the physics concepts they have learned about in the module. 


Summary 


Module summaries are thorough and functional and present all important 
definitions and equations. Students are able to find the definitions of all 
terms and symbols as well as their physical relationships. The structure of 
the summary makes plain the fundamental principles of the module or 
collection and serves as a useful study guide. 


Glossary 


At the end of every module or chapter is a Glossary containing definitions 
of all of the key terms in the module or chapter. 


End-of-Module Problems 


At the end of every chapter is a set of Conceptual Questions and/or skills- 
based Problems & Exercises. Conceptual Questions challenge students’ 
ability to explain what they have learned conceptually, independent of the 
mathematical details. Problems & Exercises challenge students to apply 
both concepts and skills to solve mathematical physics problems. Online, 


every other problem includes an answer that students can reveal 
immediately by clicking on a “Show Solution” button. 


In addition to traditional skills-based problems, there are three special types 
of end-of-module problems: Integrated Concept Problems, Unreasonable 
Results Problems, and Construct Your Own Problems. All of these 
problems are indicated with a subtitle preceding the problem. 


Integrated Concept Problems 


In Integrated Concept Problems, students are asked to apply what they have 
learned about two or more concepts to arrive at a solution to a problem. 
These problems require a higher level of thinking because, before solving a 
problem, students have to recognize the combination of strategies required 
to solve it. 


Unreasonable Results 


In Unreasonable Results Problems, students are challenged to not only 
apply concepts and skills to solve a problem, but also to analyze the answer 
with respect to how likely or realistic it really is. These problems contain a 
premise that produces an unreasonable answer and are designed to further 
emphasize that properly applied physics must describe nature accurately 
and is not simply the process of solving equations. 


Construct Your Own Problem 


These problems require students to construct the details of a problem, 
justify their starting assumptions, show specific steps in the problem’s 
solution, and finally discuss the meaning of the result. These types of 
problems relate well to both conceptual and analytical aspects of physics, 
emphasizing that physics must describe nature. Often they involve an 
integration of topics from more than one chapter. Unlike other problems, 
solutions are not provided since there is no single correct answer. 


Instructors should feel free to direct students regarding the level and scope 
of their considerations. Whether the problem is solved and described 
correctly will depend on initial assumptions. 


Additional Resources 


Student and Instructor Resources 


We’ve compiled additional resources for both students and instructors, 
including Getting Started Guides, an instructor solution manual, and 
PowerPoint slides. Instructor resources require a verified instructor account, 
which can be requested on your openstax.org log-in. Take advantage of 
these resources to supplement your OpenStax book. 


Partner Resources 


OpenStax Partners are our allies in the mission to make high-quality 
learning materials affordable and accessible to students and instructors 
everywhere. Their tools integrate seamlessly with our OpenStax titles at a 
low cost. To access the partner resources for your text, visit your book page 
on openstax.org. 
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Introduction to Science and the Realm of Physics, Physical Quantities, and 
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Galaxies are 
as immense 
as atoms are 
small. Yet the 
same laws of 
physics 
describe 
both, and all 
the rest of 
nature—an 
indication of 
the 
underlying 
unity in the 
universe. The 
laws of 
physics are 
surprisingly 
few in 
number, 
implying an 
underlying 
simplicity to 
nature’s 
apparent 
complexity. 
(credit: 
NASA, JPL- 
Caltech, P. 
Barmby, 
Harvard- 
Smithsonian 
Center for 


Astrophysics 
) 


What is your first reaction when you hear the word “physics”? Did you 
imagine working through difficult equations or memorizing formulas that 
seem to have no real use in life outside the physics classroom? Many people 
come to the subject of physics with a bit of fear. But as you begin your 
exploration of this broad-ranging subject, you may soon come to realize 
that physics plays a much larger role in your life than you first thought, no 
matter your life goals or career choice. 


For example, take a look at the image above. This image is of the 
Andromeda Galaxy, which contains billions of individual stars, huge clouds 
of gas, and dust. Two smaller galaxies are also visible as bright blue spots in 
the background. At a staggering 2.5 million light years from the Earth, this 
galaxy is the nearest one to our own galaxy (which is called the Milky 
Way). The stars and planets that make up Andromeda might seem to be the 
furthest thing from most people’s regular, everyday lives. But Andromeda is 
a great starting point to think about the forces that hold together the 
universe. The forces that cause Andromeda to act as it does are the same 
forces we contend with here on Earth, whether we are planning to send a 
rocket into space or simply raise the walls for a new home. The same 
gravity that causes the stars of Andromeda to rotate and revolve also causes 
water to flow over hydroelectric dams here on Earth. Tonight, take a 
moment to look up at the stars. The forces out there are the same as the ones 
here on Earth. Through a study of physics, you may gain a greater 


understanding of the interconnectedness of everything we can see and know 
in this universe. 


Think now about all of the technological devices that you use on a regular 
basis. Computers, smart phones, GPS systems, MP3 players, and satellite 
radio might come to mind. Next, think about the most exciting modern 
technologies that you have heard about in the news, such as trains that 
levitate above tracks, “invisibility cloaks” that bend light around them, and 
microscopic robots that fight cancer cells in our bodies. All of these 
groundbreaking advancements, commonplace or unbelievable, rely on the 
principles of physics. Aside from playing a significant role in technology, 
professionals such as engineers, pilots, physicians, physical therapists, 
electricians, and computer programmers apply physics concepts in their 
daily work. For example, a pilot must understand how wind forces affect a 
flight path and a physical therapist must understand how the muscles in the 
body experience forces as they move and bend. As you will learn in this 
text, physics principles are propelling new, exciting technologies, and these 
principles are applied in a wide range of careers. 


In this text, you will begin to explore the history of the formal study of 
physics, beginning with natural philosophy and the ancient Greeks, and 
leading up through a review of Sir Isaac Newton and the laws of physics 
that bear his name. You will also be introduced to the standards scientists 
use when they study physical quantities and the interrelated system of 
measurements most of the scientific community uses to communicate in a 
single mathematical language. Finally, you will study the limits of our 
ability to be accurate and precise, and the reasons scientists go to 
painstaking lengths to be as clear as possible regarding their own 
limitations. 


Physics: An Introduction 


e Explain the difference between a principle and a law. 
e Explain the difference between a model and a theory. 


The flight formations of migratory 
birds such as Canada geese are 
governed by the laws of physics. 
(credit: David Merrett) 


The physical universe is enormously complex in its detail. Every day, each 
of us observes a great variety of objects and phenomena. Over the centuries, 
the curiosity of the human race has led us collectively to explore and 
catalog a tremendous wealth of information. From the flight of birds to the 
colors of flowers, from lightning to gravity, from quarks to clusters of 
galaxies, from the flow of time to the mystery of the creation of the 
universe, we have asked questions and assembled huge arrays of facts. In 
the face of all these details, we have discovered that a surprisingly small 
and unified set of physical laws can explain what we observe. As humans, 
we inake generalizations and seek order. We have found that nature is 
remarkably cooperative—it exhibits the underlying order and simplicity we 
so value. 


It is the underlying order of nature that makes science in general, and 
physics in particular, so enjoyable to study. For example, what do a bag of 
chips and a car battery have in common? Both contain energy that can be 


converted to other forms. The law of conservation of energy (which says 
that energy can change form but is never lost) ties together such topics as 
food calories, batteries, heat, light, and watch springs. Understanding this 
law makes it easier to learn about the various forms energy takes and how 
they relate to one another. Apparently unrelated topics are connected 
through broadly applicable physical laws, permitting an understanding 
beyond just the memorization of lists of facts. 


The unifying aspect of physical laws and the basic simplicity of nature form 
the underlying themes of this text. In learning to apply these laws, you will, 
of course, study the most important topics in physics. More importantly, 
you will gain analytical abilities that will enable you to apply these laws far 
beyond the scope of what can be included in a single book. These analytical 
skills will help you to excel academically, and they will also help you to 
think critically in any professional career you choose to pursue. This 
module discusses the realm of physics (to define what physics is), some 
applications of physics (to illustrate its relevance to other disciplines), and 
more precisely what constitutes a physical law (to illuminate the importance 
of experimentation to theory). 


Science and the Realm of Physics 


Science consists of the theories and laws that are the general truths of nature 
as well as the body of knowledge they encompass. Scientists are continually 
trying to expand this body of knowledge and to perfect the expression of the 
laws that describe it. Physics is concerned with describing the interactions 
of energy, matter, space, and time, and it is especially interested in what 
fundamental mechanisms underlie every phenomenon. The concern for 
describing the basic phenomena in nature essentially defines the realm of 
physics. 


Physics aims to describe the function of everything around us, from the 
movement of tiny charged particles to the motion of people, cars, and 
spaceships. In fact, almost everything around you can be described quite 
accurately by the laws of physics. Consider a smart phone ([link]). Physics 
describes how electricity interacts with the various circuits inside the 
device. This knowledge helps engineers select the appropriate materials and 


circuit layout when building the smart phone. Next, consider a GPS system. 
Physics describes the relationship between the speed of an object, the 
distance over which it travels, and the time it takes to travel that distance. 
When you use a GPS device in a vehicle, it utilizes these physics equations 
to determine the travel time from one location to another. 


The Apple 
“iPhone” is a 


common 
smart phone 
with a GPS 
function. 
Physics 
describes the 
way that 
electricity 
flows through 
the circuits of 
this device. 
Engineers use 
their 
knowledge of 
physics to 
construct an 


iPhone with 
features that 
consumers 
will enjoy. 
One specific 
feature of an 
iPhone is the 
GPS 
function. 
GPS uses 
physics 
equations to 
determine the 
driving time 
between two 
locations on a 
map. (credit: 
@gletham 
GIS, Social, 
Mobile Tech 
Images) 


Applications of Physics 


You need not be a scientist to use physics. On the contrary, knowledge of 
physics is useful in everyday situations as well as in nonscientific 
professions. It can help you understand how microwave ovens work, why 
metals should not be put into them, and why they might affect pacemakers. 
(See [link] and [link].) Physics allows you to understand the hazards of 
radiation and rationally evaluate these hazards more easily. Physics also 
explains the reason why a black car radiator helps remove heat in a car 
engine, and it explains why a white roof helps keep the inside of a house 
cool. Similarly, the operation of a car’s ignition system as well as the 
transmission of electrical signals through our body’s nervous system are 


much easier to understand when you think about them in terms of basic 
physics. 


Physics is the foundation of many important disciplines and contributes 
directly to others. Chemistry, for example—-since it deals with the 
interactions of atoms and molecules—is rooted in atomic and molecular 
physics. Most branches of engineering are applied physics. In architecture, 
physics is at the heart of structural stability, and is involved in the acoustics, 
heating, lighting, and cooling of buildings. Parts of geology rely heavily on 
physics, such as radioactive dating of rocks, earthquake analysis, and heat 
transfer in the Earth. Some disciplines, such as biophysics and geophysics, 
are hybrids of physics and other disciplines. 


Physics has many applications in the biological sciences. On the 
microscopic level, it helps describe the properties of cell walls and cell 
membranes ({link] and [link]). On the macroscopic level, it can explain the 
heat, work, and power associated with the human body. Physics is involved 
in medical diagnostics, such as x-rays, magnetic resonance imaging (MRI), 
and ultrasonic blood flow measurements. Medical therapy sometimes 
directly involves physics; for example, cancer radiotherapy uses ionizing 
radiation. Physics can also explain sensory phenomena, such as how 
musical instruments make sound, how the eye detects color, and how lasers 
can transmit information. 


It is not necessary to formally study all applications of physics. What is 
most useful is knowledge of the basic laws of physics and a skill in the 
analytical methods for applying them. The study of physics also can 
improve your problem-solving skills. Furthermore, physics has retained the 
most basic aspects of science, so it is used by all of the sciences, and the 
study of physics makes other sciences easier to understand. 


The laws of physics help 
us understand how 
common appliances 
work. For example, the 
laws of physics can help 
explain how microwave 
ovens heat up food, and 
they also help us 
understand why it is 
dangerous to place metal 
objects in a microwave 
oven. (credit: 
MoneyBlogNewz) 


These two 
applications of 
physics have 
more in common 
than meets the 
eye. Microwave 
ovens use 
electromagnetic 
waves to heat 
food. Magnetic 
resonance 
imaging (MRI) 
also uses 
electromagnetic 
waves to yield an 
image of the 
brain, from which 
the exact location 
of tumors can be 
determined. 
(credit: Rashmi 
Chawla, Daniel 
Smith, and Paul 
E. Marik) 


Physics, chemistry, 


and biology help 
describe the properties 
of cell walls in plant 
cells, such as the onion 
cells seen here. (credit: 
Umberto Salvagnin) 
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An artist’s rendition of the the structure of a cell membrane. 
Membranes form the boundaries of animal cells and are 
complex in structure and function. Many of the most 
fundamental properties of life, such as the firing of nerve cells, 
are related to membranes. The disciplines of biology, 
chemistry, and physics all help us understand the membranes of 
animal cells. (credit: Mariana Ruiz) 


Models, Theories, and Laws; The Role of Experimentation 


The laws of nature are concise descriptions of the universe around us; they 
are human statements of the underlying laws or rules that all natural 
processes follow. Such laws are intrinsic to the universe; humans did not 


create them and so cannot change them. We can only discover and 
understand them. Their discovery is a very human endeavor, with all the 
elements of mystery, imagination, struggle, triumph, and disappointment 
inherent in any creative effort. (See [link] and [link].) The cornerstone of 
discovering natural laws is observation; science must describe the universe 
as it is, not as we may imagine it to be. 


Sir Isaac Newton 


Isaac Newton 
(1642-1727) was 
very reluctant to 

publish his 
revolutionary 
work and had to 
be convinced to 
do so. In his later 
years, he stepped 
down from his 
academic post and 
became 
exchequer of the 
Royal Mint. He 
took this post 


seriously, 
inventing reeding 
(or creating 
ridges) on the 
edge of coins to 
prevent 
unscrupulous 
people from 
trimming the 
silver off of them 
before using them 
as Currency. 
(credit: Arthur 
Shuster and 
Arthur E. Shipley: 
Britain’s Heritage 
of Science. 
London, 1917.) 


Marie Curie 
(1867-1934) 
sacrificed 


monetary assets 
to help finance 
her early 
research and 
damaged her 
physical well- 
being with 
radiation 
exposure. She is 
the only person 
to win Nobel 
prizes in both 
physics and 
chemistry. One 
of her daughters 
also won a 
Nobel Prize. 
(credit: 
Wikimedia 
Commons) 


We all are curious to some extent. We look around, make generalizations, 
and try to understand what we see—for example, we look up and wonder 
whether one type of cloud signals an oncoming storm. As we become 
serious about exploring nature, we become more organized and formal in 
collecting and analyzing data. We attempt greater precision, perform 
controlled experiments (if we can), and write down ideas about how the 
data may be organized and unified. We then formulate models, theories, and 
laws based on the data we have collected and analyzed to generalize and 
communicate the results of these experiments. 


A model is a representation of something that is often too difficult (or 
impossible) to display directly. While a model is justified with experimental 
proof, it is only accurate under limited situations. An example is the 
planetary model of the atom in which electrons are pictured as orbiting the 


nucleus, analogous to the way planets orbit the Sun. (See [link].) We cannot 
observe electron orbits directly, but the mental image helps explain the 
observations we can make, such as the emission of light from hot gases 
(atomic spectra). Physicists use models for a variety of purposes. For 
example, models can help physicists analyze a scenario and perform a 
calculation, or they can be used to represent a situation in the form of a 
computer simulation. A theory is an explanation for patterns in nature that 
is supported by scientific evidence and verified multiple times by various 
groups of researchers. Some theories include models to help visualize 
phenomena, whereas others do not. Newton’s theory of gravity, for 
example, does not require a model or mental image, because we can 
observe the objects directly with our own senses. The kinetic theory of 
gases, on the other hand, is a model in which a gas is viewed as being 
composed of atoms and molecules. Atoms and molecules are too small to 
be observed directly with our senses—thus, we picture them mentally to 
understand what our instruments tell us about the behavior of gases. 


A law uses concise language to describe a generalized pattern in nature that 
is supported by scientific evidence and repeated experiments. Often, a law 
can be expressed in the form of a single mathematical equation. Laws and 
theories are similar in that they are both scientific statements that result 
from a tested hypothesis and are supported by scientific evidence. However, 
the designation law is reserved for a concise and very general statement that 
describes phenomena in nature, such as the law that energy is conserved 
during any process, or Newton’s second law of motion, which relates force, 
mass, and acceleration by the simple equation F = ma. A theory, in 
contrast, is a less concise statement of observed phenomena. For example, 
the Theory of Evolution and the Theory of Relativity cannot be expressed 
concisely enough to be considered a law. The biggest difference between a 
law and a theory is that a theory is much more complex and dynamic. A law 
describes a single action, whereas a theory explains an entire group of 
related phenomena. And, whereas a law is a postulate that forms the 
foundation of the scientific method, a theory is the end result of that 
process. 


Less broadly applicable statements are usually called principles (such as 
Pascal’s principle, which is applicable only in fluids), but the distinction 


between laws and principles often is not carefully made. 


What is a 
model? 
This 
planetary 
model of 
the atom 
shows 
electrons 
orbiting the 
nucleus. It 
isa 
drawing 
that we use 
to form a 
mental 
image of 
the atom 
that we 
cannot see 
directly 
with our 
eyes 
because it 
is too 
small. 


Note: 

Models, Theories, and Laws 

Models, theories, and laws are used to help scientists analyze the data they 
have already collected. However, often after a model, theory, or law has 
been developed, it points scientists toward new discoveries they would not 
otherwise have made. 


The models, theories, and laws we devise sometimes imply the existence of 
objects or phenomena as yet unobserved. These predictions are remarkable 
triumphs and tributes to the power of science. It is the underlying order in 
the universe that enables scientists to make such spectacular predictions. 
However, if experiment does not verify our predictions, then the theory or 
law is wrong, no matter how elegant or convenient it is. Laws can never be 
known with absolute certainty because it is impossible to perform every 
imaginable experiment in order to confirm a law in every possible scenario. 
Physicists operate under the assumption that all scientific laws and theories 
are valid until a counterexample is observed. If a good-quality, verifiable 
experiment contradicts a well-established law, then the law must be 
modified or overthrown completely. 


The study of science in general and physics in particular is an adventure 
much like the exploration of uncharted ocean. Discoveries are made; 
models, theories, and laws are formulated; and the beauty of the physical 
universe is made more sublime for the insights gained. 


Note: 

The Scientific Method 

As scientists inquire and gather information about the world, they follow a 
process called the scientific method. This process typically begins with an 
observation and question that the scientist will research. Next, the scientist 


typically performs some research about the topic and then devises a 
hypothesis. Then, the scientist will test the hypothesis by performing an 
experiment. Finally, the scientist analyzes the results of the experiment and 
draws a conclusion. Note that the scientific method can be applied to many 
situations that are not limited to science, and this method can be modified 
to suit the situation. 

Consider an example. Let us say that you try to turn on your car, but it will 
not start. You undoubtedly wonder: Why will the car not start? You can 
follow a scientific method to answer this question. First off, you may 
perform some research to determine a variety of reasons why the car will 
not start. Next, you will state a hypothesis. For example, you may believe 
that the car is not starting because it has no engine oil. To test this, you 
open the hood of the car and examine the oil level. You observe that the oil 
is at an acceptable level, and you thus conclude that the oil level is not 
contributing to your car issue. To troubleshoot the issue further, you may 
devise a new hypothesis to test and then repeat the process again. 


The Evolution of Natural Philosophy into Modern Physics 


Physics was not always a separate and distinct discipline. It remains 
connected to other sciences to this day. The word physics comes from 
Greek, meaning nature. The study of nature came to be called “natural 
philosophy.” From ancient times through the Renaissance, natural 
philosophy encompassed many fields, including astronomy, biology, 
chemistry, physics, mathematics, and medicine. Over the last few centuries, 
the growth of knowledge has resulted in ever-increasing specialization and 
branching of natural philosophy into separate fields, with physics retaining 
the most basic facets. (See [link], [link], and [link].) Physics as it developed 
from the Renaissance to the end of the 19th century is called classical 
physics. It was transformed into modern physics by revolutionary 
discoveries made starting at the beginning of the 20th century. 


Over the 


centuries, 
natural 
philosophy has 
evolved into 
more 
specialized 
disciplines, as 
illustrated by 
the 
contributions of 
some of the 
greatest minds 
in history. The 
Greek 
philosopher 
Aristotle (384— 
322 B.C.) wrote 
on a broad 
range of topics 
including 
physics, 
animals, the 
soul, politics, 
and poetry. 
(credit: Jastrow 


(2006)/Ludovisi 
Collection) 


Galileo Galilei 
(1564-1642) laid 
the foundation of 

modern 
experimentation 
and made 
contributions in 
mathematics, 
physics, and 
astronomy. 
(credit: 
Domenico 
Tintoretto) 


Niels Bohr 
(1885-1962) 
made 
fundamental 
contributions to 
the development 
of quantum 
mechanics, one 
part of modern 
physics. (credit: 
United States 
Library of 
Congress Prints 
and Photographs 
Division) 


Classical physics is not an exact description of the universe, but it is an 
excellent approximation under the following conditions: Matter must be 
moving at speeds less than about 1% of the speed of light, the objects dealt 
with must be large enough to be seen with a microscope, and only weak 
gravitational fields, such as the field generated by the Earth, can be 
involved. Because humans live under such circumstances, classical physics 
seems intuitively reasonable, while many aspects of modern physics seem 
bizarre. This is why models are so useful in modern physics—they let us 


conceptualize phenomena we do not ordinarily experience. We can relate to 
models in human terms and visualize what happens when objects move at 
high speeds or imagine what objects too small to observe with our senses 
might be like. For example, we can understand an atom’s properties because 
we can picture it in our minds, although we have never seen an atom with 
our eyes. New tools, of course, allow us to better picture phenomena we 
cannot see. In fact, new instrumentation has allowed us in recent years to 
actually “picture” the atom. 


Note: 

Limits on the Laws of Classical Physics 

For the laws of classical physics to apply, the following criteria must be 
met: Matter must be moving at speeds less than about 1% of the speed of 
light, the objects dealt with must be large enough to be seen with a 
microscope, and only weak gravitational fields (such as the field generated 
by the Earth) can be involved. 


Using a 
scanning 
tunneling 
microscope 
(STM), 
scientists can 
see the 
individual 
atoms that 


compose this 

sheet of gold. 
(credit: 

Erwinrossen) 


Some of the most spectacular advances in science have been made in 
modern physics. Many of the laws of classical physics have been modified 
or rejected, and revolutionary changes in technology, society, and our view 
of the universe have resulted. Like science fiction, modern physics is filled 
with fascinating objects beyond our normal experiences, but it has the 
advantage over science fiction of being very real. Why, then, is the majority 
of this text devoted to topics of classical physics? There are two main 
reasons: Classical physics gives an extremely accurate description of the 
universe under a wide range of everyday circumstances, and knowledge of 
classical physics is necessary to understand modern physics. 


Modern physics itself consists of the two revolutionary theories, relativity 
and quantum mechanics. These theories deal with the very fast and the very 
small, respectively. Relativity must be used whenever an object is traveling 
at greater than about 1% of the speed of light or experiences a strong 
gravitational field such as that near the Sun. Quantum mechanics must be 
used for objects smaller than can be seen with a microscope. The 
combination of these two theories is relativistic quantum mechanics, and it 
describes the behavior of small objects traveling at high speeds or 
experiencing a strong gravitational field. Relativistic quantum mechanics is 
the best universally applicable theory we have. Because of its mathematical 
complexity, it is used only when necessary, and the other theories are used 
whenever they will produce sufficiently accurate results. We will find, 
however, that we can do a great deal of modern physics with the algebra 
and trigonometry used in this text. 

Exercise: 

Check Your Understanding 


Problem: 


A friend tells you he has learned about a new law of nature. What can 
you know about the information even before your friend describes the 
law? How would the information be different if your friend told you he 
had learned about a scientific theory rather than a law? 


Solution: 


Without knowing the details of the law, you can still infer that the 
information your friend has learned conforms to the requirements of 
all laws of nature: it will be a concise description of the universe 
around us; a statement of the underlying rules that all natural processes 
follow. If the information had been a theory, you would be able to infer 
that the information will be a large-scale, broadly applicable 
generalization. 


Note: 

PhET Explorations: Equation Grapher 

Learn about graphing polynomials. The shape of the curve changes as the 
constants are adjusted. View the curves for the individual terms (e.g. 

y = bz) to see how they add to generate the polynomial curve. 


Summary 


e Science seeks to discover and describe the underlying order and 
simplicity in nature. 

e Physics is the most basic of the sciences, concerning itself with energy, 
matter, space and time, and their interactions. 

e Scientific laws and theories express the general truths of nature and the 
body of knowledge they encompass. These laws of nature are rules 
that all natural processes appear to follow. 


Conceptual Questions 


Exercise: 
Problem: 
Models are particularly useful in relativity and quantum mechanics, 


where conditions are outside those normally encountered by humans. 
What is a model? 


Exercise: 


Problem: How does a model differ from a theory? 
Exercise: 
Problem: 
If two different theories describe experimental observations equally 


well, can one be said to be more valid than the other (assuming both 
use accepted rules of logic)? 


Exercise: 


Problem: What determines the validity of a theory? 
Exercise: 
Problem: 
Certain criteria must be satisfied if a measurement or observation is to 


be believed. Will the criteria necessarily be as strict for an expected 
result as for an unexpected result? 


Exercise: 
Problem: 
Can the validity of a model be limited, or must it be universally valid? 
How does this compare to the required validity of a theory or a law? 


Exercise: 


Problem: 


Classical physics is a good approximation to modern physics under 
certain circumstances. What are they? 


Exercise: 


Problem: When is it necessary to use relativistic quantum mechanics? 
Exercise: 


Problem: 


Can classical physics be used to accurately describe a satellite moving 
at a speed of 7500 m/s? Explain why or why not. 


Glossary 


classical physics 
physics that was developed from the Renaissance to the end of the 
19th century 


physics 
the science concerned with describing the interactions of energy, 
matter, space, and time; it is especially interested in what fundamental 
mechanisms underlie every phenomenon 


model 
representation of something that is often too difficult (or impossible) to 
display directly 


theory 
an explanation for patterns in nature that is supported by scientific 
evidence and verified multiple times by various groups of researchers 


law 
a description, using concise language or a mathematical formula, a 
generalized pattern in nature that is supported by scientific evidence 


and repeated experiments 


scientific method 
a method that typically begins with an observation and question that 
the scientist will research; next, the scientist typically performs some 
research about the topic and then devises a hypothesis; then, the 
scientist will test the hypothesis by performing an experiment; finally, 
the scientist analyzes the results of the experiment and draws a 
conclusion 


modern physics 
the study of relativity, quantum mechanics, or both 


relativity 
the study of objects moving at speeds greater than about 1% of the 
speed of light, or of objects being affected by a strong gravitational 
field 


quantum mechanics 
the study of objects smaller than can be seen with a microscope 


Physical Quantities and Units 


e Perform unit conversions both in the SI and English units. 
e Explain the most common prefixes in the SI units and be able to write them in scientific notation. 


The distance from Earth to the 


Moon may seem immense, but 
it is just a tiny fraction of the 
distances from Earth to other 

celestial bodies. (credit: NASA) 


The range of objects and phenomena studied in physics is immense. From the incredibly short lifetime of 
a nucleus to the age of the Earth, from the tiny sizes of sub-nuclear particles to the vast distance to the 
edges of the known universe, from the force exerted by a jumping flea to the force between Earth and the 
Sun, there are enough factors of 10 to challenge the imagination of even the most experienced scientist. 
Giving numerical values for physical quantities and equations for physical principles allows us to 
understand nature much more deeply than does qualitative description alone. To comprehend these vast 
ranges, we must also have accepted units in which to express them. And we shall find that (even in the 
potentially mundane discussion of meters, kilograms, and seconds) a profound simplicity of nature 
appears—all physical quantities can be expressed as combinations of only four fundamental physical 
quantities: length, mass, time, and electric current. 


We define a physical quantity either by specifying how it is measured or by stating how it is calculated 
from other measurements. For example, we define distance and time by specifying methods for 
measuring them, whereas we define average speed by stating that it is calculated as distance traveled 
divided by time of travel. 


Measurements of physical quantities are expressed in terms of units, which are standardized values. For 
example, the length of a race, which is a physical quantity, can be expressed in units of meters (for 
sprinters) or kilometers (for distance runners). Without standardized units, it would be extremely difficult 
for scientists to express and compare measured values in a meaningful way. (See [link].) 


a 


| wonder 
how big 
a cable is? 


.. oA 
San oe tines 
‘ 


Distances given in 
unknown units are 
maddeningly useless. 


There are two major systems of units used in the world: SI units (also known as the metric system) and 
English units (also known as the customary or imperial system). English units were historically used in 
nations once ruled by the British Empire and are still widely used in the United States. Virtually every 
other country in the world now uses SI units as the standard; the metric system is also the standard system 
agreed upon by scientists and mathematicians. The acronym “SI” is derived from the French Systéme 
International. 


SI Units: Fundamental and Derived Units 


[link] gives the fundamental SI units that are used throughout this textbook. This text uses non-SI units in 
a few applications where they are in very common use, such as the measurement of blood pressure in 
millimeters of mercury (mm Hg). Whenever non-SI units are discussed, they will be tied to SI units 
through conversions. 


Length Mass Time Electric Current 
meter (m) kilogram (kg) second (s) ampere (A) 
Fundamental SI Units 


It is an intriguing fact that some physical quantities are more fundamental than others and that the most 
fundamental physical quantities can be defined only in terms of the procedure used to measure them. The 
units in which they are measured are thus called fundamental units. In this textbook, the fundamental 
physical quantities are taken to be length, mass, time, and electric current. (Note that electric current will 
not be introduced until much later in this text.) All other physical quantities, such as force and electric 
charge, can be expressed as algebraic combinations of length, mass, time, and current (for example, speed 
is length divided by time); these units are called derived units. 


Units of Time, Length, and Mass: The Second, Meter, and Kilogram 


The Second 


The SI unit for time, the second(abbreviated s), has a long history. For many years it was defined as 
1/86,400 of a mean solar day. More recently, a new standard was adopted to gain greater accuracy and to 
define the second in terms of a non-varying, or constant, physical phenomenon (because the solar day is 
getting longer due to very gradual slowing of the Earth’s rotation). Cesium atoms can be made to vibrate 
in a very steady way, and these vibrations can be readily observed and counted. In 1967 the second was 
redefined as the time required for 9,192,631,770 of these vibrations. (See [link].) Accuracy in the 
fundamental units is essential, because all measurements are ultimately expressed in terms of 
fundamental units and can be no more accurate than are the fundamental units themselves. 


An atomic clock such 
as this one uses the 
vibrations of cesium 


atoms to keep time to 
a precision of better 
than a microsecond 
per year. The 
fundamental unit of 
time, the second, is 
based on such clocks. 
This image is looking 
down from the top of 
an atomic fountain 
nearly 30 feet tall! 
(credit: Steve 
Jurvetson/Flickr) 


The Meter 


The SI unit for length is the meter (abbreviated m); its definition has also changed over time to become 
more accurate and precise. The meter was first defined in 1791 as 1/10,000,000 of the distance from the 
equator to the North Pole. This measurement was improved in 1889 by redefining the meter to be the 
distance between two engraved lines on a platinum-iridium bar now kept near Paris. By 1960, it had 
become possible to define the meter even more accurately in terms of the wavelength of light, so it was 
again redefined as 1,650,763.73 wavelengths of orange light emitted by krypton atoms. In 1983, the 
meter was given its present definition (partly for greater accuracy) as the distance light travels in a 
vacuum in 1/299,792,458 of a second. (See [link].) This change defines the speed of light to be exactly 
299,792,458 meters per second. The length of the meter will change if the speed of light is someday 
measured with greater accuracy. 


The Kilogram 


The SI unit for mass is the kilogram (abbreviated kg); it is defined to be the mass of a platinum-iridium 
cylinder kept with the old meter standard at the International Bureau of Weights and Measures near Paris. 
Exact replicas of the standard kilogram are also kept at the United States’ National Institute of Standards 


and Technology, or NIST, located in Gaithersburg, Maryland outside of Washington D.C., and at other 
locations around the world. The determination of all other masses can be ultimately traced to a 
comparison with the standard mass. 


OEE 
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Light travels a distance of 1 meter 
in 1/299,792,458 seconds 


The meter is defined to be the distance light 
travels in 1/299,792,458 of a second ina 
vacuum. Distance traveled is speed 
multiplied by time. 


Electric current and its accompanying unit, the ampere, will be introduced in Introduction to Electric 
Current, Resistance, and Ohm's Law when electricity and magnetism are covered. The initial modules in 
this textbook are concerned with mechanics, fluids, heat, and waves. In these subjects all pertinent 
physical quantities can be expressed in terms of the fundamental units of length, mass, and time. 


Metric Prefixes 


SI units are part of the metric system. The metric system is convenient for scientific and engineering 
calculations because the units are categorized by factors of 10. [link] gives metric prefixes and symbols 
used to denote various factors of 10. 


Metric systems have the advantage that conversions of units involve only powers of 10. There are 100 
centimeters in a meter, 1000 meters in a kilometer, and so on. In nonmetric systems, such as the system of 
U.S. customary units, the relationships are not as simple—there are 12 inches in a foot, 5280 feet in a 
mile, and so on. Another advantage of the metric system is that the same unit can be used over extremely 
large ranges of values simply by using an appropriate metric prefix. For example, distances in meters are 
suitable in construction, while distances in kilometers are appropriate for air travel, and the tiny measure 
of nanometers are convenient in optical design. With the metric system there is no need to invent new 
units for particular applications. 


The term order of magnitude refers to the scale of a value expressed in the metric system. Each power 
of 10 in the metric system represents a different order of magnitude. For example, 10', 107, 10°, and so 
forth are all different orders of magnitude. All quantities that can be expressed as a product of a specific 
power of 10 are said to be of the same order of magnitude. For example, the number 800 can be written 
as 8 x 102, and the number 450 can be written as 4.5 x 10. Thus, the numbers 800 and 450 are of the 
same order of magnitude: 10”. Order of magnitude can be thought of as a ballpark estimate for the scale 
of a value. The diameter of an atom is on the order of 10~° m, while the diameter of the Sun is on the 
order of 10° m. 


Note: 
The Quest for Microscopic Standards for Basic Units 


The fundamental units described in this chapter are those that produce the greatest accuracy and 
precision in measurement. There is a sense among physicists that, because there is an underlying 
microscopic substructure to matter, it would be most satisfying to base our standards of measurement on 
microscopic objects and fundamental physical phenomena such as the speed of light. A microscopic 
standard has been accomplished for the standard of time, which is based on the oscillations of the cesium 
atom. 

The standard for length was once based on the wavelength of light (a small-scale length) emitted by a 
certain type of atom, but it has been supplanted by the more precise measurement of the speed of light. If 
it becomes possible to measure the mass of atoms or a particular arrangement of atoms such as a silicon 
sphere to greater precision than the kilogram standard, it may become possible to base mass 
measurements on the small scale. There are also possibilities that electrical phenomena on the small 
scale may someday allow us to base a unit of charge on the charge of electrons and protons, but at 
present current and charge are related to large-scale currents and forces between wires. 


Value|[footnote] 
See Appendix 
A fora 
discussion of 
Prefix Symbol powers of 10. Example (some are approximate) 
distance 
exa E 1018 exameter Em 1018 m light travels 
in a century 
30 million 
15 15 
peta P 10 petasecond Ps 10155 an 
12 2 powerful 
tera T 10 terawatt TW 10/4 W laser output 
a 
giga G 10° gigahertz GHz 10° Hz microwave 
frequency 
: : high 
6 6: 
mega M 10 megacurie MCi 10° Ci radioactivity 
F 3 : 3 about 6/10 
kilo k 10 kilometer km 10° m scale 


hecto h 102 hectoliter hL 107 L 26 gallons 


Value[footnote] 


See Appendix 
A fora 
discussion of 
Prefix Symbol powers of 10. Example (some are approximate) 
1 1 teaspoon of 
deka da 10 dekagram dag 10° ¢g Hiker 
= _ 10° 
(=1) 
deci d 1071 deciliter dL 10! L less than 
half a soda 
. = ; _ fingertip 
2 2 
centi c 10 centimeter cm 10-2 m iarleacee 
milli m 1073 millimeter mm 10-3 m flea at its 
shoulders 
micro 10-6 micrometer m 10-6 geal ae 
a B ta microscope 
nano n 10-9 nanogram ng 105? ee aa 
small 
pico p 10°" picofarad pF 10°" F capacitor in 
radio 
15 45 size of a 
femto f 10 femtometer fm 10-7 m proron 
time light 
atto a 10-18 attosecond as 10-8 s crosses an 
atom 


Metric Prefixes for Powers of 10 and their Symbols 


Known Ranges of Length, Mass, and Time 


The vastness of the universe and the breadth over which physics applies are illustrated by the wide range 
of examples of known lengths, masses, and times in [link]. Examination of this table will give you some 


feeling for the range of possible topics and numerical values. (See [link] and [link].) 


Tiny phytoplankton 
swims among crystals of 
ice in the Antarctic Sea. 
They range from a few 
micrometers to as much 

as 2 millimeters in length. 
(credit: Prof. Gordon T. 
Taylor, Stony Brook 
University; NOAA Corps 
Collections) 


Galaxies collide 2.4 
billion light years away 
from Earth. The 
tremendous range of 
observable phenomena in 
nature challenges the 
imagination. (credit: 
NASA/CXC/UVic./A. 
Mahdavi et al. 
Optical/lensing: 
CFHT/UVic./H. Hoekstra 
et al.) 


Unit Conversion and Dimensional Analysis 


It is often necessary to convert from one type of unit to another. For example, if you are reading a 
European cookbook, some quantities may be expressed in units of liters and you need to convert them to 
cups. Or, perhaps you are reading walking directions from one location to another and you are interested 
in how many miles you will be walking. In this case, you will need to convert units of feet to miles. 


Let us consider a simple example of how to convert units. Let us say that we want to convert 80 meters 
(m) to kilometers (km). 


The first thing to do is to list the units that you have and the units that you want to convert to. In this case, 
we have units in meters and we want to convert to kilometers. 


Next, we need to determine a conversion factor relating meters to kilometers. A conversion factor is a 
ratio expressing how many of one unit are equal to another unit. For example, there are 12 inches in 1 
foot, 100 centimeters in 1 meter, 60 seconds in 1 minute, and so on. In this case, we know that there are 
1,000 meters in 1 kilometer. 


Now we can set up our unit conversion. We will write the units that we have and then multiply them by 
the conversion factor so that the units cancel out, as shown: 
Equation: 


1k 
80 af x ———— = 0.080 km. 


1000 mx 


Note that the unwanted m unit cancels, leaving only the desired km unit. You can use this method to 
convert between any types of unit. 


Click [link] for a more complete list of conversion factors. 


Masses in kilograms (more Times in seconds (more 
precise values in precise values in 
Lengths in meters parentheses) parentheses) 
Present : : 
10-18 experimental limit 2a Mass of an electron 43 nae for light 
to smallest 10 (9.11 x 107-3! kg) 10 ei . 
observable detail P 
Mass of a hydrogen Mean life of an 
10-15 Diameter of a 10-2? air 10-22 extremely 
proton (1. 67 x 10-27 kg) unstable 


nucleus 


Lengths in meters 


Diameter of a 


-14 
10 uranium nucleus 
10-10 Diameter of a 
hydrogen atom 
Thickness of 
10-8 membranes in cells 
of living organisms 
10-8 Wavelength of 
visible light 
Size of a grain of 
-3 
10 sand 
1 Height of a 4-year- 
old child 
102 Length of a football 
c field 
104 Greatest ocean 
° depth 
107 Diameter of the 
0 Earth 
104 Distance from the 


Earth to the Sun 


Distance traveled 
1016 by light in 1 year (a 
light year) 


Diameter of the 


10s 
Milky Way galaxy 


Masses in kilograms (more 
precise values in 


parentheses) 

10-15 Mass of a 
bacterium 

10° Mass of a mosquito 

10-2 Mass of a 
hummingbird 


Mass of a liter of 


1 water (about a 
quart) 
102 Mass of a person 
103 Mass of a car 


Mass of a large 


8 
19 ship 
12 Mass of a large 
10 iceberg 
1015 Mass of the nucleus 


of a comet 


Ba Mass of the Moon 
£0 (7.35 x 10” kg) 


ae Mass of the Earth 
+0 (5.97 x 1074 kg) 


$6 Mass of the Sun 
(1.99 x 10°? kg) 


Times in seconds (more 
precise values in 
parentheses) 


Time for one 
19715 oscillation of 
visible light 


Time for one 
19078 vibration of an 
atom in a solid 


Time for one 
oscillation of 
an FM radio 
wave 


10-8 


Duration of a 
nerve impulse 


Time for one 
heartbeat 


One day 
10° (8.64 x 104s) 


One year (y) 
7 
EO (3.16 x 10’ s) 


About half the 
10° life expectancy 
of a human 


Recorded 


1 
10 history 


Age of the 


17 
10 Earth 


Age of the 
universe 


10"* 


Masses in kilograms (more Times in seconds (more 


precise values in precise values in 
Lengths in meters parentheses) parentheses) 
Distance from the Mass of the Milky 
102 Earth to the nearest 10” Way galaxy 
large galaxy (current upper 
(Andromeda) limit) 
Distance from the Mace OERe Lao ani 
1026 Earth to the edges 10° universe (current 
of the known ne 
upper limit) 


universe 


Approximate Values of Length, Mass, and Time 


Example: 

Unit Conversions: A Short Drive Home 

Suppose that you drive the 10.0 km from your university to home in 20.0 min. Calculate your average 
speed (a) in kilometers per hour (km/h) and (b) in meters per second (m/s). (Note: Average speed is 
distance traveled divided by time of travel.) 

Strategy 

First we calculate the average speed using the given units. Then we can get the average speed into the 
desired units by picking the correct conversion factor and multiplying by it. The correct conversion 
factor is the one that cancels the unwanted unit and leaves the desired unit in its place. 

Solution for (a) 

(1) Calculate average speed. Average speed is distance traveled divided by time of travel. (Take this 
definition as a given for now—average speed and other motion concepts will be covered in a later 
module.) In equation form, 


Equation: 
distance 
average speed =———_—__.. 
time 

(2) Substitute the given values for distance and time. 
Equation: 

10.0 km km 

average speed =—————. = 0.500 — 
20.0 min min 


(3) Convert km/min to km/h: multiply by the conversion factor that will cancel minutes and leave hours. 
That conversion factor is 60 min/hr. Thus, 
Equation: 


km 60 min km 
average speed =0.500—— x = 30.0 : 
min 1h h 


Discussion for (a) 
To check your answer, consider the following: 


(1) Be sure that you have properly cancelled the units in the unit conversion. If you have written the unit 
conversion factor upside down, the units will not cancel properly in the equation. If you accidentally get 
the ratio upside down, then the units will not cancel; rather, they will give you the wrong units as 
follows: 

Equation: 


km 1 hr 1 km-hr 


. x . = 2 b) 
min 60 min 60 min 


which are obviously not the desired units of km/h. 

(2) Check that the units of the final answer are the desired units. The problem asked us to solve for 
average speed in units of km/h and we have indeed obtained these units. 

(3) Check the significant figures. Because each of the values given in the problem has three significant 
figures, the answer should also have three significant figures. The answer 30.0 km/hr does indeed have 
three significant figures, so this is appropriate. Note that the significant figures in the conversion factor 
are not relevant because an hour is defined to be 60 minutes, so the precision of the conversion factor is 
perfect. 

(4) Next, check whether the answer is reasonable. Let us consider some information from the problem— 
if you travel 10 km in a third of an hour (20 min), you would travel three times that far in an hour. The 
answer does seem reasonable. 

Solution for (b) 

There are several ways to convert the average speed into meters per second. 

(1) Start with the answer to (a) and convert km/h to m/s. Two conversion factors are needed—one to 
convert hours to seconds, and another to convert kilometers to meters. 

(2) Multiplying by these yields 

Equation: 


km 1h 1,000m 
h 3600s. 1km ’ 


Average speed = 30.0 


Equation: 
Average speed = —— 
s 


Discussion for (b) 

If we had started with 0.500 km/min, we would have needed different conversion factors, but the answer 
would have been the same: 8.33 m/s. 

You may have noted that the answers in the worked example just covered were given to three digits. 
Why? When do you need to be concerned about the number of digits in something you calculate? Why 
not write down all the digits your calculator produces? The module Accuracy, Precision, and Significant 
Figures will help you answer these questions. 


Note: 

Nonstandard Units 

While there are numerous types of units that we are all familiar with, there are others that are much more 
obscure. For example, a firkin is a unit of volume that was once used to measure beer. One firkin equals 
about 34 liters. To learn more about nonstandard units, use a dictionary or encyclopedia to research 
different “weights and measures.” Take note of any unusual units, such as a barleycorn, that are not listed 
in the text. Think about how the unit is defined and state its relationship to SI units. 


Exercise: 
Check Your Understanding 


Problem: 


Some hummingbirds beat their wings more than 50 times per second. A scientist is measuring the 
time it takes for a hummingbird to beat its wings once. Which fundamental unit should the scientist 
use to describe the measurement? Which factor of 10 is the scientist likely to use to describe the 
motion precisely? Identify the metric prefix that corresponds to this factor of 10. 


Solution: 


The scientist will measure the time between each movement using the fundamental unit of seconds. 
Because the wings beat so fast, the scientist will probably need to measure in milliseconds, or 10~° 
seconds. (50 beats per second corresponds to 20 milliseconds per beat.) 


Exercise: 
Check Your Understanding 


Problem: 


One cubic centimeter is equal to one milliliter. What does this tell you about the different units in the 
SI metric system? 


Solution: 


The fundamental unit of length (meter) is probably used to create the derived unit of volume (liter). 
The measure of a milliliter is dependent on the measure of a centimeter. 


Summary 


e Physical quantities are a characteristic or property of an object that can be measured or calculated 
from other measurements. 

e Units are standards for expressing and comparing the measurement of physical quantities. All units 
can be expressed as combinations of four fundamental units. 

e The four fundamental units we will use in this text are the meter (for length), the kilogram (for 
mass), the second (for time), and the ampere (for electric current). These units are part of the metric 
system, which uses powers of 10 to relate quantities over the vast ranges encountered in nature. 

e The four fundamental units are abbreviated as follows: meter, m; kilogram, kg; second, s; and 
ampere, A. The metric system also uses a standard set of prefixes to denote each order of magnitude 
greater than or lesser than the fundamental unit itself. 

e Unit conversions involve changing a value expressed in one type of unit to another type of unit. This 
is done by using conversion factors, which are ratios relating equal quantities of different units. 


Conceptual Questions 


Exercise: 


Problem: Identify some advantages of metric units. 


Problems & Exercises 


Exercise: 


Problem: 


The speed limit on some interstate highways is roughly 100 km/h. (a) What is this in meters per 
second? (b) How many miles per hour is this? 


Solution: 


a. 27.8 m/s 
b. 62.1 mph 


Exercise: 
Problem: 
A car is traveling at a speed of 33 m/s. (a) What is its speed in kilometers per hour? (b) Is it 
exceeding the 90 km/h speed limit? 
Exercise: 
Problem: 


Show that 1.0 m/s = 3.6 km/h. Hint: Show the explicit steps involved in converting 
1.0 m/s = 3.6 km/h. 


Solution: 
1.0m 5 1.0m x 3600s lkm 
s s lhr 1000 m 
= 3.6 km/h. 
Exercise: 
Problem: 


American football is played on a 100-yd-long field, excluding the end zones. How long is the field in 
meters? (Assume that 1 meter equals 3.281 feet.) 


Exercise: 
Problem: 


Soccer fields vary in size. A large soccer field is 115 m long and 85 m wide. What are its dimensions 
in feet and inches? (Assume that 1 meter equals 3.281 feet.) 


Solution: 


length: 377 ft; 4.53 x 10° in. width: 280 ft; 3.3 x 10° in. 
Exercise: 
Problem: 
What is the height in meters of a person who is 6 ft 1.0 in. tall? (Assume that 1 meter equals 39.37 
in.) 


Exercise: 


Problem: 


Mount Everest, at 29,028 feet, is the tallest mountain on the Earth. What is its height in kilometers? 
(Assume that 1 kilometer equals 3,281 feet.) 


Solution: 


8.847 km 


Exercise: 


Problem: The speed of sound is measured to be 342 m/s on a certain day. What is this in km/h? 
Exercise: 

Problem: 

Tectonic plates are large segments of the Earth’s crust that move slowly. Suppose that one such plate 


has an average speed of 4.0 cm/year. (a) What distance does it move in 1 s at this speed? (b) What is 
its speed in kilometers per million years? 


Solution: 

(a) 1.3 x 10°°m 

(b) 40 km/My 
Exercise: 


Problem: 


(a) Refer to [link] to determine the average distance between the Earth and the Sun. Then calculate 
the average speed of the Earth in its orbit in kilometers per second. (b) What is this in meters per 
second? 


Glossary 


physical quantity 
a characteristic or property of an object that can be measured or calculated from other measurements 


units 
a standard used for expressing and comparing measurements 


SI units 
the international system of units that scientists in most countries have agreed to use; includes units 
such as meters, liters, and grams 


English units 
system of measurement used in the United States; includes units of measurement such as feet, 
gallons, and pounds 


fundamental units 
units that can only be expressed relative to the procedure used to measure them 


derived units 
units that can be calculated using algebraic combinations of the fundamental units 


second 
the SI unit for time, abbreviated (s) 


meter 
the SI unit for length, abbreviated (m) 


kilogram 
the SI unit for mass, abbreviated (kg) 


metric system 
a system in which values can be calculated in factors of 10 


order of magnitude 
refers to the size of a quantity as it relates to a power of 10 


conversion factor 
a ratio expressing how many of one unit are equal to another unit 


Accuracy, Precision, and Significant Figures 


e Determine the appropriate number of significant figures in both 
addition and subtraction, as well as multiplication and division 
calculations. 

e Calculate the percent uncertainty of a measurement. 


A double-pan mechanical balance is 
used to compare different masses. 
Usually an object with unknown mass 
is placed in one pan and objects of 
known mass are placed in the other 
pan. When the bar that connects the 
two pans is horizontal, then the 
masses in both pans are equal. The 
“known masses” are typically metal 
cylinders of standard mass such as 1 
gram, 10 grams, and 100 grams. 
(credit: Serge Melki) 


Many mechanical balances, 
such as double-pan balances, 
have been replaced by digital 

scales, which can typically 
measure the mass of an object 

more precisely. Whereas a 
mechanical balance may only 
read the mass of an object to 
the nearest tenth of a gram, 
many digital scales can 
measure the mass of an object 
up to the nearest thousandth 
of a gram. (credit: Karel 
Jakubec) 


Accuracy and Precision of a Measurement 


Science is based on observation and experiment—that is, on measurements. 
Accuracy is how close a measurement is to the correct value for that 
measurement. For example, let us say that you are measuring the length of 
standard computer paper. The packaging in which you purchased the paper 
States that it is 11.0 inches long. You measure the length of the paper three 
times and obtain the following measurements: 11.1 in., 11.2 in., and 10.9 in. 


These measurements are quite accurate because they are very close to the 
correct value of 11.0 inches. In contrast, if you had obtained a measurement 
of 12 inches, your measurement would not be very accurate. 


The precision of a measurement system is refers to how close the 
agreement is between repeated measurements (which are repeated under the 
same conditions). Consider the example of the paper measurements. The 
precision of the measurements refers to the spread of the measured values. 
One way to analyze the precision of the measurements would be to 
determine the range, or difference, between the lowest and the highest 
measured values. In that case, the lowest value was 10.9 in. and the highest 
value was 11.2 in. Thus, the measured values deviated from each other by at 
most 0.3 in. These measurements were relatively precise because they did 
not vary too much in value. However, if the measured values had been 10.9, 
11.1, and 11.9, then the measurements would not be very precise because 
there would be significant variation from one measurement to another. 


The measurements in the paper example are both accurate and precise, but 
in some cases, measurements are accurate but not precise, or they are 
precise but not accurate. Let us consider an example of a GPS system that is 
attempting to locate the position of a restaurant in a city. Think of the 
restaurant location as existing at the center of a bull’s-eye target, and think 
of each GPS attempt to locate the restaurant as a black dot. In [link], you 
can see that the GPS measurements are spread out far apart from each other, 
but they are all relatively close to the actual location of the restaurant at the 
center of the target. This indicates a low precision, high accuracy measuring 
system. However, in [link], the GPS measurements are concentrated quite 
closely to one another, but they are far away from the target location. This 
indicates a high precision, low accuracy measuring system. 


A GPS system 
attempts to 
locate a 
restaurant at the 
center of the 
bull’s-eye. The 
black dots 
represent each 
attempt to 
pinpoint the 
location of the 
restaurant. The 
dots are spread 
out quite far 
apart from one 
another, 
indicating low 
precision, but 
they are each 
rather close to 
the actual 
location of the 
restaurant, 
indicating high 
accuracy. 
(credit: Dark 
Evil) 


In this figure, 
the dots are 
concentrated 
rather closely to 
one another, 
indicating high 
precision, but 
they are rather 
far away from 
the actual 
location of the 
restaurant, 
indicating low 
accuracy. 
(credit: Dark 
Evil) 


Accuracy, Precision, and Uncertainty 


The degree of accuracy and precision of a measuring system are related to 
the uncertainty in the measurements. Uncertainty is a quantitative measure 
of how much your measured values deviate from a standard or expected 
value. If your measurements are not very accurate or precise, then the 


uncertainty of your values will be very high. In more general terms, 
uncertainty can be thought of as a disclaimer for your measured values. For 
example, if someone asked you to provide the mileage on your car, you 
might say that it is 45,000 miles, plus or minus 500 miles. The plus or 
minus amount is the uncertainty in your value. That is, you are indicating 
that the actual mileage of your car might be as low as 44,500 miles or as 
high as 45,500 miles, or anywhere in between. All measurements contain 
some amount of uncertainty. In our example of measuring the length of the 
paper, we might say that the length of the paper is 11 in., plus or minus 0.2 
in. The uncertainty in a measurement, A, is often denoted as 6A (“delta A 
”), so the measurement result would be recorded as A + 6A. In our paper 
example, the length of the paper could be expressed as 11 in.+0.2. 


The factors contributing to uncertainty in a measurement include: 


1. Limitations of the measuring device, 

2. The skill of the person making the measurement, 

3. Irregularities in the object being measured, 

4. Any other factors that affect the outcome (highly dependent on the 
situation). 


In our example, such factors contributing to the uncertainty could be the 
following: the smallest division on the ruler is 0.1 in., the person using the 
ruler has bad eyesight, or one side of the paper is slightly longer than the 
other. At any rate, the uncertainty in a measurement must be based on a 
careful consideration of all the factors that might contribute and their 
possible effects. 


Note: 

Making Connections: Real-World Connections — Fevers or Chills? 
Uncertainty is a critical piece of information, both in physics and in many 
other real-world applications. Imagine you are caring for a sick child. You 
suspect the child has a fever, so you check his or her temperature with a 
thermometer. What if the uncertainty of the thermometer were 3.0°C? If 
the child’s temperature reading was 37.0°C (which is normal body 
temperature), the “true” temperature could be anywhere from a 


hypothermic 34.0°C to a dangerously high 40.0°C. A thermometer with an 
uncertainty of 3.0°C would be useless. 


Percent Uncertainty 


One method of expressing uncertainty is as a percent of the measured value. 
If a measurement A is expressed with uncertainty, 6A, the percent 
uncertainty (Y%ounc) is defined to be 

Equation: 


A 
% unc =< x 100%. 


Example: 

Calculating Percent Uncertainty: A Bag of Apples 

A grocery store sells 5-lb bags of apples. You purchase four bags over the 
course of a month and weigh the apples each time. You obtain the 
following measurements: 


Week 1 weight: 4.8 lb 
Week 2 weight: 5.3 lb 
Week 3 weight: 4.9 lb 
Week 4 weight: 5.4 lb 


You determine that the weight of the 5-lb bag has an uncertainty of 

+0.4 lb. What is the percent uncertainty of the bag’s weight? 

Strategy 

First, observe that the expected value of the bag’s weight, A, is 5 lb. The 
uncertainty in this value, 6A, is 0.4 lb. We can use the following equation 
to determine the percent uncertainty of the weight: 

Equation: 


A 
% unc au x 100%. 


A 
Solution 
Plug the known values into the equation: 
Equation: 
0.4 lb 
= x 100% = 8%. 

% unc =n % = 8% 

Discussion 


We can conclude that the weight of the apple bag is 5 lb + 8%. Consider 
how this percent uncertainty would change if the bag of apples were half as 
heavy, but the uncertainty in the weight remained the same. Hint for future 
calculations: when calculating percent uncertainty, always remember that 
you must multiply the fraction by 100%. If you do not do this, you will 
have a decimal quantity, not a percent value. 


Uncertainties in Calculations 


There is an uncertainty in anything calculated from measured quantities. 
For example, the area of a floor calculated from measurements of its length 
and width has an uncertainty because the length and width have 
uncertainties. How big is the uncertainty in something you calculate by 
multiplication or division? If the measurements going into the calculation 
have small uncertainties (a few percent or less), then the method of adding 
percents can be used for multiplication or division. This method says that 
the percent uncertainty in a quantity calculated by multiplication or 
division is the sum of the percent uncertainties in the items used to make the 
calculation. For example, if a floor has a length of 4.00 m and a width of 
3.00 m, with uncertainties of 2% and 1%, respectively, then the area of the 
floor is 12.0 m? and has an uncertainty of 3%. (Expressed as an area this is 
0.36 m2, which we round to 0.4 m? since the area of the floor is given toa 
tenth of a square meter.) 

Exercise: 

Check Your Understanding 


Problem: 


A high school track coach has just purchased a new stopwatch. The 
stopwatch manual states that the stopwatch has an uncertainty of 
+0.05 s. Runners on the track coach’s team regularly clock 100-m 
sprints of 11.49 s to 15.01 s. At the school’s last track meet, the first- 
place sprinter came in at 12.04 s and the second-place sprinter came in 
at 12.07 s. Will the coach’s new stopwatch be helpful in timing the 
sprint team? Why or why not? 


Solution: 


No, the uncertainty in the stopwatch is too great to effectively 
differentiate between the sprint times. 


Precision of Measuring Tools and Significant Figures 


An important factor in the accuracy and precision of measurements 
involves the precision of the measuring tool. In general, a precise measuring 
tool is one that can measure values in very small increments. For example, a 
standard ruler can measure length to the nearest millimeter, while a caliper 
can measure length to the nearest 0.01 millimeter. The caliper is a more 
precise measuring tool because it can measure extremely small differences 
in length. The more precise the measuring tool, the more precise and 
accurate the measurements can be. 


When we express measured values, we can only list as many digits as we 
initially measured with our measuring tool. For example, if you use a 
standard ruler to measure the length of a stick, you may measure it to be 
36.7 cm. You could not express this value as 36.71 cm because your 
measuring tool was not precise enough to measure a hundredth of a 
centimeter. It should be noted that the last digit in a measured value has 
been estimated in some way by the person performing the measurement. 
For example, the person measuring the length of a stick with a ruler notices 
that the stick length seems to be somewhere in between 36.6 cm and 

36.7 cm, and he or she must estimate the value of the last digit. Using the 


method of significant figures, the rule is that the last digit written down in 
a measurement is the first digit with some uncertainty. In order to determine 
the number of significant digits in a value, start with the first measured 
value at the left and count the number of digits through the last digit written 
on the right. For example, the measured value 36.7 cm has three digits, or 
significant figures. Significant figures indicate the precision of a measuring 
tool that was used to measure a value. 


Zeros 


Special consideration is given to zeros when counting significant figures. 
The zeros in 0.053 are not significant, because they are only placekeepers 
that locate the decimal point. There are two significant figures in 0.053. The 
zeros in 10.053 are not placekeepers but are significant—this number has 
five significant figures. The zeros in 1300 may or may not be significant 
depending on the style of writing numbers. They could mean the number is 
known to the last digit, or they could be placekeepers. So 1300 could have 
two, three, or four significant figures. (To avoid this ambiguity, write 1300 
in scientific notation.) Zeros are significant except when they serve only as 
placekeepers. 

Exercise: 

Check Your Understanding 


Problem: 


Determine the number of significant figures in the following 
measurements: 


a. 0.0009 
b. 15,450.0 
c.6 x 103 
d. 87.990 
e. 30.42 


Solution: 


(a) 1; the zeros in this number are placekeepers that indicate the 
decimal point 


(b) 6; here, the zeros indicate that a measurement was made to the 0.1 
decimal point, so the zeros are significant 


(c) 1; the value 10° signifies the decimal place, not the number of 
measured values 


(d) 5; the final zero indicates that a measurement was made to the 
0.001 decimal point, so it is significant 


(e) 4; any zeros located in between significant figures in a number are 
also significant 


Significant Figures in Calculations 


When combining measurements with different degrees of accuracy and 
precision, the number of significant digits in the final answer can be no 
greater than the number of significant digits in the least precise measured 
value. There are two different rules, one for multiplication and division and 
the other for addition and subtraction, as discussed below. 


1. For multiplication and division: The result should have the same 
number of significant figures as the quantity having the least significant 
figures entering into the calculation. For example, the area of a circle can 
be calculated from its radius using A = mr”. Let us see how many 
significant figures the area has if the radius has only two—-say, r = 1.2 m. 
Then, 

Equation: 


A = nr? = (3.1415927...) x (1.2 m)* = 4.5238934 m? 


is what you would get using a calculator that has an eight-digit output. But 
because the radius has only two significant figures, it limits the calculated 


quantity to two significant figures or 
Equation: 


A=4.5 m?, 


even though 7r is good to at least eight digits. 


2. For addition and subtraction: The answer can contain no more decimal 
places than the least precise measurement. Suppose that you buy 7.56-kg of 
potatoes in a grocery store as measured with a scale with precision 0.01 kg. 

Then you drop off 6.052-kg of potatoes at your laboratory as measured by a 


scale with precision 0.001 kg. Finally, you go home and add 13.7 kg of 


potatoes as measured by a bathroom scale with precision 0.1 kg. How many 


kilograms of potatoes do you now have, and how many significant figures 
are appropriate in the answer? The mass is found by simple addition and 
subtraction: 

Equation: 


7.06 kg 
— 6.052 kg 


413.7 k 
15.208 = = 15.2 kg. 


Next, we identify the least precise measurement: 13.7 kg. This 
measurement is expressed to the 0.1 decimal place, so our final answer 
must also be expressed to the 0.1 decimal place. Thus, the answer is 
rounded to the tenths place, giving us 15.2 kg. 


Significant Figures in this Text 


In this text, most numbers are assumed to have three significant figures. 
Furthermore, consistent numbers of significant figures are used in all 
worked examples. You will note that an answer given to three digits is 
based on input good to at least three digits, for example. If the input has 
fewer significant figures, the answer will also have fewer significant 


figures. Care is also taken that the number of significant figures is 
reasonable for the situation posed. In some topics, particularly in optics, 
more accurate numbers are needed and more than three significant figures 
will be used. Finally, if a number is exact, such as the two in the formula for 
the circumference of a circle, c = 277, it does not affect the number of 
significant figures in a calculation. 

Exercise: 

Check Your Understanding 


Problem: 


Perform the following calculations and express your answer using the 
correct number of significant digits. 


(a) A woman has two bags weighing 13.5 pounds and one bag witha 
weight of 10.2 pounds. What is the total weight of the bags? 


(b) The force F’ on an object is equal to its mass m multiplied by its 
acceleration a. If a wagon with mass 55 kg accelerates at a rate of 
0.0255 m/ s”, what is the force on the wagon? (The unit of force is 
called the newton, and it is expressed with the symbol N.) 


Solution: 


(a) 37.2 pounds; Because the number of bags is an exact value, it is not 
considered in the significant figures. 


(b) 1.4 N; Because the value 55 kg has only two significant figures, the 
final value must also contain two significant figures. 


Note: 

PhET Explorations: Estimation 

Explore size estimation in one, two, and three dimensions! Multiple levels 
of difficulty allow for progressive skill improvement. 
https://phet.colorado.edu/sims/estimation/estimation_en.html 


Summary 


e Accuracy of a measured value refers to how close a measurement is to 
the correct value. The uncertainty in a measurement is an estimate of 
the amount by which the measurement result may differ from this 
value. 

e Precision of measured values refers to how close the agreement is 
between repeated measurements. 

e The precision of a measuring tool is related to the size of its 
measurement increments. The smaller the measurement increment, the 
more precise the tool. 

e Significant figures express the precision of a measuring tool. 

e When multiplying or dividing measured values, the final answer can 
contain only as many significant figures as the least precise value. 

e When adding or subtracting measured values, the final answer cannot 
contain more decimal places than the least precise value. 


Conceptual Questions 


Exercise: 


Problem: 


What is the relationship between the accuracy and uncertainty of a 
measurement? 


Exercise: 


Problem: 


Prescriptions for vision correction are given in units called diopters 
(D). Determine the meaning of that unit. Obtain information (perhaps 
by calling an optometrist or performing an internet search) on the 
minimum uncertainty with which corrections in diopters are 
determined and the accuracy with which corrective lenses can be 
produced. Discuss the sources of uncertainties in both the prescription 
and accuracy in the manufacture of lenses. 


Problems & Exercises 


Express your answers to problems in this section to the correct number 
of significant figures and proper units. 
Exercise: 


Problem: 


Suppose that your bathroom scale reads your mass as 65 kg with a 3% 
uncertainty. What is the uncertainty in your mass (in kilograms)? 


Solution: 


2 kg 
Exercise: 
Problem: 
A good-quality measuring tape can be off by 0.50 cm over a distance 
of 20 m. What is its percent uncertainty? 
Exercise: 
Problem: 
(a) A car speedometer has a 5.0% uncertainty. What is the range of 


possible speeds when it reads 90 km/h? (b) Convert this range to 
miles per hour. (1 km = 0.6214 mi) 


Solution: 
a. 85.5 to 94.5 km/h 
b. 53.1 to 58.7 mi/h 


Exercise: 


Problem: 


An infant’s pulse rate is measured to be 130 + 5 beats/min. What is 
the percent uncertainty in this measurement? 


Exercise: 


Problem: 


(a) Suppose that a person has an average heart rate of 72.0 beats/min. 
How many beats does he or she have in 2.0 y? (b) In 2.00 y? (c) In 
2.000 y? 


Solution: 
(a) 7.6 x 10’ beats 
(b) 7.57 x 10° beats 


(c) 7.57 x 10’ beats 
Exercise: 
Problem: 
A can contains 375 mL of soda. How much is left after 308 mL is 
removed? 
Exercise: 


Problem: 


State how many significant figures are proper in the results of the 
following calculations: (a) (106.7) (98.2) /(46.210)(1.01) (b) (18.7)? 
(c) (1.60 x 10~'°) (3712). 


Solution: 


Aa op 
WWW 


Exercise: 


Problem: 


(a) How many significant figures are in the numbers 99 and 100? (b) If 
the uncertainty in each number is 1, what is the percent uncertainty in 
each? (c) Which is a more meaningful way to express the accuracy of 
these two numbers, significant figures or percent uncertainties? 


Exercise: 
Problem: 
(a) If your speedometer has an uncertainty of 2.0 km / h at a speed of 
90 km i h, what is the percent uncertainty? (b) If it has the same 


percent uncertainty when it reads 60 km/h, what is the range of 
speeds you could be going? 


Solution: 
a) 2.2% 


(b) 59 to 61 km/h 
Exercise: 
Problem: 
(a) A person’s blood pressure is measured to be 120 + 2 mm Hg. 
What is its percent uncertainty? (b) Assuming the same percent 


uncertainty, what is the uncertainty in a blood pressure measurement of 
80 mm Hg? 


Exercise: 


Problem: 
A person measures his or her heart rate by counting the number of 
beats in 30 s. If 40 + 1 beats are counted in 30.0 + 0.5 s, what is the 


heart rate and its uncertainty in beats per minute? 


Solution: 


80 + 3 beats/min 


Exercise: 


Problem: What is the area of a circle 3.102 cm in diameter? 
Exercise: 
Problem: 


If a marathon runner averages 9.5 mi/h, how long does it take him or 
her to run a 26.22-mi marathon? 


Solution: 


2.8h 

Exercise: 
Problem: 
A marathon runner completes a 42.188-km course in 2 h, 30 min, and 
12 s. There is an uncertainty of 25 m in the distance traveled and an 
uncertainty of 1 s in the elapsed time. (a) Calculate the percent 
uncertainty in the distance. (b) Calculate the uncertainty in the elapsed 


time. (c) What is the average speed in meters per second? (d) What is 
the uncertainty in the average speed? 


Exercise: 


Problem: 


The sides of a small rectangular box are measured to be 
1.80 + 0.01 cm, 2.05 + 0.02 cm, and 3.1 + 0.1 cm long. Calculate 
its volume and uncertainty in cubic centimeters. 


Solution: 


11+1cm?® 


Exercise: 


Problem: 


When non-metric units were used in the United Kingdom, a unit of 
mass called the pound-mass (lbm) was employed, where 

1 lbm = 0.4539 kg. (a) If there is an uncertainty of 0.0001 kg in the 
pound-mass unit, what is its percent uncertainty? (b) Based on that 
percent uncertainty, what mass in pound-mass has an uncertainty of 1 
kg when converted to kilograms? 


Exercise: 
Problem: 
The length and width of a rectangular room are measured to be 


3.955 + 0.005 m and 3.050 + 0.005 m. Calculate the area of the 
room and its uncertainty in square meters. 


Solution: 


12.06 + 0.04 m? 

Exercise: 
Problem: 
A car engine moves a piston with a circular cross section of 
7.500 + 0.002 cm diameter a distance of 3.250 + 0.001 cm to 
compress the gas in the cylinder. (a) By what amount is the gas 


decreased in volume in cubic centimeters? (b) Find the uncertainty in 
this volume. 


Glossary 
accuracy 
the degree to which a measured value agrees with correct value for that 


measurement 


method of adding percents 


the percent uncertainty in a quantity calculated by multiplication or 
division is the sum of the percent uncertainties in the items used to 
make the calculation 


percent uncertainty 
the ratio of the uncertainty of a measurement to the measured value, 
expressed as a percentage 


precision 
the degree to which repeated measurements agree with each other 


significant figures 
express the precision of a measuring tool used to measure a value 


uncertainty 
a quantitative measure of how much your measured values deviate 
from a standard or expected value 


Approximation 
e Make reasonable approximations based on given data. 


On many occasions, physicists, other scientists, and engineers need to make 
approximations or “guesstimates” for a particular quantity. What is the 
distance to a certain destination? What is the approximate density of a given 
item? About how large a current will there be in a circuit? Many 
approximate numbers are based on formulae in which the input quantities 
are known only to a limited accuracy. As you develop problem-solving 
skills (that can be applied to a variety of fields through a study of physics), 
you will also develop skills at approximating. You will develop these skills 
through thinking more quantitatively, and by being willing to take risks. As 
with any endeavor, experience helps, as well as familiarity with units. These 
approximations allow us to rule out certain scenarios or unrealistic 
numbers. Approximations also allow us to challenge others and guide us in 
our approaches to our scientific world. Let us do two examples to illustrate 
this concept. 


Example: 

Approximate the Height of a Building 

Can you approximate the height of one of the buildings on your campus, or 
in your neighborhood? Let us make an approximation based upon the 
height of a person. In this example, we will calculate the height of a 39- 
story building. 

Strategy 

Think about the average height of an adult male. We can approximate the 
height of the building by scaling up from the height of a person. 

Solution 

Based on information in the example, we know there are 39 stories in the 
building. If we use the fact that the height of one story is approximately 
equal to about the length of two adult humans (each human is about 2-m 
tall), then we can estimate the total height of the building to be 
Equation: 


2 2 
cle x ial aed cae x39 stories = 156 m. 
1 person 1 story 


Discussion 

You can use known quantities to determine an approximate measurement 
of unknown quantities. If your hand measures 10 cm across, how many 
hand lengths equal the width of your desk? What other measurements can 
you approximate besides length? 


Example: 
Approximating Vast Numbers: a Trillion Dollars 


A bank stack contains one-hundred 
$100 bills, and is worth $10,000. How 
many bank stacks make up a trillion 
dollars? (credit: Andrew Magill) 


The U.S. federal deficit in the 2008 fiscal year was a little greater than $10 
trillion. Most of us do not have any concept of how much even one trillion 
actually is. Suppose that you were given a trillion dollars in $100 bills. If 
you made 100-bill stacks and used them to evenly cover a football field 
(between the end zones), make an approximation of how high the money 
pile would become. (We will use feet/inches rather than meters here 


because football fields are measured in yards.) One of your friends says 3 
in., while another says 10 ft. What do you think? 

Strategy 

When you imagine the situation, you probably envision thousands of small 
stacks of 100 wrapped $100 bills, such as you might see in movies or at a 
bank. Since this is an easy-to-approximate quantity, let us start there. We 
can find the volume of a stack of 100 bills, find out how many stacks make 
up one trillion dollars, and then set this volume equal to the area of the 
football field multiplied by the unknown height. 

Solution 

(1) Calculate the volume of a stack of 100 bills. The dimensions of a single 
bill are approximately 3 in. by 6 in. A stack of 100 of these is about 0.5 in. 
thick. So the total volume of a stack of 100 bills is: 

Equation: 


volume of stack = length x width x height, 
volume of stack = 6 in. x 3 in. x 0.5 in., 


volume of stack = 9 in.?. 


(2) Calculate the number of stacks. Note that a trillion dollars is equal to 
$1 x 10!%, and a stack of one-hundred $100 bills is equal to $10,000, or 
$1 x 10*. The number of stacks you will have is: 

Equation: 


$1 x 10'(a trillion dollars)/ $1 x 10* per stack = 1 x 108 stacks. 


(3) Calculate the area of a football field in square inches. The area of a 
football field is 100 yd x50 yd, which gives 5,000 yd’. Because we are 
working in inches, we need to convert square yards to square inches: 
Equation: 
Area = 5,000 yd? x 3 x BE x at x tH = 6,480,000 in.?, 
Area 6 x 10° in.”. 


This conversion gives us 6 x 10° in.? for the area of the field. (Note that 
we are using only one significant figure in these calculations.) 


(4) Calculate the total volume of the bills. The volume of all the $100-bill 
stacks is 9 in.*/stack x 10° stacks = 9 x 10° in.°*. 

(5) Calculate the height. To determine the height of the bills, use the 
equation: 

Equation: 


volume of bills = areaof field x height of money: 


volume of bills 
area of field ’ 


Height of money = 


Height of money = Sx) in = 1.33 x 107in., 


Height of money ~ 1 x 107in. = 100 in. 


The height of the money will be about 100 in. high. Converting this value 
to feet gives 
Equation: 


1f 
100 in. x a = 8.33 ft = 8 ft. 
12 in. 


Discussion 

The final approximate value is much higher than the early estimate of 3 in., 
but the other early estimate of 10 ft (120 in.) was roughly correct. How did 
the approximation measure up to your first guess? What can this exercise 
tell you in terms of rough “guesstimates” versus carefully calculated 
approximations? 


Exercise: 
Check Your Understanding 


Problem: 
Using mental math and your understanding of fundamental units, 


approximate the area of a regulation basketball court. Describe the 
process you used to arrive at your final approximation. 


Solution: 


An average male is about two meters tall. It would take approximately 
15 men laid out end to end to cover the length, and about 7 to cover the 
width. That gives an approximate area of 420 m7. 


Summary 


Scientists often approximate the values of quantities to perform calculations 
and analyze systems. 


Problems & Exercises 


Exercise: 


Problem: How many heartbeats are there in a lifetime? 
Solution: 


Sample answer: 2 x 10° heartbeats 
Exercise: 
Problem: 
A generation is about one-third of a lifetime. Approximately how 
many generations have passed since the year 0 AD? 
Exercise: 
Problem: 
How many times longer than the mean life of an extremely unstable 


atomic nucleus is the lifetime of a human? (Hint: The lifetime of an 
unstable atomic nucleus is on the order of 10°? s.) 


Solution: 


Sample answer: 2 x 10°! if an average human lifetime is taken to be 
about 70 years. 


Exercise: 


Problem: 


Calculate the approximate number of atoms in a bacterium. Assume 
that the average mass of an atom in the bacterium is ten times the mass 
of a hydrogen atom. (Hint: The mass of a hydrogen atom is on the 
order of 10-2” kg and the mass of a bacterium is on the order of 

10°*° kg.) 


This color-enhanced photo 
shows Salmonella typhimurium 
(red) attacking human cells. 
These bacteria are commonly 
known for causing foodborne 
illness. Can you estimate the 
number of atoms in each 
bacterium? (credit: Rocky 
Mountain Laboratories, NIAID, 
NIH) 


Exercise: 


Problem: 


Approximately how many atoms thick is a cell membrane, assuming 
all atoms there average about twice the size of a hydrogen atom? 


Solution: 


Sample answer: 50 atoms 
Exercise: 


Problem: 


(a) What fraction of Earth’s diameter is the greatest ocean depth? (b) 
The greatest mountain height? 


Exercise: 


Problem: 

(a) Calculate the number of cells in a hummingbird assuming the mass 
of an average cell is ten times the mass of a bacterium. (b) Making the 
Same assumption, how many cells are there in a human? 

Solution: 

Sample answers: 


(a) 107? cells/hummingbird 


(b) 101° cells/human 
Exercise: 


Problem: 


Assuming one nerve impulse must end before another can begin, what 
is the maximum firing rate of a nerve in impulses per second? 


Glossary 


approximation 
an estimated value based on prior experience and reasoning 


Introduction to One-Dimensional Kinematics 
class="introduction" 


The motion 
of an 
American 
kestrel 
through the 
air can be 
described by 
the bird’s 
displacement 
, speed, 
velocity, and 
acceleration. 
When it flies 
in a Straight 
line without 
any change 
in direction, 
its motion is 
said to be 
one 
dimensional. 
(credit: Vince 
Maidens, 
Wikimedia 
Commons) 


Objects are in motion everywhere we look. Everything from a tennis game 
to a space-probe flyby of the planet Neptune involves motion. When you 
are resting, your heart moves blood through your veins. And even in 
inanimate objects, there is continuous motion in the vibrations of atoms and 
molecules. Questions about motion are interesting in and of themselves: 
How long will it take for a space probe to get to Mars? Where will a 
football land if it is thrown at a certain angle? But an understanding of 
motion is also key to understanding other concepts in physics. An 
understanding of acceleration, for example, is crucial to the study of force. 


Our formal study of physics begins with kinematics which is defined as the 
study of motion without considering its causes. The word “kinematics” 
comes from a Greek term meaning motion and is related to other English 
words such as “cinema” (movies) and “kinesiology” (the study of human 
motion). In one-dimensional kinematics and Two-Dimensional Kinematics 
we will study only the motion of a football, for example, without worrying 
about what forces cause or change its motion. Such considerations come in 
other chapters. In this chapter, we examine the simplest type of motion— 
namely, motion along a straight line, or one-dimensional motion. In Two- 
Dimensional Kinematics, we apply concepts developed here to study 
motion along curved paths (two- and three-dimensional motion); for 
example, that of a car rounding a curve. 


Displacement 


e Define position, displacement, distance, and distance traveled. 

e Explain the relationship between position and displacement. 

e Distinguish between displacement and distance traveled. 

e Calculate displacement and distance given initial position, final 
position, and the path between the two. 


These cyclists in Vietnam can 
be described by their position 
relative to buildings and a canal. 
Their motion can be described 
by their change in position, or 
displacement, in the frame of 
reference. (credit: Suzan Black, 
Fotopedia) 


Position 


In order to describe the motion of an object, you must first be able to 
describe its position—where it is at any particular time. More precisely, 
you need to specify its position relative to a convenient reference frame. 
Earth is often used as a reference frame, and we often describe the position 
of an object as it relates to stationary objects in that reference frame. For 


example, a rocket launch would be described in terms of the position of the 
rocket with respect to the Earth as a whole, while a professor’s position 
could be described in terms of where she is in relation to the nearby white 
board. (See [link].) In other cases, we use reference frames that are not 
stationary but are in motion relative to the Earth. To describe the position of 
a person in an airplane, for example, we use the airplane, not the Earth, as 
the reference frame. (See [link].) 


Displacement 


If an object moves relative to a reference frame (for example, if a professor 
moves to the right relative to a white board or a passenger moves toward 
the rear of an airplane), then the object’s position changes. This change in 
position is known as displacement. The word “displacement” implies that 
an object has moved, or has been displaced. 


Note: 

Displacement 

Displacement is the change in position of an object: 
Equation: 


PNG ae iy 


where Az is displacement, 2 is the final position, and 29 is the initial 
position. 


In this text the upper case Greek letter A (delta) always means “change in” 
whatever quantity follows it; thus, Az means change in position. Always 
solve for displacement by subtracting initial position xg from final position 
£e. 


Note that the SI unit for displacement is the meter (m) (see Physical 
Quantities and Units), but sometimes kilometers, miles, feet, and other units 
of length are used. Keep in mind that when units other than the meter are 


used in a problem, you may need to convert them into meters to complete 
the calculation. 


*0 xf 


1.5m 3.5m 


A professor paces left and right 
while lecturing. Her position 
relative to Earth is given by z. 
The +2.0 m displacement of 
the professor relative to Earth is 
represented by an arrow 
pointing to the right. 


==" 


2.0 m 6.0 m 


A passenger moves from his seat to the back of the 
plane. His location relative to the airplane is given 
by z. The —4.0-m displacement of the passenger 
relative to the plane is represented by an arrow 
toward the rear of the plane. Notice that the arrow 
representing his displacement is twice as long as 
the arrow representing the displacement of the 
professor (he moves twice as far) in [link]. 


Note that displacement has a direction as well as a magnitude. The 
professor’s displacement is 2.0 m to the right, and the airline passenger’s 
displacement is 4.0 m toward the rear. In one-dimensional motion, direction 
can be specified with a plus or minus sign. When you begin a problem, you 
should select which direction is positive (usually that will be to the right or 
up, but you are free to select positive as being any direction). The 
professor’s initial position is z9 = 1.5 m and her final position is 

xp = 3.5 m. Thus her displacement is 

Equation: 


Ax = 25-22% =3.5m—-15m=42.0m. 


In this coordinate system, motion to the right is positive, whereas motion to 
the left is negative. Similarly, the airplane passenger’s initial position is 

Xo = 6.0 m and his final position is x = 2.0 m, so his displacement is 
Equation: 


Ax = x5 —% = 2.0m—6.0m = —4.0 m. 


His displacement is negative because his motion is toward the rear of the 
plane, or in the negative x direction in our coordinate system. 


Distance 


Although displacement is described in terms of direction, distance is not. 
Distance is defined to be the magnitude or size of displacement between 
two positions. Note that the distance between two positions is not the same 
as the distance traveled between them. Distance traveled is the total length 
of the path traveled between two positions. Distance has no direction and, 
thus, no sign. For example, the distance the professor walks is 2.0 m. The 
distance the airplane passenger walks is 4.0 m. 


Note: 

Misconception Alert: Distance Traveled vs. Magnitude of Displacement 
It is important to note that the distance traveled, however, can be greater 
than the magnitude of the displacement (by magnitude, we mean just the 
size of the displacement without regard to its direction; that is, just a 
number with a unit). For example, the professor could pace back and forth 
many times, perhaps walking a distance of 150 m during a lecture, yet still 
end up only 2.0 m to the right of her starting point. In this case her 
displacement would be +2.0 m, the magnitude of her displacement would 
be 2.0 m, but the distance she traveled would be 150 m. In kinematics we 
nearly always deal with displacement and magnitude of displacement, and 
almost never with distance traveled. One way to think about this is to 
assume you marked the start of the motion and the end of the motion. The 


displacement is simply the difference in the position of the two marks and 
is independent of the path taken in traveling between the two marks. The 
distance traveled, however, is the total length of the path taken between the 
two marks. 


Exercise: 
Check Your Understanding 


Problem: 
A cyclist rides 3 km west and then turns around and rides 2 km east. 
(a) What is her displacement? (b) What distance does she ride? (c) 


What is the magnitude of her displacement? 


Solution: 


—-ae MiGs) 4) 
XE xo 


Ax) = +2 km 


(a) The rider’s displacement is Ax = x¢ — x9 = —1 km. (The 
displacement is negative because we take east to be positive and west 
to be negative.) 


(b) The distance traveled is 3 km + 2 km = 5 km. 


(c) The magnitude of the displacement is 1 km. 


Section Summary 


e Kinematics is the study of motion without considering its causes. In 
this chapter, it is limited to motion along a straight line, called one- 
dimensional motion. 

e Displacement is the change in position of an object. 


e Insymbols, displacement Az is defined to be 
Equation: 


Ax = xf — Xo, 


where 2q is the initial position and 2¢ is the final position. In this text, 
the Greek letter A (delta) always means “change in” whatever quantity 
follows it. The SI unit for displacement is the meter (m). Displacement 
has a direction as well as a magnitude. 

e When you start a problem, assign which direction will be positive. 

e Distance is the magnitude of displacement between two positions. 

e Distance traveled is the total length of the path traveled between two 
positions. 


Conceptual Questions 


Exercise: 
Problem: 
Give an example in which there are clear distinctions among distance 


traveled, displacement, and magnitude of displacement. Specifically 
identify each quantity in your example. 


Exercise: 
Problem: 
Under what circumstances does distance traveled equal magnitude of 


displacement? What is the only case in which magnitude of 
displacement and displacement are exactly the same? 


Exercise: 
Problem: 
Bacteria move back and forth by using their flagella (structures that 
look like little tails). Speeds of up to 50 pm/s (50 x 10° m/ s) have 


been observed. The total distance traveled by a bacterium is large for 
its size, while its displacement is small. Why is this? 


Problems & Exercises 
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Exercise: 
Problem: 
Find the following for path A in [link]: (a) The distance traveled. (b) 


The magnitude of the displacement from start to finish. (c) The 
displacement from start to finish. 


Solution: 
(a) 7m 
(b) 7m 
(c) +7 m 


Exercise: 
Problem: 
Find the following for path B in [link]: (a) The distance traveled. (b) 


The magnitude of the displacement from start to finish. (c) The 
displacement from start to finish. 


Exercise: 


Problem: 


Find the following for path C in [link]: (a) The distance traveled. (b) 
The magnitude of the displacement from start to finish. (c) The 
displacement from start to finish. 


Solution: 
(a) 13m 
(b)9m 


(c) +9 m 
Exercise: 
Problem: 
Find the following for path D in [link]: (a) The distance traveled. (b) 


The magnitude of the displacement from start to finish. (c) The 
displacement from start to finish. 


Glossary 


kinematics 
the study of motion without considering its causes 


position 
the location of an object at a particular time 


displacement 
the change in position of an object 


distance 
the magnitude of displacement between two positions 


distance traveled 
the total length of the path traveled between two positions 


Vectors, Scalars, and Coordinate Systems 


¢ Define and distinguish between scalar and vector quantities. 
e Assign a coordinate system for a scenario involving one-dimensional 
motion. 
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The motion of this Eclipse 
Concept jet can be described in 
terms of the distance it has 
traveled (a scalar quantity) or its 
displacement in a specific 
direction (a vector quantity). In 
order to specify the direction of 
motion, its displacement must 
be described based on a 
coordinate system. In this case, 
it may be convenient to choose 
motion toward the left as 
positive motion (it is the 
forward direction for the plane), 
although in many cases, the z- 
coordinate runs from left to 
right, with motion to the right as 
positive and motion to the left 
as negative. (credit: Armchair 
Aviator, Flickr) 


What is the difference between distance and displacement? Whereas 
displacement is defined by both direction and magnitude, distance is 
defined only by magnitude. Displacement is an example of a vector 
quantity. Distance is an example of a scalar quantity. A vector is any 
quantity with both magnitude and direction. Other examples of vectors 
include a velocity of 90 km/h east and a force of 500 newtons straight 
down. 


The direction of a vector in one-dimensional motion is given simply by a 
plus (+) or minus (—) sign. Vectors are represented graphically by arrows. 
An arrow used to represent a vector has a length proportional to the vector’s 
magnitude (e.g., the larger the magnitude, the longer the length of the 
vector) and points in the same direction as the vector. 


Some physical quantities, like distance, either have no direction or none is 
specified. A scalar is any quantity that has a magnitude, but no direction. 
For example, a 20°C temperature, the 250 kilocalories (250 Calories) of 
energy in a candy bar, a 90 km/h speed limit, a person’s 1.8 m height, and a 
distance of 2.0 m are all scalars—quantities with no specified direction. 
Note, however, that a scalar can be negative, such as a —20°C temperature. 
In this case, the minus sign indicates a point on a scale rather than a 
direction. Scalars are never represented by arrows. 


Coordinate Systems for One-Dimensional Motion 


In order to describe the direction of a vector quantity, you must designate a 
coordinate system within the reference frame. For one-dimensional motion, 
this is a simple coordinate system consisting of a one-dimensional 
coordinate line. In general, when describing horizontal motion, motion to 
the right is usually considered positive, and motion to the left is considered 
negative. With vertical motion, motion up is usually positive and motion 
down is negative. In some cases, however, as with the jet in [link], it can be 
more convenient to switch the positive and negative directions. For 
example, if you are analyzing the motion of falling objects, it can be useful 
to define downwards as the positive direction. If people in a race are 


running to the left, it is useful to define left as the positive direction. It does 
not matter as long as the system is clear and consistent. Once you assign a 
positive direction and start solving a problem, you cannot change it. 
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It is usually 
convenient to 
consider motion 
upward or to the 
right as positive 
(+) and motion 
downward or to the 
left as negative (—) 


Exercise: 
Check Your Understanding 


Problem: 
A person’s speed can stay the same as he or she rounds a corner and 
changes direction. Given this information, is speed a scalar or a vector 


quantity? Explain. 


Solution: 


Speed is a scalar quantity. It does not change at all with direction 
changes; therefore, it has magnitude only. If it were a vector quantity, 
it would change as direction changes (even if its magnitude remained 
constant). 


Section Summary 


e A vector is any quantity that has magnitude and direction. 
e A scalar is any quantity that has magnitude but no direction. 


e Displacement and velocity are vectors, whereas distance and speed are 


scalars. 
e In one-dimensional motion, direction is specified by a plus or minus 
sign to signify left or right, up or down, and the like. 


Conceptual Questions 


Exercise: 
Problem: 
A student writes, “A bird that is diving for prey has a speed of 


—10 m/s.” What is wrong with the student’s statement? What has the 
student actually described? Explain. 


Exercise: 


Problem: What is the speed of the bird in [link]? 
Exercise: 
Problem: 
Acceleration is the change in velocity over time. Given this 
information, is acceleration a vector or a scalar quantity? Explain. 


Exercise: 


Problem: 


A weather forecast states that the temperature is predicted to be —5°C 
the following day. Is this temperature a vector or a scalar quantity? 
Explain. 


Glossary 


scalar 
a quantity that is described by magnitude, but not direction 


vector 
a quantity that is described by both magnitude and direction 


Time, Velocity, and Speed 


e Explain the relationships between instantaneous velocity, average 
velocity, instantaneous speed, average speed, displacement, and time. 

¢ Calculate velocity and speed given initial position, initial time, final 
position, and final time. 

e Derive a graph of velocity vs. time given a graph of position vs. time. 

e Interpret a graph of velocity vs. time. 


The motion of these racing 
snails can be described by their 
speeds and their velocities. 
(credit: tobitasflickr, Flickr) 


There is more to motion than distance and displacement. Questions such as, 
“How long does a foot race take?” and “What was the runner’s speed?” 
cannot be answered without an understanding of other concepts. In this 
section we add definitions of time, velocity, and speed to expand our 
description of motion. 


Time 


As discussed in Physical Quantities and Units, the most fundamental 
physical quantities are defined by how they are measured. This is the case 
with time. Every measurement of time involves measuring a change in 


some physical quantity. It may be a number on a digital clock, a heartbeat, 
or the position of the Sun in the sky. In physics, the definition of time is 
simple—time is change, or the interval over which change occurs. It is 
impossible to know that time has passed unless something changes. 


The amount of time or change is calibrated by comparison with a standard. 
The SI unit for time is the second, abbreviated s. We might, for example, 
observe that a certain pendulum makes one full swing every 0.75 s. We 
could then use the pendulum to measure time by counting its swings or, of 
course, by connecting the pendulum to a clock mechanism that registers 
time on a dial. This allows us to not only measure the amount of time, but 
also to determine a sequence of events. 


How does time relate to motion? We are usually interested in elapsed time 
for a particular motion, such as how long it takes an airplane passenger to 
get from his seat to the back of the plane. To find elapsed time, we note the 
time at the beginning and end of the motion and subtract the two. For 
example, a lecture may start at 11:00 A.M. and end at 11:50 A.M., so that 
the elapsed time would be 50 min. Elapsed time At is the difference 
between the ending time and beginning time, 

Equation: 


At = ts — to, 


where At is the change in time or elapsed time, ¢, is the time at the end of 
the motion, and ¢ is the time at the beginning of the motion. (As usual, the 
delta symbol, A, means the change in the quantity that follows it.) 


Life is simpler if the beginning time fg is taken to be zero, as when we use a 
stopwatch. If we were using a stopwatch, it would simply read zero at the 
start of the lecture and 50 min at the end. If t) = 0, then At = ¢; = t. 


In this text, for simplicity’s sake, 


e motion starts at time equal to zero (to = 0) 


e the symbol ¢ is used for elapsed time unless otherwise specified 
(At =t= t) 


Velocity 


Your notion of velocity is probably the same as its scientific definition. You 
know that if you have a large displacement in a small amount of time you 
have a large velocity, and that velocity has units of distance divided by 
time, such as miles per hour or kilometers per hour. 


Note: 

Average Velocity 

Average velocity is displacement (change in position) divided by the time 
of travel, 

Equation: 


ne Az eet 
SNe 


where v is the average (indicated by the bar over the v) velocity, Az is the 
change in position (or displacement), and x¢ and 29 are the final and 
beginning positions at times ¢¢ and fo, respectively. If the starting time fo is 
taken to be zero, then the average velocity is simply 
Equation: 
Az 

= 


Ve 


Notice that this definition indicates that velocity is a vector because 
displacement is a vector. It has both magnitude and direction. The SI unit 
for velocity is meters per second or m/s, but many other units, such as km/h, 
mi/h (also written as mph), and cm/s, are in common use. Suppose, for 
example, an airplane passenger took 5 seconds to move —4 m (the negative 
sign indicates that displacement is toward the back of the plane). His 
average velocity would be 

Equation: 


=. = —0.8 m/s. 


_ Az —4m 
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The minus sign indicates the average velocity is also toward the rear of the 
plane. 


The average velocity of an object does not tell us anything about what 
happens to it between the starting point and ending point, however. For 
example, we cannot tell from average velocity whether the airplane 
passenger stops momentarily or backs up before he goes to the back of the 
plane. To get more details, we must consider smaller segments of the trip 
over smaller time intervals. 


mieceleie eevee eleTeree wt = 
1 aa 
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A more detailed record of an airplane 

passenger heading toward the back of 

the plane, showing smaller segments 
of his trip. 


The smaller the time intervals considered in a motion, the more detailed the 
information. When we carry this process to its logical conclusion, we are 
left with an infinitesimally small interval. Over such an interval, the average 
velocity becomes the instantaneous velocity or the velocity at a specific 
instant. A car’s speedometer, for example, shows the magnitude (but not the 


direction) of the instantaneous velocity of the car. (Police give tickets based 
on instantaneous velocity, but when calculating how long it will take to get 
from one place to another on a road trip, you need to use average velocity.) 
Instantaneous velocity v is the average velocity at a specific instant in time 
(or over an infinitesimally small time interval). 


Mathematically, finding instantaneous velocity, v, at a precise instant ¢ can 
involve taking a limit, a calculus operation beyond the scope of this text. 
However, under many circumstances, we can find precise values for 
instantaneous velocity without calculus. 


Speed 


In everyday language, most people use the terms “speed” and “velocity” 
interchangeably. In physics, however, they do not have the same meaning 
and they are distinct concepts. One major difference is that speed has no 
direction. Thus speed is a scalar. Just as we need to distinguish between 
instantaneous velocity and average velocity, we also need to distinguish 
between instantaneous speed and average speed. 


Instantaneous speed is the magnitude of instantaneous velocity. For 
example, suppose the airplane passenger at one instant had an instantaneous 
velocity of —3.0 m/s (the minus meaning toward the rear of the plane). At 
that same time his instantaneous speed was 3.0 m/s. Or suppose that at one 
time during a shopping trip your instantaneous velocity is 40 km/h due 
north. Your instantaneous speed at that instant would be 40 km/h—the same 
magnitude but without a direction. Average speed, however, is very 
different from average velocity. Average speed is the distance traveled 
divided by elapsed time. 


We have noted that distance traveled can be greater than displacement. So 
average speed can be greater than average velocity, which is displacement 
divided by time. For example, if you drive to a store and return home in half 
an hour, and your car’s odometer shows the total distance traveled was 6 
km, then your average speed was 12 km/h. Your average velocity, however, 
was zero, because your displacement for the round trip is zero. 


(Displacement is change in position and, thus, is zero for a round trip.) Thus 
average speed is not simply the magnitude of average velocity. 


During a 30-minute round trip to the 
store, the total distance traveled is 6 
km. The average speed is 12 km/h. 
The displacement for the round trip is 
zero, since there was no net change in 
position. Thus the average velocity is 
zero. 


Another way of visualizing the motion of an object is to use a graph. A plot 
of position or of velocity as a function of time can be very useful. For 
example, for this trip to the store, the position, velocity, and speed-vs.-time 
graphs are displayed in [link]. (Note that these graphs depict a very 
simplified model of the trip. We are assuming that speed is constant during 
the trip, which is unrealistic given that we’ll probably stop at the store. But 
for simplicity’s sake, we will model it with no stops or changes in speed. 
We are also assuming that the route between the store and the house is a 
perfectly straight line.) 
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Speed vs. Time 
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Position vs. time, velocity vs. 

time, and speed vs. time on a 

trip. Note that the velocity for 
the return trip is negative. 


Note: 

Making Connections: Take-Home Investigation—Getting a Sense of Speed 
If you have spent much time driving, you probably have a good sense of 
speeds between about 10 and 70 miles per hour. But what are these in 
meters per second? What do we mean when we say that something is 
moving at 10 m/s? To get a better sense of what these values really mean, 
do some observations and calculations on your own: 


¢ calculate typical car speeds in meters per second 

e estimate jogging and walking speed by timing yourself; convert the 
measurements into both m/s and mi/h 

e determine the speed of an ant, snail, or falling leaf 


Exercise: 
Check Your Understanding 


Problem: 


A commuter train travels from Baltimore to Washington, DC, and back 
in 1 hour and 45 minutes. The distance between the two stations is 
approximately 40 miles. What is (a) the average velocity of the train, 
and (b) the average speed of the train in m/s? 


Solution: 


(a) The average velocity of the train is zero because xs = 29; the train 
ends up at the same place it starts. 


(b) The average speed of the train is calculated below. Note that the 
train travels 40 miles one way and 40 miles back, for a total distance of 
80 miles. 

Equation: 


distance 80 miles 


time 105 minutes 


Equation: 


80 miles 5280 feet 1 meter 1 minute 


a ee = SH 0 m/s 
105 minutes 1 mile 3.28 feet 60 seconds 


Section Summary 


e Time is measured in terms of change, and its SI unit is the second (s). 
Elapsed time for an event is 
Equation: 


At = ts — to, 


where t, is the final time and fo is the initial time. The initial time is 
often taken to be zero, as if measured with a stopwatch; the elapsed 
time is then just ¢. 


e Average velocity v is defined as displacement divided by the travel 
time. In symbols, average velocity is 
Equation: 


= Az _ & — 2 
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e The SI unit for velocity is m/s. 

e Velocity is a vector and thus has a direction. 

e Instantaneous velocity v is the velocity at a specific instant or the 
average velocity for an infinitesimal interval. 

e Instantaneous speed is the magnitude of the instantaneous velocity. 

e Instantaneous speed is a scalar quantity, as it has no direction 
specified. 

e Average speed is the total distance traveled divided by the elapsed 
time. (Average speed is not the magnitude of the average velocity.) 
Speed is a scalar quantity; it has no direction associated with it. 


Conceptual Questions 


Exercise: 
Problem: 
Give an example (but not one from the text) of a device used to 


measure time and identify what change in that device indicates a 
change in time. 


Exercise: 
Problem: 
There is a distinction between average speed and the magnitude of 


average velocity. Give an example that illustrates the difference 
between these two quantities. 


Exercise: 
Problem: 
Does a car’s odometer measure position or displacement? Does its 
speedometer measure speed or velocity? 

Exercise: 
Problem: 
If you divide the total distance traveled on a car trip (as determined by 
the odometer) by the time for the trip, are you calculating the average 


speed or the magnitude of the average velocity? Under what 
circumstances are these two quantities the same? 


Exercise: 


Problem: 
How are instantaneous velocity and instantaneous speed related to one 
another? How do they differ? 

Problems & Exercises 


Exercise: 


Problem: 


(a) Calculate Earth’s average speed relative to the Sun. (b) What is its 
average velocity over a period of one year? 


Solution: 
(a) 3.0 x 104 m/s 


(b) 0 m/s 
Exercise: 
Problem: 
A helicopter blade spins at exactly 100 revolutions per minute. Its tip 
is 5.00 m from the center of rotation. (a) Calculate the average speed 


of the blade tip in the helicopter’s frame of reference. (b) What is its 
average velocity over one revolution? 


Exercise: 
Problem: 
The North American and European continents are moving apart at a 


rate of about 3 cm/y. At this rate how long will it take them to drift 500 
km farther apart than they are at present? 


Solution: 


2 x 10’ years 


Exercise: 


Problem: 


Land west of the San Andreas fault in southern California is moving at 
an average velocity of about 6 cm/y northwest relative to land east of 
the fault. Los Angeles is west of the fault and may thus someday be at 
the same latitude as San Francisco, which is east of the fault. How far 
in the future will this occur if the displacement to be made is 590 km 
northwest, assuming the motion remains constant? 


Exercise: 
Problem: 
On May 26, 1934, a streamlined, stainless steel diesel train called the 
Zephyr set the world’s nonstop long-distance speed record for trains. 
Its run from Denver to Chicago took 13 hours, 4 minutes, 58 seconds, 
and was witnessed by more than a million people along the route. The 


total distance traveled was 1633.8 km. What was its average speed in 
km/h and m/s? 


Solution: 


34.689 m/s = 124.88 km/h 
Exercise: 
Problem: 
Tidal friction is slowing the rotation of the Earth. As a result, the orbit 
of the Moon is increasing in radius at a rate of approximately 4 


cm/year. Assuming this to be a constant rate, how many years will pass 
before the radius of the Moon’s orbit increases by 3.84 x 10° m (1%)? 


Exercise: 


Problem: 


A student drove to the university from her home and noted that the 
odometer reading of her car increased by 12.0 km. The trip took 18.0 
min. (a) What was her average speed? (b) If the straight-line distance 
from her home to the university is 10.3 km in a direction 25.0° south 
of east, what was her average velocity? (c) If she returned home by the 
same path 7 h 30 min after she left, what were her average speed and 
velocity for the entire trip? 


Solution: 
(a) 40.0 km/h 


(b) 34.3 km/h, 25° S of E. 


(c) average speed = 3.20 km/h, v = 0. 

Exercise: 
Problem: 
The speed of propagation of the action potential (an electrical signal) 
in a nerve cell depends (inversely) on the diameter of the axon (nerve 
fiber). If the nerve cell connecting the spinal cord to your feet is 1.1 m 


long, and the nerve impulse speed is 18 m/s, how long does it take for 
the nerve signal to travel this distance? 


Exercise: 


Problem: 


Conversations with astronauts on the lunar surface were characterized 
by a kind of echo in which the earthbound person’s voice was so loud 
in the astronaut’s space helmet that it was picked up by the astronaut’s 
microphone and transmitted back to Earth. It is reasonable to assume 
that the echo time equals the time necessary for the radio wave to 
travel from the Earth to the Moon and back (that is, neglecting any 
time delays in the electronic equipment). Calculate the distance from 
Earth to the Moon given that the echo time was 2.56 s and that radio 
waves travel at the speed of light (3.00 x 10° m/s). 


Solution: 


384,000 km 
Exercise: 


Problem: 


A football quarterback runs 15.0 m straight down the playing field in 
2.50 s. He is then hit and pushed 3.00 m straight backward in 1.75 s. 
He breaks the tackle and runs straight forward another 21.0 m in 5.20 
s. Calculate his average velocity (a) for each of the three intervals and 
(b) for the entire motion. 


Exercise: 


Problem: 


The planetary model of the atom pictures electrons orbiting the atomic 
nucleus much as planets orbit the Sun. In this model you can view 
hydrogen, the simplest atom, as having a single electron in a circular 
orbit 1.06 x 10~?° m in diameter. (a) If the average speed of the 
electron in this orbit is known to be 2.20 x 10° m /s, calculate the 
number of revolutions per second it makes about the nucleus. (b) What 
is the electron’s average velocity? 


Solution: 


(a) 6.61 x 10’° rev/s 


(b) 0 m/s 


Glossary 


average speed 
distance traveled divided by time during which motion occurs 


average velocity 
displacement divided by time over which displacement occurs 


instantaneous velocity 
velocity at a specific instant, or the average velocity over an 
infinitesimal time interval 


instantaneous speed 
magnitude of the instantaneous velocity 


time 
change, or the interval over which change occurs 


model 
simplified description that contains only those elements necessary to 
describe the physics of a physical situation 


elapsed time 
the difference between the ending time and beginning time 


Acceleration 


e Define and distinguish between instantaneous acceleration, average 
acceleration, and deceleration. 

¢ Calculate acceleration given initial time, initial velocity, final time, and 
final velocity. 


A plane decelerates, or slows 
down, as it comes in for landing 
in St. Maarten. Its acceleration 
is opposite in direction to its 
velocity. (credit: Steve Conry, 
Flickr) 


In everyday conversation, to accelerate means to speed up. The accelerator 
in a car can in fact cause it to speed up. The greater the acceleration, the 
greater the change in velocity over a given time. The formal definition of 
acceleration is consistent with these notions, but more inclusive. 


Note: 
Average Acceleration 


Average Acceleration is the rate at which velocity changes, 
Equation: 


where a is average acceleration, v is velocity, and t is time. (The bar over 
the a means average acceleration.) 


Because acceleration is velocity in m/s divided by time in s, the SI units for 
acceleration are m/ 5” meters per second squared or meters per second per 

second, which literally means by how many meters per second the velocity 
changes every second. 


Recall that velocity is a vector—it has both magnitude and direction. This 
means that a change in velocity can be a change in magnitude (or speed), 
but it can also be a change in direction. For example, if a car turns a corner 
at constant speed, it is accelerating because its direction is changing. The 
quicker you turn, the greater the acceleration. So there is an acceleration 
when velocity changes either in magnitude (an increase or decrease in 
speed) or in direction, or both. 


Note: 

Acceleration as a Vector 

Acceleration is a vector in the same direction as the change in velocity, Av 
. Since velocity is a vector, it can change either in magnitude or in 
direction. Acceleration is therefore a change in either speed or direction, or 
both. 


Keep in mind that although acceleration is in the direction of the change in 
velocity, it is not always in the direction of motion. When an object slows 
down, its acceleration is opposite to the direction of its motion. This is 
known as deceleration. 


A subway train in Sao Paulo, 
Brazil, decelerates as it comes 
into a station. It is accelerating 
in a direction opposite to its 
direction of motion. (credit: 
Yusuke Kawasaki, Flickr) 


Note: 

Misconception Alert: Deceleration vs. Negative Acceleration 
Deceleration always refers to acceleration in the direction opposite to the 
direction of the velocity. Deceleration always reduces speed. Negative 
acceleration, however, is acceleration in the negative direction in the 
chosen coordinate system. Negative acceleration may or may not be 
deceleration, and deceleration may or may not be considered negative 
acceleration. For example, consider [link]. 
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(b) 


(c) 


(d) 
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(a) This car is speeding up as it moves 
toward the right. It therefore has 
positive acceleration in our coordinate 
system. (b) This car is slowing down 
as it moves toward the right. 
Therefore, it has negative acceleration 
in our coordinate system, because its 
acceleration is toward the left. The car 
is also decelerating: the direction of its 
acceleration is opposite to its direction 
of motion. (c) This car is moving 


toward the left, but slowing down 
over time. Therefore, its acceleration 
is positive in our coordinate system 
because it is toward the right. 
However, the car is decelerating 
because its acceleration is opposite to 
its motion. (d) This car is speeding up 
as it moves toward the left. It has 
negative acceleration because it is 
accelerating toward the left. However, 
because its acceleration is in the same 
direction as its motion, it is speeding 
up (not decelerating). 


Example: 

Calculating Acceleration: A Racehorse Leaves the Gate 

A racehorse coming out of the gate accelerates from rest to a velocity of 
15.0 m/s due west in 1.80 s. What is its average acceleration? 


(credit: Jon Sullivan, PD 
Photo.org) 


Strategy 

First we draw a sketch and assign a coordinate system to the problem. This 
is a simple problem, but it always helps to visualize it. Notice that we 
assign east as positive and west as negative. Thus, in this case, we have 
negative velocity. 


4=2 N(+y) 
Ah, 
e W (-x) EB (+x) 
ae 0 
ve = —15.0 m/s S(-)) 


We can solve this problem by identifying Av and At from the given 
information and then calculating the average acceleration directly from the 


equationa = 4% = rae 


Solution 

1. Identify the knowns. vp = 0, ve = —15.0 m/s (the negative sign 
indicates direction toward the west), At = 1.80 s. 

2. Find the change in velocity. Since the horse is going from zero to 
—15.0 m/s, its change in velocity equals its final velocity: 
Ava — —_15.0m/s. 


3. Plug in the known values (Av and At) and solve for the unknown a. 
Equation: 


Av —15.0 m/s 9 
— = —« —_———— _ = — 8.33 ; 
At 1.80 s Oe 


a= 
Discussion 
The negative sign for acceleration indicates that acceleration is toward the 
west. An acceleration of 8.33 m/ s” due west means that the horse 
increases its velocity by 8.33 m/s due west each second, that is, 8.33 
meters per second per second, which we write as 8.33 m/ s”. This is truly 
an average acceleration, because the ride is not smooth. We shall see later 
that an acceleration of this magnitude would require the rider to hang on 
with a force nearly equal to his weight. 


Instantaneous Acceleration 


Instantaneous acceleration a, or the acceleration at a specific instant in 
time, is obtained by the same process as discussed for instantaneous 
velocity in Time, Velocity, and Speed—that is, by considering an 
infinitesimally small interval of time. How do we find instantaneous 
acceleration using only algebra? The answer is that we choose an average 
acceleration that is representative of the motion. [link] shows graphs of 
instantaneous acceleration versus time for two very different motions. In 
[link](a), the acceleration varies slightly and the average over the entire 
interval is nearly the same as the instantaneous acceleration at any time. In 
this case, we should treat this motion as if it had a constant acceleration 
equal to the average (in this case about 1.8 m/ 5’). In [link](b), the 
acceleration varies drastically over time. In such situations it is best to 
consider smaller time intervals and choose an average acceleration for each. 
For example, we could consider motion over the time intervals from 0 to 
1.0 s and from 1.0 to 3.0 s as separate motions with accelerations of 


+3.0 m/s” and —2.0 m/s’, respectively. 
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Graphs of instantaneous acceleration versus time for two 
different one-dimensional motions. (a) Here acceleration varies 
only slightly and is always in the same direction, since it is 
positive. The average over the interval is nearly the same as the 


acceleration at any given time. (b) Here the acceleration varies 
greatly, perhaps representing a package on a post office 
conveyor belt that is accelerated forward and backward as it 
bumps along. It is necessary to consider small time intervals 
(such as from 0 to 1.0 s) with constant or nearly constant 
acceleration in such a situation. 


The next several examples consider the motion of the subway train shown 
in [link]. In (a) the shuttle moves to the right, and in (b) it moves to the left. 
The examples are designed to further illustrate aspects of motion and to 
illustrate some of the reasoning that goes into solving problems. 


ixf=3.75km x= 5.25 km 


One-dimensional motion of a subway train considered in [link], 
[link], [link], [link], [link], and [link]. Here we have chosen the x- 
axis so that + means to the right and — means to the left for 
displacements, velocities, and accelerations. (a) The subway train 
moves to the right from x9 to x¢. Its displacement Az is +2.0 km. (b) 
The train moves to the left from x/o to x/¢. Its displacement Az’ is 


—1.5 km. (Note that the prime symbol (') is used simply to 
distinguish between displacement in the two different situations. The 
distances of travel and the size of the cars are on different scales to fit 

everything into the diagram.) 


Example: 

Calculating Displacement: A Subway Train 

What are the magnitude and sign of displacements for the motions of the 
subway train shown in parts (a) and (b) of [link]? 

Strategy 

A drawing with a coordinate system is already provided, so we don’t need 
to make a sketch, but we should analyze it to make sure we understand 
what it is showing. Pay particular attention to the coordinate system. To 
find displacement, we use the equation Ax = x¢ — 2p. This is 
straightforward since the initial and final positions are given. 

Solution 

1. Identify the knowns. In the figure we see that x; = 6.70 km and 

xo = 4.70 km for part (a), and x/; = 3.75 km and x/p = 5.25 km for part 
(b). 

2. Solve for displacement in part (a). 

Equation: 


Ax = x¢ — £9 = 6.70 km — 4.70 km= +2.00 km 


3. Solve for displacement in part (b). 
Equation: 


Axl= axle — 2!lo = 3.75 km — 5.25 km = —1.50 km 


Discussion 

The direction of the motion in (a) is to the right and therefore its 
displacement has a positive sign, whereas motion in (b) is to the left and 
thus has a negative sign. 


Example: 

Comparing Distance Traveled with Displacement: A Subway Train 
What are the distances traveled for the motions shown in parts (a) and (b) 
of the subway train in [link]? 

Strategy 

To answer this question, think about the definitions of distance and 
distance traveled, and how they are related to displacement. Distance 
between two positions is defined to be the magnitude of displacement, 
which was found in [link]. Distance traveled is the total length of the path 
traveled between the two positions. (See Displacement.) In the case of the 
subway train shown in [link], the distance traveled is the same as the 
distance between the initial and final positions of the train. 

Solution 

1. The displacement for part (a) was +2.00 km. Therefore, the distance 
between the initial and final positions was 2.00 km, and the distance 
traveled was 2.00 km. 

2. The displacement for part (b) was —1.5 km. Therefore, the distance 
between the initial and final positions was 1.50 km, and the distance 
traveled was 1.50 km. 

Discussion 

Distance is a scalar. It has magnitude but no sign to indicate direction. 


Example: 

Calculating Acceleration: A Subway Train Speeding Up 

Suppose the train in [link ](a) accelerates from rest to 30.0 km/h in the first 
20.0 s of its motion. What is its average acceleration during that time 
interval? 

Strategy 

It is worth it at this point to make a simple sketch: 
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a=? 


This problem involves three steps. First we must determine the change in 
velocity, then we must determine the change in time, and finally we use 
these values to calculate the acceleration. 

Solution 

1. Identify the knowns. vp = 0 (the trains starts at rest), ve = 30.0 km/h, 
and At = 20.0s. 

2. Calculate Av. Since the train starts from rest, its change in velocity is 
Av= +30.0 km/h, where the plus sign means velocity to the right. 


3. Plug in known values and solve for the unknown, a. 
Equation: 


Av _ +30.0 km/h 


a= —= 
At 20.0s 

4. Since the units are mixed (we have both hours and seconds for time), we 
need to convert everything into SI units of meters and seconds. (See 
Physical Quantities and Units for more guidance.) 

Equation: 


= +30 km/h \ / 10? m 1h 3 
= | —____ = 0.417 
( 20.0 s ) ($2) (es) ae 
Discussion 
The plus sign means that acceleration is to the right. This is reasonable 
because the train starts from rest and ends up with a velocity to the right 


(also positive). So acceleration is in the same direction as the change in 
velocity, as is always the case. 


Example: 

Calculate Acceleration: A Subway Train Slowing Down 

Now suppose that at the end of its trip, the train in [link](a) slows to a stop 
from a speed of 30.0 km/h in 8.00 s. What is its average acceleration while 
stopping? 

Strategy 


ee 


e 
es 0 km/h 
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a=? 


In this case, the train is decelerating and its acceleration is negative 
because it is toward the left. As in the previous example, we must find the 
change in velocity and the change in time and then solve for acceleration. 
Solution 

1. Identify the knowns. vp = 30.0 km/h, vp = 0 km/h (the train is 
stopped, so its velocity is 0), and At = 8.00 s. 

2. Solve for the change in velocity, Av. 

Equation: 


Av = us — v9 = 0 — 30.0 km/h = —30.0 km/h 


3. Plug in the knowns, Av and At, and solve for a. 
Equation: 


- Av _ —30.0km/h 


“At ‘8.005 
4. Convert the units to meters and seconds. 
Equation: 
—~ Av —30.0 km/h \ / 10? m 1h 1.04 m/s? 
PS ee | eee ee ere es = —1.04 m/s’. 
ORE 8.00 s 1km ) \ 3600s 
Discussion 


The minus sign indicates that acceleration is to the left. This sign is 
reasonable because the train initially has a positive velocity in this 
problem, and a negative acceleration would oppose the motion. Again, 
acceleration is in the same direction as the change in velocity, which is 
negative here. This acceleration can be called a deceleration because it has 
a direction opposite to the velocity. 


The graphs of position, velocity, and acceleration vs. time for the trains in 
[link] and [link] are displayed in [link]. (We have taken the velocity to 
remain constant from 20 to 40 s, after which the train decelerates.) 
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(a) Position of the train over time. 
Notice that the train’s position 
changes slowly at the beginning of the 
journey, then more and more quickly 
as it picks up speed. Its position then 
changes more slowly as it slows down 
at the end of the journey. In the 
middle of the journey, while the 
velocity remains constant, the position 
changes at a constant rate. (b) Velocity 
of the train over time. The train’s 
velocity increases as it accelerates at 
the beginning of the journey. It 
remains the same in the middle of the 
journey (where there is no 
acceleration). It decreases as the train 
decelerates at the end of the journey. 
(c) The acceleration of the train over 
time. The train has positive 
acceleration as it speeds up at the 
beginning of the journey. It has no 
acceleration as it travels at constant 
velocity in the middle of the journey. 
Its acceleration is negative as it slows 
down at the end of the journey. 


Example: 

Calculating Average Velocity: The Subway Train 

What is the average velocity of the train in part b of [link], and shown 
again below, if it takes 5.00 min to make its trip? 


Strategy 

Average velocity is displacement divided by time. It will be negative here, 
since the train moves to the left and has a negative displacement. 
Solution 

1. Identify the knowns. x/¢ = 3.75 km, r/y = 5.25 km, At = 5.00 min. 
2. Determine displacement, Ax/. We found Az/ to be —1.5 km in [link]. 
3. Solve for average velocity. 


Equation: 
me Al _ —1.50 km 
~ At 5.00 min 
4. Convert units. 
Equation: 
a Agzl —1.50 km 60 min 
aoe See iy eee = —18.0k 
"= "At (Faam )( ih ) em 
Discussion 


The negative velocity indicates motion to the left. 


Example: 

Calculating Deceleration: The Subway Train 

Finally, suppose the train in [link] slows to a stop from a velocity of 20.0 
km/h in 10.0 s. What is its average acceleration? 

Strategy 

Once again, let’s draw a sketch: 


—————————— 
Y = —20 km/h y 
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a=? 


As before, we must find the change in velocity and the change in time to 
calculate average acceleration. 

Solution 

1. Identify the knowns. vp = —20 km/h, v¢ = 0 km/h, At = 10.0. 
2. Calculate Av. The change in velocity here is actually positive, since 
Equation: 


Av = vt — vp = 0 — (—20 km/h)=+20 km/h. 
3. Solve for a. 
Equation: 


—~ Av +20.0km/h 
= = —_____ 


At —«*10.0s 
4. Convert units. 
Equation: 
_ +20.0 km/h \ / 10? m 1h 2 
7 ( 10.0 s ) ( ikm ) (saaz > poe 
Discussion 


The plus sign means that acceleration is to the right. This is reasonable 
because the train initially has a negative velocity (to the left) in this 
problem and a positive acceleration opposes the motion (and so it is to the 
right). Again, acceleration is in the same direction as the change in 
velocity, which is positive here. As in [link], this acceleration can be called 
a deceleration since it is in the direction opposite to the velocity. 


Sign and Direction 


Perhaps the most important thing to note about these examples is the signs 
of the answers. In our chosen coordinate system, plus means the quantity is 
to the right and minus means it is to the left. This is easy to imagine for 
displacement and velocity. But it is a little less obvious for acceleration. 
Most people interpret negative acceleration as the slowing of an object. 
This was not the case in [link], where a positive acceleration slowed a 
negative velocity. The crucial distinction was that the acceleration was in 
the opposite direction from the velocity. In fact, a negative acceleration will 
increase a negative velocity. For example, the train moving to the left in 
[link] is sped up by an acceleration to the left. In that case, both v and a are 
negative. The plus and minus signs give the directions of the accelerations. 
If acceleration has the same sign as the velocity, the object is speeding up. If 
acceleration has the opposite sign as the velocity, the object is slowing 
down. 

Exercise: 

Check Your Understanding 


Problem: 


An airplane lands on a runway traveling east. Describe its acceleration. 


Solution: 


If we take east to be positive, then the airplane has negative 
acceleration, as it is accelerating toward the west. It is also 
decelerating: its acceleration is opposite in direction to its velocity. 


Note: 

PhET Explorations: Moving Man Simulation 

Learn about position, velocity, and acceleration graphs. Move the little man 
back and forth with the mouse and plot his motion. Set the position, 
velocity, or acceleration and let the simulation move the man for you. 
https://archive.cnx.org/specials/e2ca52af-8c6b-450e-ac2f- 
9300b38e8739/moving-man/ 


Section Summary 


e Acceleration is the rate at which velocity changes. In symbols, 
average acceleration a is 
Equation: 


A) Up — Vo 
a= — = ——., 
At tf — t9 


e The SI unit for acceleration is m/ s”. 

e Acceleration is a vector, and thus has a both a magnitude and direction. 

e Acceleration can be caused by either a change in the magnitude or the 
direction of the velocity. 

e Instantaneous acceleration a is the acceleration at a specific instant in 
time. 

e Deceleration is an acceleration with a direction opposite to that of the 
velocity. 


Conceptual Questions 


Exercise: 
Problem: 
Is it possible for speed to be constant while acceleration is not zero? 
Give an example of such a situation. 

Exercise: 
Problem: 
Is it possible for velocity to be constant while acceleration is not zero? 
Explain. 

Exercise: 


Problem: 


Give an example in which velocity is zero yet acceleration is not. 


Exercise: 
Problem: 
If a subway train is moving to the left (has a negative velocity) and 


then comes to a stop, what is the direction of its acceleration? Is the 
acceleration positive or negative? 


Exercise: 
Problem: 
Plus and minus signs are used in one-dimensional motion to indicate 


direction. What is the sign of an acceleration that reduces the 
magnitude of a negative velocity? Of a positive velocity? 


Problems & Exercises 


Exercise: 


Problem: 


A cheetah can accelerate from rest to a speed of 30.0 m/s in 7.00 s. 
What is its acceleration? 


Solution: 


4.29 m/s” 


Exercise: 


Problem: Professional Application 


Dr. John Paul Stapp was U.S. Air Force officer who studied the effects 
of extreme deceleration on the human body. On December 10, 1954, 
Stapp rode a rocket sled, accelerating from rest to a top speed of 282 
m/s (1015 km/h) in 5.00 s, and was brought jarringly back to rest in 
only 1.40 s! Calculate his (a) acceleration and (b) deceleration. 


Express each in multiples of g (9.80 m/s”) by taking its ratio to the 
acceleration of gravity. 


Exercise: 


Problem: 


A commuter backs her car out of her garage with an acceleration of 
1.40 m/ 37. (a) How long does it take her to reach a speed of 2.00 m/s? 
(b) If she then brakes to a stop in 0.800 s, what is her deceleration? 


Solution: 
(a) 1.43 s 
(b) —2.50 m/s” 


Exercise: 


Problem: 


Assume that an intercontinental ballistic missile goes from rest to a 
suborbital speed of 6.50 km/s in 60.0 s (the actual speed and time are 
classified). What is its average acceleration in m/ s” and in multiples 
of g (9.80 m/s”)? 


Glossary 


acceleration 
the rate of change in velocity; the change in velocity over time 


average acceleration 
the change in velocity divided by the time over which it changes 


instantaneous acceleration 
acceleration at a specific point in time 


deceleration 


acceleration in the direction opposite to velocity; acceleration that 
results in a decrease in velocity 


Motion Equations for Constant Acceleration in One Dimension 


e Calculate displacement of an object that is not accelerating, given 
initial position and velocity. 

¢ Calculate final velocity of an accelerating object, given initial velocity, 
acceleration, and time. 

¢ Calculate displacement and final position of an accelerating object, 
given initial position, initial velocity, time, and acceleration. 


Kinematic equations can help us 
describe and predict the motion 
of moving objects such as these 
kayaks racing in Newbury, 
England. (credit: Barry Skeates, 
Flickr) 


We might know that the greater the acceleration of, say, a car moving away 
from a stop sign, the greater the displacement in a given time. But we have 
not developed a specific equation that relates acceleration and 
displacement. In this section, we develop some convenient equations for 
kinematic relationships, starting from the definitions of displacement, 
velocity, and acceleration already covered. 


Notation: t, x, v, a 


First, let us make some simplifications in notation. Taking the initial time to 
be zero, as if time is measured with a stopwatch, is a great simplification. 
Since elapsed time is At = ts — to, taking tg = 0 means that At = tg, the 
final time on the stopwatch. When initial time is taken to be zero, we use 
the subscript 0 to denote initial values of position and velocity. That is, x9 is 
the initial position and vg is the initial velocity. We put no subscripts on the 
final values. That is, ¢ is the final time, x is the final position, and v is the 
final velocity. This gives a simpler expression for elapsed time—now, 

At = t. It also simplifies the expression for displacement, which is now 
Ax = x — 29. Also, it simplifies the expression for change in velocity, 
which is now Av = v — vo. To summarize, using the simplified notation, 
with the initial time taken to be zero, 


Equation: 
ri a 
Ar = 2-29 
Av = v—vW9 


where the subscript 0 denotes an initial value and the absence of a 
subscript denotes a final value in whatever motion is under consideration. 


We now make the important assumption that acceleration is constant. This 
assumption allows us to avoid using calculus to find instantaneous 
acceleration. Since acceleration is constant, the average and instantaneous 
accelerations are equal. That is, 

Equation: 


a =a =constant, 


so we use the symbol a for acceleration at all times. Assuming acceleration 
to be constant does not seriously limit the situations we can study nor 
degrade the accuracy of our treatment. For one thing, acceleration is 
constant in a great number of situations. Furthermore, in many other 
situations we can accurately describe motion by assuming a constant 
acceleration equal to the average acceleration for that motion. Finally, in 


motions where acceleration changes drastically, such as a car accelerating to 
top speed and then braking to a stop, the motion can be considered in 
separate parts, each of which has its own constant acceleration. 


Note: 

Solving for Displacement (Az) and Final Position (a) from Average 
Velocity when Acceleration (a) is Constant 

To get our first two new equations, we start with the definition of average 
velocity: 

Equation: 


ae 
At” 


= 


Substituting the simplified notation for Az and At yields 
Equation: 


«w&— XO 
t 


Y= 


Solving for x yields 
Equation: 


z= 2 + vt, 


where the average velocity is 


Equation: 
= OT Fone) 
C= a (constant a). 
The equation v = — ” reflects the fact that, when acceleration is constant, 


v is just the simple average of the initial and final velocities. For example, if 


you steadily increase your velocity (that is, with constant acceleration) from 
30 to 60 km/h, then your average velocity during this steady increase is 45 
Ugtvu 


km/h. Using the equation v = 5 to check this, we see that 
Equation: 


bs 30 km/h + 60 km/h 
p= Toe _ Dias Oe = 45 km/h, 


which seems logical. 


Example: 

Calculating Displacement: How Far does the Jogger Run? 

A jogger runs down a straight stretch of road with an average velocity of 
4.00 m/s for 2.00 min. What is his final position, taking his initial position 
to be zero? 

Strategy 

Draw a sketch. 


Dv = 4.00 m/s y 


x x=? 


The final position x is given by the equation 
Equation: 


Z = ao + vt. 


To find x, we identify the values of zo, v, and t from the statement of the 
problem and substitute them into the equation. 
Solution 


1. Identify the knowns. v = 4.00 m/s, At = 2.00 min, and zp = 0 m. 
2. Enter the known values into the equation. 
Equation: 


z= 29+ vt = 0+ (4.00 m/s)(120s) = 480 m 


Discussion 


Velocity and final displacement are both positive, which means they are in 
the same direction. 


The equation z = zo + vt gives insight into the relationship between 
displacement, average velocity, and time. It shows, for example, that 
displacement is a linear function of average velocity. (By linear function, 


we mean that displacement depends on v rather than on v raised to some 
2 
other power, such as v . When graphed, linear functions look like straight 


lines with a constant slope.) On a car trip, for example, we will get twice as 
far in a given time if we average 90 km/h than if we average 45 km/h. 


Displacement vs. Velocity 
for a given time, t 


Displacement, Ax (m) 


Average velocity, 0 (m/s) 


There is a linear relationship 
between displacement and 
average velocity. For a given 
time ¢, an object moving twice 
as fast as another object will 


move twice as far as the other 
object. 


Note: 

Solving for Final Velocity 

We can derive another useful equation by manipulating the definition of 
acceleration. 

Equation: 


_ Av 
ey 


Substituting the simplified notation for Av and At gives us 
Equation: 


u-—v 
= ° (constant a). 
Solving for v yields 
Equation: 
v = vo + at (constant a). 
Example: 


Calculating Final Velocity: An Airplane Slowing Down after Landing 
An airplane lands with an initial velocity of 70.0 m/s and then decelerates 
at 1.50 m/ s” for 40.0 s. What is its final velocity? 

Strategy 

Draw a sketch. We draw the acceleration vector in the direction opposite 
the velocity vector because the plane is decelerating. 


———_ SS 
vy = 70.0 m/s J 


<< 
a = —1.50 m/s? 
— x 


vp=? 


Solution 

1. Identify the knowns. vp = 70.0 m/s, a = —1.50 m/s”, 1 — 400s: 
2. Identify the unknown. In this case, it is final velocity, v¢. 

3. Determine which equation to use. We can calculate the final velocity 
using the equation v = vg + at. 

4. Plug in the known values and solve. 

Equation: 


v= vp + at = 70.0 m/s + (—1.50 m/s”) (40.0s) = 10.0 m/s 


Discussion 

The final velocity is much less than the initial velocity, as desired when 
slowing down, but still positive. With jet engines, reverse thrust could be 
maintained long enough to stop the plane and start moving it backward. 
That would be indicated by a negative final velocity, which is not the case 
here. 


= 70.0 m/ = 10. 
a L, % m/s a Ll, v = 10.0 m/s 


Peed 


7 <——. y a —— 
a=-1.5 m/s? a =—1.50 m/s? 


fy = 0 t= 40.0s 


The airplane lands with an initial velocity of 70.0 m/s and slows to a 
final velocity of 10.0 m/s before heading for the terminal. Note that 
the acceleration is negative because its direction is opposite to its 
velocity, which is positive. 


In addition to being useful in problem solving, the equation v = vp + at 
gives us insight into the relationships among velocity, acceleration, and 
time. From it we can see, for example, that 


e final velocity depends on how large the acceleration is and how long it 
lasts 

e if the acceleration is zero, then the final velocity equals the initial 
velocity (v = vg), as expected (i.e., velocity is constant) 

e if a is negative, then the final velocity is less than the initial velocity 


(All of these observations fit our intuition, and it is always useful to 
examine basic equations in light of our intuition and experiences to check 
that they do indeed describe nature accurately.) 


Note: 
Making Connections: Real-World Connection 


The Space Shuttle Endeavor 
blasts off from the Kennedy 
Space Center in February 2010. 
(credit: Matthew Simantov, 
Flickr) 


An intercontinental ballistic missile (ICBM) has a larger average 
acceleration than the Space Shuttle and achieves a greater velocity in the 


first minute or two of flight (actual ICBM burn times are classified—short- 
burn-time missiles are more difficult for an enemy to destroy). But the 
Space Shuttle obtains a greater final velocity, so that it can orbit the earth 
rather than come directly back down as an ICBM does. The Space Shuttle 
does this by accelerating for a longer time. 


Note: 

Solving for Final Position When Velocity is Not Constant (a ~ 0) 

We can combine the equations above to find a third equation that allows us 
to calculate the final position of an object experiencing constant 
acceleration. We start with 

Equation: 


i — Ug 4 abe 


Adding vp to each side of this equation and dividing by 2 gives 
Equation: 


Uinaewt 1 
i ee 
2 2 
Since — “ — y for constant acceleration, then 
Equation: 
v emer 
v=vo + —at. 
Pe 


Now we substitute this expression for v into the equation for displacement, 


x= £0 + vt, yielding 
Equation: 


ee 
Ho Onion Unt ae aut (constant a). 


Example: 

Calculating Displacement of an Accelerating Object: Dragsters 
Dragsters can achieve average accelerations of 26.0 m/ 3”. Suppose such a 
dragster accelerates from rest at this rate for 5.56 s. How far does it travel 
in this time? 


U.S. Army Top Fuel pilot 
Tony “The Sarge” 
Schumacher begins a race 
with a controlled burnout. 
(credit: Lt. Col. William 
Thurmond. Photo 
Courtesy of U.S. Army.) 


Strategy 
Draw a sketch. 


xO x=? 


a= 26.0 m/s? 


We are asked to find displacement, which is x if we take x9 to be zero. 
(Think about it like the starting line of a race. It can be anywhere, but we 
call it 0 and measure all other positions relative to it.) We can use the 
equation z = %p + vot + + at” once we identify vg, a, and t from the 
statement of the problem. 

Solution 


1. Identify the knowns. Starting from rest means that vp = 0, a is given as 
26.0 m/s” and t is given as 5.56 s. 

2. Plug the known values into the equation to solve for the unknown z: 
Equation: 


1 4 
SS ag oe Gh ar Ge 


Since the initial position and velocity are both zero, this simplifies to 
Equation: 


1 
— —qt? 
oe 
Substituting the identified values of a and ¢ gives 
Equation: 
_¢ 2 2 
Tai 26.0 m/s* )(5.56s)", 
yielding 
Equation: 
x = 402 m. 
Discussion 


If we convert 402 m to miles, we find that the distance covered is very 
close to one quarter of a mile, the standard distance for drag racing. So the 
answer is reasonable. This is an impressive displacement in only 5.56 s, 
but top-notch dragsters can do a quarter mile in even less time than this. 


What else can we learn by examining the equation x = Zp + vot + Sat”? 
We see that: 


e displacement depends on the square of the elapsed time when 
acceleration is not zero. In [link], the dragster covers only one fourth 
of the total distance in the first half of the elapsed time 


e if acceleration is zero, then the initial velocity equals average velocity 


(vp = v) and z = xp + vot + Fat” becomes z = xo + vot 


Note: 

Solving for Final Velocity when Velocity Is Not Constant (a # 0) 
A fourth useful equation can be obtained from another algebraic 
manipulation of previous equations. 

If we solve v = vo + at for t, we get 


Equation: 
U-—v 
— 
a 

Substituting this and v = a ” into x = xo + vt, we get 
Equation: 

vy? = v2 + 2a(ax — x0) (constant a). 
Example: 


Calculating Final Velocity: Dragsters 

Calculate the final velocity of the dragster in [link] without using 
information about time. 

Strategy 

Draw a sketch. 


a= 26.0 m/s2 


The equation v? = v? + 2a(x — zo) is ideally suited to this task because it 
relates velocities, acceleration, and displacement, and no time information 
is required. 


Solution 
1. Identify the known values. We know that vg = 0, since the dragster 
starts from rest. Then we note that x — zg = 402 m (this was the answer 


in [link]). Finally, the average acceleration was given to be a = 26.0 m/ 5° 


2. Plug the knowns into the equation v? = v2 + 2a(ax — zo) and solve for 
v. 
Equation: 


v=0+ 2(26.0 m/s”) (402 m). 


Thus 
Equation: 


vy? = 2.09 x 104 m?/s”. 


To get v, we take the square root: 
Equation: 


v= \/2.09 x 104 m?/s? = 145 m/s. 


Discussion 

145 m/s is about 522 km/h or about 324 mi/h, but even this breakneck 
speed is short of the record for the quarter mile. Also, note that a square 
root has two values; we took the positive value to indicate a velocity in the 
same direction as the acceleration. 


An examination of the equation v? = ve + 2a(x — xo) can produce further 
insights into the general relationships among physical quantities: 


e The final velocity depends on how large the acceleration is and the 
distance over which it acts 

e For a fixed deceleration, a car that is going twice as fast doesn’t simply 
stop in twice the distance—it takes much further to stop. (This is why 


we have reduced speed zones near schools.) 


Putting Equations Together 


In the following examples, we further explore one-dimensional motion, but 
in situations requiring slightly more algebraic manipulation. The examples 
also give insight into problem-solving techniques. The box below provides 
easy reference to the equations needed. 


Note: 
Summary of Kinematic Equations (constant a) 
Equation: 

= Coe ut 
Equation: 

= Ug 4 U 

U= 

2 

Equation: 

Vv = Vo + at 
Equation: 

1 
1 == A ae ae ao mee 

Equation: 


vy? = vs + 2a(ax — 20) 


Example: 


Calculating Displacement: How Far Does a Car Go When Coming to a 
Halt? 


On dry concrete, a car can decelerate at a rate of 7.00 m/ 57, whereas on 


wet concrete it can decelerate at only 5.00 m/ s”. Find the distances 
necessary to stop a car moving at 30.0 m/s (about 110 km/h) (a) on dry 
concrete and (b) on wet concrete. (c) Repeat both calculations, finding the 
displacement from the point where the driver sees a traffic light turn red, 
taking into account his reaction time of 0.500 s to get his foot on the brake. 
Strategy 

Draw a sketch. 


Ax =? 
vp = 30.0 m/s ve = 0 m/s 
——$—____—_—__ > & 


Adry = —7.00 m/s? 
Awe = —5.00 m/s? 


In order to determine which equations are best to use, we need to list all of 
the known values and identify exactly what we need to solve for. We shall 
do this explicitly in the next several examples, using tables to set them off. 
Solution for (a) 

1. Identify the knowns and what we want to solve for. We know that 

Up = 30.0 m/s; v = 0; a = —7.00 m/s” (a is negative because it is in a 
direction opposite to velocity). We take x9 to be 0. We are looking for 
displacement Az, or x — Zo. 

2. Identify the equation that will help up solve the problem. The best 
equation to use is 

Equation: 

vu? = va + 2a(ax — 20). 

This equation is best because it includes only one unknown, xz. We know 
the values of all the other variables in this equation. (There are other 
equations that would allow us to solve for x, but they require us to know 


the stopping time, t, which we do not know. We could use them but it 
would entail additional calculations.) 
3. Rearrange the equation to solve for z. 


Equation: 
v? — v2 
wt — LO a 

4. Enter known values. 
Equation: 

0? — (30.0 m/s)” 

Cee 

2 (—7.00 m/s”) 
Thus, 
Equation: 


x = 64.3 m on dry concrete. 


Solution for (b) 

This part can be solved in exactly the same manner as Part A. The only 
difference is that the deceleration is — 5.00 m/ s”. The result is 
Equation: 


Lwet — 90.0 m on wet concrete. 


Solution for (c) 

Once the driver reacts, the stopping distance is the same as it is in Parts A 
and B for dry and wet concrete. So to answer this question, we need to 
calculate how far the car travels during the reaction time, and then add that 
to the stopping time. It is reasonable to assume that the velocity remains 
constant during the driver’s reaction time. 

1. Identify the knowns and what we want to solve for. We know that 


Vv = 30.0 m/s; Eeeeuon — 0.500 S; Qreaction = 0. We take Z0—reaction to be 
0. We are looking for Zyeaction. 
2. Identify the best equation to use. 


x = £9 + vt works well because the only unknown value is z, which is 
what we want to solve for. 

3. Plug in the knowns to solve the equation. 

Equation: 


xz = 0+ (30.0 m/s)(0.500 s) = 15.0 m. 


This means the car travels 15.0 m while the driver reacts, making the total 
displacements in the two cases of dry and wet concrete 15.0 m greater than 
if he reacted instantly. 

4. Add the displacement during the reaction time to the displacement when 
braking. 

Equation: 


& braking + Lreaction = Ltotal 


a. 64.3 m + 15.0 m = 79.3 m when dry 
b. 90.0 m + 15.0 m = 105 m when wet 


64.3m 
" 90.0m 


@ 
wet 


Reaction 


79.3m 
" 105m 


@ 
wet 


Position x (m) 


The distance necessary to stop a car varies greatly, depending on road 
conditions and driver reaction time. Shown here are the braking 
distances for dry and wet pavement, as calculated in this example, for 
a car initially traveling at 30.0 m/s. Also shown are the total distances 
traveled from the point where the driver first sees a light turn red, 
assuming a 0.500 s reaction time. 


Discussion 

The displacements found in this example seem reasonable for stopping a 
fast-moving car. It should take longer to stop a car on wet rather than dry 
pavement. It is interesting that reaction time adds significantly to the 
displacements. But more important is the general approach to solving 
problems. We identify the knowns and the quantities to be determined and 
then find an appropriate equation. There is often more than one way to 
solve a problem. The various parts of this example can in fact be solved by 
other methods, but the solutions presented above are the shortest. 


Example: 

Calculating Time: A Car Merges into Traffic 

Suppose a car merges into freeway traffic on a 200-m-long ramp. If its 
initial velocity is 10.0 m/s and it accelerates at 2.00 m/ s*, how long does 
it take to travel the 200 m up the ramp? (Such information might be useful 
to a traffic engineer.) 

Strategy 

Draw a sketch. 


t=? 


[-———— 


xo = 0 x = 200 m 
vy = 10.0 m/s v=? 
———— ————————— 
a = 2.00 m/s? 
———— 


We are asked to solve for the time t. As before, we identify the known 
quantities in order to choose a convenient physical relationship (that is, an 
equation with one unknown, f). 

Solution 

1. Identify the knowns and what we want to solve for. We know that 

vo — 10m /s;o — 2-00 m/s”; and x = 200 m. 

2. We need to solve for t. Choose the best equation. x = x9 + vot + + at? 
works best because the only unknown in the equation is the variable ¢ for 
which we need to solve. 


3. We will need to rearrange the equation to solve for ft. In this case, it will 
be easier to plug in the knowns first. 
Equation: 


1 
200 m = 0m + (10.0 m/s)t + > (2.00 m/s”) t 


4. Simplify the equation. The units of meters (m) cancel because they are 
in each term. We can get the units of seconds (s) to cancel by taking t = ¢ s 
, where ¢ is the magnitude of time and s is the unit. Doing so leaves 
Equation: 


200 = 10¢ + t?. 


5. Use the quadratic formula to solve for f. 
(a) Rearrange the equation to get 0 on one side of the equation. 
Equation: 


t? +10¢ — 200 =0 


This is a quadratic equation of the form 
Equation: 


ae kb eo =O) 


where the constants are a = 1.00, b = 10.0, and c = —200. 
(b) Its solutions are given by the quadratic formula: 
Equation: 
ed + /b2 — dac dac 

2a 


This yields two solutions for t, which are 
Equation: 


t = 10.0 and — 20.0. 


In this case, then, the time is t = t in seconds, or 
Equation: 


t = 10.0 s and — 20.0 s. 


A negative value for time is unreasonable, since it would mean that the 
event happened 20 s before the motion began. We can discard that solution. 
Thus, 

Equation: 


t= 10.0 s. 


Discussion 

Whenever an equation contains an unknown squared, there will be two 
solutions. In some problems both solutions are meaningful, but in others, 
such as the above, only one solution is reasonable. The 10.0 s answer 
seems reasonable for a typical freeway on-ramp. 


With the basics of kinematics established, we can go on to many other 
interesting examples and applications. In the process of developing 
kinematics, we have also glimpsed a general approach to problem solving 
that produces both correct answers and insights into physical relationships. 
Problem-Solving Basics discusses problem-solving basics and outlines an 
approach that will help you succeed in this invaluable task. 


Note: 

Making Connections: Take-Home Experiment—Breaking News 

We have been using SI units of meters per second squared to describe some 
examples of acceleration or deceleration of cars, runners, and trains. To 
achieve a better feel for these numbers, one can measure the braking 
deceleration of a car doing a slow (and safe) stop. Recall that, for average 


acceleration, a = Av/At. While traveling in a car, slowly apply the 
brakes as you come up to a stop sign. Have a passenger note the initial 
speed in miles per hour and the time taken (in seconds) to stop. From this, 
calculate the deceleration in miles per hour per second. Convert this to 
meters per second squared and compare with other decelerations 
mentioned in this chapter. Calculate the distance traveled in braking. 


Exercise: 
Check Your Understanding 


Problem: 


A manned rocket accelerates at a rate of 20 m/ 3” during launch. How 
long does it take the rocket to reach a velocity of 400 m/s? 


Solution: 


To answer this, choose an equation that allows you to solve for time ¢, 
given only a, vo, and v. 
Equation: 


V = Ug + at 


Rearrange to solve for t. 
Equation: 


pe. 400 m/s—Om/s _ 


5 20s 
a 20 m/s 


Section Summary 


¢ To simplify calculations we take acceleration to be constant, so that 


a = aatall times. 

e We also take initial time to be zero. 

e Initial position and velocity are given a subscript 0; final values have 
no subscript. Thus, 
Equation: 


> 
8 
| 
8 
| 
8 
fan) 


e The following kinematic equations for motion with constant a are 


useful: 
Equation: 

(lat 1 ut 
Equation: 

= Vo + U 

i 

2 

Equation: 

U= vg al 
Equation: 

1 
L= A+ volt + gu 

Equation: 


vu? = v2 + 2a(ax — 29) 


e In vertical motion, y is substituted for x. 


Problems & Exercises 


Exercise: 


Problem: 


An Olympic-class sprinter starts a race with an acceleration of 
4.50 m/ a (a) What is her speed 2.40 s later? (b) Sketch a graph of 
her position vs. time for this period. 


Solution: 


(a) 10.8 m/s 


(b) 


Position vs. Time 


Position (m) 
w 
r=) 


Exercise: 


Problem: 


A well-thrown ball is caught in a well-padded mitt. If the deceleration 
of the ball is 2.10 x 10* m/s”, and 1.85 ms (1 ms = 107° s) elapses 


from the time the ball first touches the mitt until it stops, what was the 
initial velocity of the ball? 


Solution: 


38.9 m/s (about 87 miles per hour) 
Exercise: 


Problem: 


A bullet in a gun is accelerated from the firing chamber to the end of 
the barrel at an average rate of 6.20 x 10° m/s” for 8.10 x 10°“. 
What is its muzzle velocity (that is, its final velocity)? 


Exercise: 


Problem: 


(a) A light-rail commuter train accelerates at a rate of 1.35 m/ s’. How 
long does it take to reach its top speed of 80.0 km/h, starting from rest? 
(b) The same train ordinarily decelerates at a rate of 1.65 m/ s’. How 
long does it take to come to a stop from its top speed? (c) In 
emergencies the train can decelerate more rapidly, coming to rest from 
80.0 km/h in 8.30 s. What is its emergency deceleration in m/ s?? 


Solution: 
(a) 16.5s 
(b) 13.5 s 


(c) — 2.68 m/s” 
Exercise: 


Problem: 


While entering a freeway, a car accelerates from rest at a rate of 

2.40 m/ s* for 12.0. (a) Draw a sketch of the situation. (b) List the 
knowns in this problem. (c) How far does the car travel in those 12.0 
s? To solve this part, first identify the unknown, and then discuss how 
you chose the appropriate equation to solve for it. After choosing the 
equation, show your steps in solving for the unknown, check your 
units, and discuss whether the answer is reasonable. (d) What is the 
car’s final velocity? Solve for this unknown in the same manner as in 
part (c), showing all steps explicitly. 


Exercise: 
Problem: 
At the end of a race, a runner decelerates from a velocity of 9.00 m/s at 


a rate of 2.00 m/ af (a) How far does she travel in the next 5.00 s? (b) 
What is her final velocity? (c) Evaluate the result. Does it make sense? 


Solution: 
(a) 20.0 m 
(b) —1.00 m/s 


(c) This result does not really make sense. If the runner starts at 9.00 
m/s and decelerates at 2.00 m/ s”, then she will have stopped after 
4.50 s. If she continues to decelerate, she will be running backwards. 


Exercise: 


Problem:Professional Application: 


Blood is accelerated from rest to 30.0 cm/s in a distance of 1.80 cm by 
the left ventricle of the heart. (a) Make a sketch of the situation. (b) 
List the knowns in this problem. (c) How long does the acceleration 
take? To solve this part, first identify the unknown, and then discuss 
how you chose the appropriate equation to solve for it. After choosing 
the equation, show your steps in solving for the unknown, checking 
your units. (d) Is the answer reasonable when compared with the time 
for a heartbeat? 


Exercise: 


Problem: 


In a slap shot, a hockey player accelerates the puck from a velocity of 
8.00 m/s to 40.0 m/s in the same direction. If this shot takes 
3.33 x 10°? s, calculate the distance over which the puck accelerates. 


Solution: 


0.799 m 


Exercise: 


Problem: 


A powerful motorcycle can accelerate from rest to 26.8 m/s (100 
km/h) in only 3.90 s. (a) What is its average acceleration? (b) How far 
does it travel in that time? 


Exercise: 
Problem: 
Freight trains can produce only relatively small accelerations and 
decelerations. (a) What is the final velocity of a freight train that 
accelerates at a rate of 0.0500 m/ s” for 8.00 min, starting with an 
initial velocity of 4.00 m/s? (b) If the train can slow down at a rate of 


0.550 m/ 5”, how long will it take to come to a stop from this velocity? 
(c) How far will it travel in each case? 


Solution: 
(a) 28.0 m/s 
(b) 50.9 s 


(c) 7.68 km to accelerate and 713 m to decelerate 
Exercise: 
Problem: 
A fireworks shell is accelerated from rest to a velocity of 65.0 m/s over 


a distance of 0.250 m. (a) How long did the acceleration last? (b) 
Calculate the acceleration. 


Exercise: 


Problem: 


A swan on a lake gets airbome by flapping its wings and running on 
top of the water. (a) If the swan must reach a velocity of 6.00 m/s to 
take off and it accelerates from rest at an average rate of 0.350 m/ s”, 
how far will it travel before becoming airborne? (b) How long does 
this take? 


Solution: 
(a) 51.4m 


(b) 17.1 


Exercise: 


Problem: Professional Application: 


A woodpecker’s brain is specially protected from large decelerations 
by tendon-like attachments inside the skull. While pecking on a tree, 
the woodpecker’s head comes to a stop from an initial velocity of 

0.600 m/s in a distance of only 2.00 mm. (a) Find the acceleration in 


m/s” and in multiples of g (9 = 9.80 m/s”), (b) Calculate the 


stopping time. (c) The tendons cradling the brain stretch, making its 
stopping distance 4.50 mm (greater than the head and, hence, less 
deceleration of the brain). What is the brain’s deceleration, expressed 
in multiples of g? 


Exercise: 
Problem: 
An unwary football player collides with a padded goalpost while 
running at a velocity of 7.50 m/s and comes to a full stop after 


compressing the padding and his body 0.350 m. (a) What is his 
deceleration? (b) How long does the collision last? 


Solution: 


(a) —80.4 m/s” 


(b) 9.33 x 10°? s 
Exercise: 


Problem: 


In World War II, there were several reported cases of airmen who 
jumped from their flaming airplanes with no parachute to escape 
certain death. Some fell about 20,000 feet (6000 m), and some of them 
survived, with few life-threatening injuries. For these lucky pilots, the 
tree branches and snow drifts on the ground allowed their deceleration 
to be relatively small. If we assume that a pilot’s speed upon impact 
was 123 mph (54 m/s), then what was his deceleration? Assume that 
the trees and snow stopped him over a distance of 3.0 m. 


Exercise: 
Problem: 
Consider a grey squirrel falling out of a tree to the ground. (a) If we 
ignore air resistance in this case (only for the sake of this problem), 
determine a squirrel’s velocity just before hitting the ground, assuming 
it fell from a height of 3.0 m. (b) If the squirrel stops in a distance of 


2.0 cm through bending its limbs, compare its deceleration with that of 
the airman in the previous problem. 


Solution: 
(a) 7.7 m/s 


(b) —15 x 10? m i s*. This is about 3 times the deceleration of the 
pilots, who were falling from thousands of meters high! 


Exercise: 


Problem: 


An express train passes through a station. It enters with an initial 
velocity of 22.0 m/s and decelerates at a rate of 0.150 m/ s” as it goes 
through. The station is 210 m long. (a) How long is the nose of the 
train in the station? (b) How fast is it going when the nose leaves the 
station? (c) If the train is 130 m long, when does the end of the train 
leave the station? (d) What is the velocity of the end of the train as it 
leaves? 


Exercise: 


Problem: 


Dragsters can actually reach a top speed of 145 m/s in only 4.45 s— 
considerably less time than given in [link] and [link]. (a) Calculate the 
average acceleration for such a dragster. (b) Find the final velocity of 
this dragster starting from rest and accelerating at the rate found in (a) 
for 402 m (a quarter mile) without using any information on time. (c) 
Why is the final velocity greater than that used to find the average 
acceleration? Hint: Consider whether the assumption of constant 
acceleration is valid for a dragster. If not, discuss whether the 
acceleration would be greater at the beginning or end of the run and 
what effect that would have on the final velocity. 


Solution: 
(a) 32.6 m/s” 
(b) 162 m/s 


(c) v > Umax, because the assumption of constant acceleration is not 
valid for a dragster. A dragster changes gears, and would have a 
greater acceleration in first gear than second gear than third gear, etc. 
The acceleration would be greatest at the beginning, so it would not be 
accelerating at 32.6 m/ — during the last few meters, but substantially 
less, and the final velocity would be less than 162 m/s. 


Exercise: 


Problem: 


A bicycle racer sprints at the end of a race to clinch a victory. The 
racer has an initial velocity of 11.5 m/s and accelerates at the rate of 
0.500 m/ s” for 7.00 s. (a) What is his final velocity? (b) The racer 
continues at this velocity to the finish line. If he was 300 m from the 
finish line when he started to accelerate, how much time did he save? 
(c) One other racer was 5.00 m ahead when the winner started to 
accelerate, but he was unable to accelerate, and traveled at 11.8 m/s 
until the finish line. How far ahead of him (in meters and in seconds) 
did the winner finish? 


Exercise: 
Problem: 
In 1967, New Zealander Burt Munro set the world record for an Indian 
motorcycle, on the Bonneville Salt Flats in Utah, with a maximum 
speed of 183.58 mi/h. The one-way course was 5.00 mi long. 
Acceleration rates are often described by the time it takes to reach 60.0 
mi/h from rest. If this time was 4.00 s, and Burt accelerated at this rate 


until he reached his maximum speed, how long did it take Burt to 
complete the course? 


Solution: 


104 s 


Exercise: 


Problem: 


(a) A world record was set for the men’s 100-m dash in the 2008 
Olympic Games in Beijing by Usain Bolt of Jamaica. Bolt “coasted” 
across the finish line with a time of 9.69 s. If we assume that Bolt 
accelerated for 3.00 s to reach his maximum speed, and maintained 
that speed for the rest of the race, calculate his maximum speed and his 
acceleration. (b) During the same Olympics, Bolt also set the world 
record in the 200-m dash with a time of 19.30 s. Using the same 
assumptions as for the 100-m dash, what was his maximum speed for 
this race? 


Solution: 
(a) v = 12.2 m/s; a = 4.07 m/s” 


(b) v= 11.2 m/s 


Problem-Solving Basics for One-Dimensional Kinematics 


e Apply problem-solving steps and strategies to solve problems of one- 
dimensional kinematics. 

e Apply strategies to determine whether or not the result of a problem is 
reasonable, and if not, determine the cause. 


Problem-solving skills are 
essential to your success in 
Physics. (credit: scui3asteveo, 
Flickr) 


Problem-solving skills are obviously essential to success in a quantitative 
course in physics. More importantly, the ability to apply broad physical 
principles, usually represented by equations, to specific situations is a very 
powerful form of knowledge. It is much more powerful than memorizing a 
list of facts. Analytical skills and problem-solving abilities can be applied to 
new situations, whereas a list of facts cannot be made long enough to 
contain every possible circumstance. Such analytical skills are useful both 
for solving problems in this text and for applying physics in everyday and 
professional life. 


Problem-Solving Steps 


While there is no simple step-by-step method that works for every problem, 
the following general procedures facilitate problem solving and make it 
more meaningful. A certain amount of creativity and insight is required as 
well. 


Step 1 


Examine the situation to determine which physical principles are involved. 
It often helps to draw a simple sketch at the outset. You will also need to 
decide which direction is positive and note that on your sketch. Once you 
have identified the physical principles, it is much easier to find and apply 
the equations representing those principles. Although finding the correct 
equation is essential, keep in mind that equations represent physical 
principles, laws of nature, and relationships among physical quantities. 
Without a conceptual understanding of a problem, a numerical solution is 
meaningless. 


Step 2 


Make a list of what is given or can be inferred from the problem as stated 
(identify the knowns). Many problems are stated very succinctly and require 
some inspection to determine what is known. A sketch can also be very 
useful at this point. Formally identifying the knowns is of particular 
importance in applying physics to real-world situations. Remember, 
“stopped” means velocity is zero, and we often can take initial time and 
position as zero. 


Step 3 


Identify exactly what needs to be determined in the problem (identify the 
unknowns). In complex problems, especially, it is not always obvious what 
needs to be found or in what sequence. Making a list can help. 


Step 4 


Find an equation or set of equations that can help you solve the problem. 
Your list of knowns and unknowns can help here. It is easiest if you can 
find equations that contain only one unknown—that is, all of the other 
variables are known, so you can easily solve for the unknown. If the 
equation contains more than one unknown, then an additional equation is 
needed to solve the problem. In some problems, several unknowns must be 
determined to get at the one needed most. In such problems it is especially 
important to keep physical principles in mind to avoid going astray in a sea 
of equations. You may have to use two (or more) different equations to get 
the final answer. 


Step 5 


Substitute the knowns along with their units into the appropriate equation, 
and obtain numerical solutions complete with units. This step produces the 
numerical answer; it also provides a check on units that can help you find 
errors. If the units of the answer are incorrect, then an error has been made. 
However, be warned that correct units do not guarantee that the numerical 
part of the answer is also correct. 


Step 6 


Check the answer to see if it is reasonable: Does it make sense? This final 
step is extremely important—the goal of physics is to accurately describe 
nature. To see if the answer is reasonable, check both its magnitude and its 
sign, in addition to its units. Your judgment will improve as you solve more 
and more physics problems, and it will become possible for you to make 
finer and finer judgments regarding whether nature is adequately described 
by the answer to a problem. This step brings the problem back to its 
conceptual meaning. If you can judge whether the answer is reasonable, you 
have a deeper understanding of physics than just being able to mechanically 
solve a problem. 


When solving problems, we often perform these steps in different order, and 
we also tend to do several steps simultaneously. There is no rigid procedure 
that will work every time. Creativity and insight grow with experience, and 
the basics of problem solving become almost automatic. One way to get 
practice is to work out the text’s examples for yourself as you read. Another 
is to work as many end-of-section problems as possible, starting with the 
easiest to build confidence and progressing to the more difficult. Once you 
become involved in physics, you will see it all around you, and you can 
begin to apply it to situations you encounter outside the classroom, just as is 
done in many of the applications in this text. 


Unreasonable Results 


Physics must describe nature accurately. Some problems have results that 
are unreasonable because one premise is unreasonable or because certain 
premises are inconsistent with one another. The physical principle applied 
correctly then produces an unreasonable result. For example, if a person 
starting a foot race accelerates at 0.40 m/ s” for 100 s, his final speed will 
be 40 m/s (about 150 km/h)—clearly unreasonable because the time of 100 
s is an unreasonable premise. The physics is correct in a sense, but there is 
more to describing nature than just manipulating equations correctly. 
Checking the result of a problem to see if it is reasonable does more than 
help uncover errors in problem solving—it also builds intuition in judging 
whether nature is being accurately described. 


Use the following strategies to determine whether an answer is reasonable 
and, if it is not, to determine what is the cause. 


Step 1 


Solve the problem using strategies as outlined and in the format followed in 
the worked examples in the text. In the example given in the preceding 
paragraph, you would identify the givens as the acceleration and time and 
use the equation below to find the unknown final velocity. That is, 
Equation: 


v=) +at=0+4 (0.40 m/s”) (100 s) = 40 m/s. 


Step 2 


Check to see if the answer is reasonable. Is it too large or too small, or does 
it have the wrong sign, improper units, ...? In this case, you may need to 
convert meters per second into a more familiar unit, such as miles per hour. 
Equation: 


40 m 3.28 ft 1 mi 60s 60 min \ _ 89 moh 
8 m 5280 ft / \ min iho ee 


This velocity is about four times greater than a person can run—-so it is too 
large. 


Step 3 


If the answer is unreasonable, look for what specifically could cause the 
identified difficulty. In the example of the runner, there are only two 
assumptions that are suspect. The acceleration could be too great or the time 
too long. First look at the acceleration and think about what the number 
means. If someone accelerates at 0.40 m/ 3". their velocity is increasing by 
0.4 m/s each second. Does this seem reasonable? If so, the time must be too 
long. It is not possible for someone to accelerate at a constant rate of 

0.40 m/ s” for 100 s (almost two minutes). 


Section Summary 


e The six basic problem solving steps for physics are: 


Step 1. Examine the situation to determine which physical principles 
are involved. 


Step 2. Make a list of what is given or can be inferred from the 
problem as stated (identify the knowns). 


Step 3. Identify exactly what needs to be determined in the problem 
(identify the unknowns). 


Step 4. Find an equation or set of equations that can help you solve the 
problem. 


Step 5. Substitute the knowns along with their units into the 
appropriate equation, and obtain numerical solutions complete with 
units. 


Step 6. Check the answer to see if it is reasonable: Does it make sense? 


Conceptual Questions 


Exercise: 
Problem: 
What information do you need in order to choose which equation or 
equations to use to solve a problem? Explain. 

Exercise: 


Problem: 


What is the last thing you should do when solving a problem? Explain. 


Falling Objects 


e Describe the effects of gravity on objects in motion. 
e Describe the motion of objects that are in free fall. 
e Calculate the position and velocity of objects in free fall. 


Falling objects form an interesting class of motion problems. For example, we 
can estimate the depth of a vertical mine shaft by dropping a rock into it and 
listening for the rock to hit the bottom. By applying the kinematics developed 
so far to falling objects, we can examine some interesting situations and learn 
much about gravity in the process. 


Gravity 


The most remarkable and unexpected fact about falling objects is that, if air 
resistance and friction are negligible, then in a given location all objects fall 
toward the center of Earth with the same constant acceleration, independent 
of their mass. This experimentally determined fact is unexpected, because we 
are so accustomed to the effects of air resistance and friction that we expect 
light objects to fall slower than heavy ones. 


Lh 


In air In a vacuum In a vacuum (the hard way) 


A hammer and a feather will 
fall with the same constant 
acceleration if air resistance is 
considered negligible. This is a 
general characteristic of gravity 
not unique to Earth, as astronaut 
David R. Scott demonstrated on 
the Moon in 1971, where the 


acceleration due to gravity is 
only 1.67 m/s’. 


In the real world, air resistance can cause a lighter object to fall slower than a 
heavier object of the same size. A tennis ball will reach the ground after a 
hard baseball dropped at the same time. (It might be difficult to observe the 
difference if the height is not large.) Air resistance opposes the motion of an 
object through the air, while friction between objects—such as between 
clothes and a laundry chute or between a stone and a pool into which it is 
dropped—also opposes motion between them. For the ideal situations of these 
first few chapters, an object falling without air resistance or friction is defined 
to be in free-fall. 


The force of gravity causes objects to fall toward the center of Earth. The 
acceleration of free-falling objects is therefore called the acceleration due to 
gravity. The acceleration due to gravity is constant, which means we can 
apply the kinematics equations to any falling object where air resistance and 
friction are negligible. This opens a broad class of interesting situations to us. 
The acceleration due to gravity is so important that its magnitude is given its 
own symbol, g. It is constant at any given location on Earth and has the 
average value 

Equation: 


g = 9.80 m/s’. 


Although g varies from 9.78 m/s” to 9.83 m/s”, depending on latitude, 
altitude, underlying geological formations, and local topography, the average 
value of 9.80 m/ s” will be used in this text unless otherwise specified. The 
direction of the acceleration due to gravity is downward (towards the center of 
Earth). In fact, its direction defines what we call vertical. Note that whether 
the acceleration a in the kinematic equations has the value +g or —g depends 
on how we define our coordinate system. If we define the upward direction as 
positive, then a = —g = —9.80 m/ s”, and if we define the downward 


direction as positive, then a = g = 9.80 m/ 3”. 


One-Dimensional Motion Involving Gravity 


The best way to see the basic features of motion involving gravity is to start 
with the simplest situations and then progress toward more complex ones. So 
we Start by considering straight up and down motion with no air resistance or 
friction. These assumptions mean that the velocity (if there is any) is vertical. 
If the object is dropped, we know the initial velocity is zero. Once the object 
has left contact with whatever held or threw it, the object is in free-fall. Under 
these circumstances, the motion is one-dimensional and has constant 
acceleration of magnitude g. We will also represent vertical displacement with 
the symbol y and use zx for horizontal displacement. 


Note: 
Kinematic Equations for Objects in Free-Fall where Acceleration = -g 
Equation: 


V = U9 — gt 
Equation: 
1 2 
Sins Une 
Equation: 
v” = vp — 2g(y — yo) 
Example: 


Calculating Position and Velocity of a Falling Object: A Rock Thrown 
Upward 

A person standing on the edge of a high cliff throws a rock straight up with 
an initial velocity of 13.0 m/s. The rock misses the edge of the cliff as it falls 
back to earth. Calculate the position and velocity of the rock 1.00 s, 2.00 s, 
and 3.00 s after it is thrown, neglecting the effects of air resistance. 


Strategy 
Draw a sketch. 


J 
Uy =13.0 m/s | a =-9.8 m/s? | 
x 


We are asked to determine the position y at various times. It is reasonable to 
take the initial position yo to be zero. This problem involves one-dimensional 
motion in the vertical direction. We use plus and minus signs to indicate 
direction, with up being positive and down negative. Since up is positive, and 
the rock is thrown upward, the initial velocity must be positive too. The 
acceleration due to gravity is downward, so a is negative. It is crucial that the 
initial velocity and the acceleration due to gravity have opposite signs. 
Opposite signs indicate that the acceleration due to gravity opposes the initial 
motion and will slow and eventually reverse it. 

Since we are asked for values of position and velocity at three times, we will 
refer to these as y; and v4; yg and v9; and y3 and v3. 

Solution for Position y; 

1. Identify the knowns. We know that yo = 0; vp = 13.0 m/s; 

a = —g = —9.80 m/s’; andt = 1.00s. 

2. Identify the best equation to use. We will use y = yo + vot + Sat? 
because it includes only one unknown, y (or yj, here), which is the value we 
want to find. 

3. Plug in the known values and solve for y}. 

Equation: 


1 
yi = 0+ (13.0 m/s)(1.00 s) + (—9.80 m/s”) (1.00s)* = 8.10 m 


Discussion 

The rock is 8.10 m above its starting point at £ = 1.00 s, since y; > yo. It 
could be moving up or down; the only way to tell is to calculate v; and find 
out if it is positive or negative. 

Solution for Velocity v1 

1. Identify the knowns. We know that yo = 0; vo = 13.0 m/s; 

a= —g = —9.80 m/s”; and t = 1.00 s. We also know from the solution 
above that y; = 8.10 m. 


2. Identify the best equation to use. The most straightforward is v = v9 — gt 
(from v = vo + at, where a = gravitational acceleration = —g). 

3. Plug in the knowns and solve. 

Equation: 


Hy ee ean (9.80 m/s”) (1.00 s) = 3.20 m/s 


Discussion 

The positive value for v; means that the rock is still heading upward at 

t = 1.00 s. However, it has slowed from its original 13.0 m/s, as expected. 
Solution for Remaining Times 

The procedures for calculating the position and velocity at t = 2.00 s and 
3.00 s are the same as those above. The results are summarized in [link] and 
illustrated in [link]. 


Time, t Position, y Velocity, v Acceleration, a 

1.00 s 8.10 m 3.20 m/s —9.80 m/s” 

2.00 s 6.40 m —6.60 m/s —9.80 m/s” 

3.00 s —5.10m —16.4 m/s ~9.80 m/s” 
Results 


Graphing the data helps us understand it more clearly. 


Velocity (m/s) Vertical Position (m) 


Acceleration (m/s?) 


Position vs. Time 


Time (s) 


Velocity vs. Time 


Time (s) 


Acceleration vs. Time 


1 Dy, 3 4 


Time (s) 


Vertical position, vertical 
velocity, and vertical 


acceleration vs. time for a rock 
thrown vertically up at the edge 


of a cliff. Notice that velocity 
changes linearly with time and 


that acceleration is constant. 
Misconception Alert! Notice 


that the position vs. time graph 


shows vertical position only. It 


is easy to get the impression 
that the graph shows some 


horizontal motion—the shape of 
the graph looks like the path of 
a projectile. But this is not the 
case; the horizontal axis is time, 
not space. The actual path of the 
rock in space is straight up, and 
straight down. 


Discussion 

The interpretation of these results is important. At 1.00 s the rock is above its 
Starting point and heading upward, since y; and v, are both positive. At 2.00 
s, the rock is still above its starting point, but the negative velocity means it is 
moving downward. At 3.00 s, both y3 and v3 are negative, meaning the rock 
is below its starting point and continuing to move downward. Notice that 
when the rock is at its highest point (at 1.5 s), its velocity is zero, but its 
acceleration is still —9.80 m/s’. Its acceleration is —9.80 m/s” for the 
whole trip—while it is moving up and while it is moving down. Note that the 
values for y are the positions (or displacements) of the rock, not the total 
distances traveled. Finally, note that free-fall applies to upward motion as 
well as downward. Both have the same acceleration—the acceleration due to 
gravity, which remains constant the entire time. Astronauts training in the 
famous Vomit Comet, for example, experience free-fall while arcing up as 
well as down, as we will discuss in more detail later. 


Note: 

Making Connections: Take-Home Experiment—Reaction Time 

A simple experiment can be done to determine your reaction time. Have a 
friend hold a ruler between your thumb and index finger, separated by about 
1 cm. Note the mark on the ruler that is right between your fingers. Have 
your friend drop the ruler unexpectedly, and try to catch it between your two 
fingers. Note the new reading on the ruler. Assuming acceleration is that due 
to gravity, calculate your reaction time. How far would you travel in a car 
(moving at 30 m/s) if the time it took your foot to go from the gas pedal to 
the brake was twice this reaction time? 


Example: 

Calculating Velocity of a Falling Object: A Rock Thrown Down 

What happens if the person on the cliff throws the rock straight down, instead 
of straight up? To explore this question, calculate the velocity of the rock 
when it is 5.10 m below the starting point, and has been thrown downward 
with an initial speed of 13.0 m/s. 

Strategy 

Draw a sketch. 


i 
vo = —13.0 | IF = —9.8 m/s? | 
x 


Since up is positive, the final position of the rock will be negative because it 
finishes below the starting point at yo = 0. Similarly, the initial velocity is 
downward and therefore negative, as is the acceleration due to gravity. We 
expect the final velocity to be negative since the rock will continue to move 
downward. 

Solution 

1. Identify the knowns. yo = 0; yi = —5.10 m; vo = —13.0 m/s; 

a = —g = —9.80 m/s’. 

2. Choose the kinematic equation that makes it easiest to solve the problem. 
The equation v? = v2 + 2a(y — yo) works well because the only unknown 
in it is v. (We will plug y; in for y.) 

3. Enter the known values 

Equation: 


y? = (—13.0 m/s)? + 2(—9.80 m/s”) (—5.10 m — 0 m) = 268.96 m?/s”, 


where we have retained extra significant figures because this is an 
intermediate result. 

Taking the square root, and noting that a square root can be positive or 
negative, gives 

Equation: 


v= +16.4 m/s. 


The negative root is chosen to indicate that the rock is still heading down. 
Thus, 
Equation: 


v= -—16.4m/s. 


Discussion 

Note that this is exactly the same velocity the rock had at this position when 
it was thrown straight upward with the same initial speed. (See [link] and 
[link ](a).) This is not a coincidental result. Because we only consider the 
acceleration due to gravity in this problem, the speed of a falling object 
depends only on its initial speed and its vertical position relative to the 
starting point. For example, if the velocity of the rock is calculated at a 
height of 8.10 m above the starting point (using the method from [link]) 
when the initial velocity is 13.0 m/s straight up, a result of +3.20 m/s is 
obtained. Here both signs are meaningful; the positive value occurs when the 
rock is at 8.10 m and heading up, and the negative value occurs when the 
rock is at 8.10 m and heading back down. It has the same speed but the 
opposite direction. 


j2 = 6.40 m 


vu = —13 m/s 


33 = —5.10 m y= —5.10 m 


v3 = —16.4 m/s v = —16.4 m/s 


(a) (b) 


(a) A person throws a rock straight up, as explored 
in [link]. The arrows are velocity vectors at 0, 
1.00, 2.00, and 3.00 s. (b) A person throws a rock 
straight down from a cliff with the same initial 
speed as before, as in [link]. Note that at the same 
distance below the point of release, the rock has 
the same velocity in both cases. 


Another way to look at it is this: In [link], the rock is thrown up with an 
initial velocity of 13.0 m/s. It rises and then falls back down. When its 


position is y = 0 on its way back down, its velocity is —13.0 m/s. That is, it 
has the same speed on its way down as on its way up. We would then expect 
its velocity at a position of y = —5.10 m to be the same whether we have 
thrown it upwards at +13.0 m/s or thrown it downwards at —13.0 m/s. The 
velocity of the rock on its way down from y = 0 is the same whether we 
have thrown it up or down to start with, as long as the speed with which it 
was initially thrown is the same. 


Example: 

Find g from Data on a Falling Object 

The acceleration due to gravity on Earth differs slightly from place to place, 
depending on topography (e.g., whether you are on a hill or in a valley) and 
subsurface geology (whether there is dense rock like iron ore as opposed to 
light rock like salt beneath you.) The precise acceleration due to gravity can 
be calculated from data taken in an introductory physics laboratory course. 
An object, usually a metal ball for which air resistance is negligible, is 
dropped and the time it takes to fall a known distance is measured. See, for 
example, [link]. Very precise results can be produced with this method if 
sufficient care is taken in measuring the distance fallen and the elapsed time. 
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Positions and velocities of a metal ball released from rest when 
air resistance is negligible. Velocity is seen to increase linearly 
with time while displacement increases with time squared. 
Acceleration is a constant and is equal to gravitational 

acceleration. 


Suppose the ball falls 1.0000 m in 0.45173 s. Assuming the ball is not 
affected by air resistance, what is the precise acceleration due to gravity at 
this location? 

Strategy 

Draw a sketch. 


J 
uy =O0m/s @ Vie | 
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We need to solve for acceleration a. Note that in this case, displacement is 
downward and therefore negative, as is acceleration. 

Solution 

1. Identify the knowns. yo = 0; y = —1.0000 m; ¢ = 0.45173; vo = 0. 

2. Choose the equation that allows you to solve for a using the known values. 
Equation: 


1 2 
DS Ud ar He == soak: 


3. Substitute 0 for vg and rearrange the equation to solve for a. Substituting 0 
for Uo yields 


Equation: 
1 » 
¥=Yor Ble : 
Solving for a gives 
Equation: 
Hy — 
ro (y — yo) 
t2 


4. Substitute known values yields 
Equation: 


2(—1. = 
ce 2 ee onion ee 
(0.45173 s)2 


so, because a = —g with the directions we have chosen, 
Equation: 


g = 9.8010 m/s”. 


Discussion 

The negative value for a indicates that the gravitational acceleration is 
downward, as expected. We expect the value to be somewhere around the 
average value of 9.80 m/s”, so 9.8010 m/s” makes sense. Since the data 
going into the calculation are relatively precise, this value for g is more 
precise than the average value of 9.80 m/ 5”: it represents the local value for 
the acceleration due to gravity. 


Exercise: 
Check Your Understanding 


Problem: 
A chunk of ice breaks off a glacier and falls 30.0 meters before it hits the 


water. Assuming it falls freely (there is no air resistance), how long does 
it take to hit the water? 


Solution: 


We know that initial position yo = 0, final position y = —30.0 m, and 


a=-—g= —9.80 m/s”. We can then use the equation 
Y = Yo+ vot + sat? to solve for t. Inserting a = —g, we obtain 
Equation: 
y = 040-49? 
2 _ fy 
i —9 


tf = + B= 4,/2CBOm, — 2 /eIIST = 247s 255 


where we take the positive value as the physically relevant answer. Thus, 
it takes about 2.5 seconds for the piece of ice to hit the water. 


Note: 

PhET Explorations: Equation Grapher 

Learn about graphing polynomials. The shape of the curve changes as the 
constants are adjusted. View the curves for the individual terms (e.g. y = bx) 
to see how they add to generate the polynomial curve. 


Section Summary 


An object in free-fall experiences constant acceleration if air resistance is 
negligible. 

On Earth, all free-falling objects have an acceleration due to gravity g, 
which averages 

Equation: 


g = 9.80 m/s’. 


Whether the acceleration a should be taken as +g or —g is determined 
by your choice of coordinate system. If you choose the upward direction 
as positive, a = —g = —9.80 m/ s” is negative. In the opposite case, 

a = +g = 9.80 m/ s” is positive. Since acceleration is constant, the 
kinematic equations above can be applied with the appropriate +g or —g 
substituted for a. 

For objects in free-fall, up is normally taken as positive for displacement, 
velocity, and acceleration. 


Conceptual Questions 


Exercise: 


Problem: 


What is the acceleration of a rock thrown straight upward on the way up? 
At the top of its flight? On the way down? 


Exercise: 


Problem: 


An object that is thrown straight up falls back to Earth. This is one- 
dimensional motion. (a) When is its velocity zero? (b) Does its velocity 
change direction? (c) Does the acceleration due to gravity have the same 
sign on the way up as on the way down? 


Exercise: 


Problem: 


Suppose you throw a rock nearly straight up at a coconut in a palm tree, 
and the rock misses on the way up but hits the coconut on the way down. 
Neglecting air resistance, how does the speed of the rock when it hits the 
coconut on the way down compare with what it would have been if it had 
hit the coconut on the way up? Is it more likely to dislodge the coconut 
on the way up or down? Explain. 


Exercise: 


Problem: 


If an object is thrown straight up and air resistance is negligible, then its 
speed when it returns to the starting point is the same as when it was 
released. If air resistance were not negligible, how would its speed upon 
return compare with its initial speed? How would the maximum height to 
which it rises be affected? 


Exercise: 
Problem: 
The severity of a fall depends on your speed when you strike the ground. 
All factors but the acceleration due to gravity being the same, how many 


times higher could a safe fall on the Moon be than on Earth 
(gravitational acceleration on the Moon is about 1/6 that of the Earth)? 


Exercise: 
Problem: 
How many times higher could an astronaut jump on the Moon than on 


Earth if his takeoff speed is the same in both locations (gravitational 
acceleration on the Moon is about 1/6 of g on Earth)? 


Problems & Exercises 


Assume air resistance is negligible unless otherwise stated. 
Exercise: 
Problem: 
Calculate the displacement and velocity at times of (a) 0.500, (b) 1.00, 


(c) 1.50, and (d) 2.00 s for a ball thrown straight up with an initial 
velocity of 15.0 m/s. Take the point of release to be yo = 0. 


Solution: 

(a) yi = 6.28 m; v; = 10.1 m/s 
(b) yo = 10.1 m; ve = 5.20 m/s 
(c) y3 = 11.5 m; v3 = 0.300 m/s 
(d) ys = 10.4 m; v4 = —4.60 m/s 


Exercise: 


Problem: 


Calculate the displacement and velocity at times of (a) 0.500, (b) 1.00, 
(c) 1.50, (d) 2.00, and (e) 2.50 s for a rock thrown straight down with an 
initial velocity of 14.0 m/s from the Verrazano Narrows Bridge in New 
York City. The roadway of this bridge is 70.0 m above the water. 


Exercise: 


Problem: 


A basketball referee tosses the ball straight up for the starting tip-off. At 
what velocity must a basketball player leave the ground to rise 1.25 m 
above the floor in an attempt to get the ball? 


Solution: 


vp = 4.95 m/s 
Exercise: 


Problem: 


A rescue helicopter is hovering over a person whose boat has sunk. One 
of the rescuers throws a life preserver straight down to the victim with an 
initial velocity of 1.40 m/s and observes that it takes 1.8 s to reach the 
water. (a) List the knowns in this problem. (b) How high above the water 
was the preserver released? Note that the downdraft of the helicopter 
reduces the effects of air resistance on the falling life preserver, so that 
an acceleration equal to that of gravity is reasonable. 


Exercise: 


Problem: 


A dolphin in an aquatic show jumps straight up out of the water at a 
velocity of 13.0 m/s. (a) List the knowns in this problem. (b) How high 
does his body rise above the water? To solve this part, first note that the 
final velocity is now a known and identify its value. Then identify the 
unknown, and discuss how you chose the appropriate equation to solve 
for it. After choosing the equation, show your steps in solving for the 
unknown, checking units, and discuss whether the answer is reasonable. 
(c) How long is the dolphin in the air? Neglect any effects due to his size 
or orientation. 


Solution: 


(a) a = —9.80 m/s”; vp = 13.0 m/s; yo = Om 


(b) v = 0m/s. Unknown is distance y to top of trajectory, where velocity 
is zero. Use equation v? = ve + 2a(y — yo) because it contains all 
known values except for y, so we can solve for y. Solving for y gives 
Equation: 


2 


ve—vg = 2a(y— yo) 
Ry er 
a hoe. 
2_y2 m/s)”—(13.0 m/s)? 
y — yo + 2 =0m + Ca C0)” — 862m 


2 (—9.80 m/s”) 


Dolphins measure about 2 meters long and can jump several times their 
length out of the water, so this is a reasonable result. 
(c) 2.65 s 

Exercise: 
Problem: 
A swimmer bounces straight up from a diving board and falls feet first 
into a pool. She starts with a velocity of 4.00 m/s, and her takeoff point is 
1.80 m above the pool. (a) How long are her feet in the air? (b) What is 


her highest point above the board? (c) What is her velocity when her feet 
hit the water? 


Exercise: 
Problem: 
(a) Calculate the height of a cliff if it takes 2.35 s for a rock to hit the 
ground when it is thrown straight up from the cliff with an initial 


velocity of 8.00 m/s. (b) How long would it take to reach the ground if it 
is thrown straight down with the same speed? 


Solution: 


(a) 8.26 m 


(b) 0.717 s 
Exercise: 
Problem: 
A very strong, but inept, shot putter puts the shot straight up vertically 
with an initial velocity of 11.0 m/s. How long does he have to get out of 


the way if the shot was released at a height of 2.20 m, and he is 1.80 m 
tall? 


Exercise: 
Problem: 
You throw a ball straight up with an initial velocity of 15.0 m/s. It passes 
a tree branch on the way up at a height of 7.00 m. How much additional 


time will pass before the ball passes the tree branch on the way back 
down? 


Solution: 


191s 
Exercise: 
Problem: 
A kangaroo can jump over an object 2.50 m high. (a) Calculate its 
vertical speed when it leaves the ground. (b) How long is it in the air? 


Exercise: 


Problem: 


Standing at the base of one of the cliffs of Mt. Arapiles in Victoria, 
Australia, a hiker hears a rock break loose from a height of 105 m. He 
can’t see the rock right away but then does, 1.50 s later. (a) How far 
above the hiker is the rock when he can see it? (b) How much time does 
he have to move before the rock hits his head? 


Solution: 
(a) 94.0 m 


(b) 3.13 s 
Exercise: 


Problem: 


An object is dropped from a height of 75.0 m above ground level. (a) 
Determine the distance traveled during the first second. (b) Determine 
the final velocity at which the object hits the ground. (c) Determine the 
distance traveled during the last second of motion before hitting the 
ground. 


Exercise: 
Problem: 
There is a 250-m-high cliff at Half Dome in Yosemite National Park in 
California. Suppose a boulder breaks loose from the top of this cliff. (a) 
How fast will it be going when it strikes the ground? (b) Assuming a 
reaction time of 0.300 s, how long will a tourist at the bottom have to get 
out of the way after hearing the sound of the rock breaking loose 


(neglecting the height of the tourist, which would become negligible 
anyway if hit)? The speed of sound is 335 m/s on this day. 


Solution: 
(a) -70.0 m/s (downward) 


(b) 6.10 s 


Exercise: 


Problem: 


A ball is thrown straight up. It passes a 2.00-m-high window 7.50 m off 
the ground on its path up and takes 0.312 s to go past the window. What 
was the ball’s initial velocity? Hint: First consider only the distance 
along the window, and solve for the ball's velocity at the bottom of the 
window. Next, consider only the distance from the ground to the bottom 
of the window, and solve for the initial velocity using the velocity at the 
bottom of the window as the final velocity. 


Exercise: 


Problem: 


Suppose you drop a rock into a dark well and, using precision 
equipment, you measure the time for the sound of a splash to return. (a) 
Neglecting the time required for sound to travel up the well, calculate the 
distance to the water if the sound returns in 2.0000 s. (b) Now calculate 
the distance taking into account the time for sound to travel up the well. 
The speed of sound is 332.00 m/s in this well. 


Solution: 
(a) 19.6 m 


(b) 18.5 m 
Exercise: 


Problem: 


A steel ball is dropped onto a hard floor from a height of 1.50 m and 
rebounds to a height of 1.45 m. (a) Calculate its velocity just before it 
strikes the floor. (b) Calculate its velocity just after it leaves the floor on 
its way back up. (c) Calculate its acceleration during contact with the 
floor if that contact lasts 0.0800 ms (8.00 x 10~° s). (d) How much did 
the ball compress during its collision with the floor, assuming the floor is 
absolutely rigid? 


Exercise: 


Problem: 


A coin is dropped from a hot-air balloon that is 300 m above the ground 
and rising at 10.0 m/s upward. For the coin, find (a) the maximum height 
reached, (b) its position and velocity 4.00 s after being released, and (c) 
the time before it hits the ground. 


Solution: 
(a) 305 m 
(b) 262 m, -29.2 m/s 


(c) 8.91 s 
Exercise: 


Problem: 


A soft tennis ball is dropped onto a hard floor from a height of 1.50 m 
and rebounds to a height of 1.10 m. (a) Calculate its velocity just before 
it strikes the floor. (b) Calculate its velocity just after it leaves the floor 
on its way back up. (c) Calculate its acceleration during contact with the 
floor if that contact lasts 3.50 ms (3.50 x 107° s). (d) How much did the 
ball compress during its collision with the floor, assuming the floor is 
absolutely rigid? 


Glossary 


free-fall 
the state of movement that results from gravitational force only 


acceleration due to gravity 
acceleration of an object as a result of gravity 


Graphical Analysis of One-Dimensional Motion 


e Describe a straight-line graph in terms of its slope and y-intercept. 

e Determine average velocity or instantaneous velocity from a graph of 
position vs. time. 

e Determine average or instantaneous acceleration from a graph of 
velocity vs. time. 

e Derive a graph of velocity vs. time from a graph of position vs. time. 

e Derive a graph of acceleration vs. time from a graph of velocity vs. 
time. 


A graph, like a picture, is worth a thousand words. Graphs not only contain 
numerical information; they also reveal relationships between physical 
quantities. This section uses graphs of position, velocity, and acceleration 
versus time to illustrate one-dimensional kinematics. 


Slopes and General Relationships 


First note that graphs in this text have perpendicular axes, one horizontal 
and the other vertical. When two physical quantities are plotted against one 
another in such a graph, the horizontal axis is usually considered to be an 
independent variable and the vertical axis a dependent variable. If we 
call the horizontal axis the x-axis and the vertical axis the y-axis, as in 
[link], a straight-line graph has the general form 

Equation: 


y= mx-+ b. 
Here m is the slope, defined to be the rise divided by the run (as seen in the 


figure) of the straight line. The letter 6 is used for the y-intercept, which is 
the point at which the line crosses the vertical axis. 


Intercept 


b 


A straight-line graph. The 
equation for a straight line is 
y=mx+b. 


Graph of Position vs. Time (a = 0, so v is constant) 


Time is usually an independent variable that other quantities, such as 
position, depend upon. A graph of position versus time would, thus, have x 
on the vertical axis and ¢ on the horizontal axis. [link] is just such a straight- 
line graph. It shows a graph of position versus time for a jet-powered car on 


a very flat dry lake bed in Nevada. 


Graph of position versus time for a jet-powered car 
on the Bonneville Salt Flats. 


Using the relationship between dependent and independent variables, we 


see that the slope in the graph above is average velocity v and the intercept 
is position at time zero—that is, 79. Substituting these symbols into 

y = mx + b gives 

Equation: 


z=vt+29 


or 
Equation: 


r= 2+ vt. 


Thus a graph of position versus time gives a general relationship among 
displacement(change in position), velocity, and time, as well as giving 
detailed numerical information about a specific situation. 


Note: 

The Slope of x vs. t 

The slope of the graph of position x vs. time ¢ is velocity v. 
Equation: 


Az 
slope = —— =v 


At 


Notice that this equation is the same as that derived algebraically from 
other motion equations in Motion Equations for Constant Acceleration in 
One Dimension. 


From the figure we can see that the car has a position of 25 m at 0.50 s and 
2000 m at 6.40 s. Its position at other times can be read from the graph; 
furthermore, information about its velocity and acceleration can also be 
obtained from the graph. 


Example: 

Determining Average Velocity from a Graph of Position versus Time: 
Jet Car 

Find the average velocity of the car whose position is graphed in [link]. 
Strategy 

The slope of a graph of x vs. t is average velocity, since slope equals rise 
over run. In this case, rise = change in position and run = change in time, 
so that 

Equation: 


Since the slope is constant here, any two points on the graph can be used to 
find the slope. (Generally speaking, it is most accurate to use two widely 
separated points on the straight line. This is because any error in reading 
data from the graph is proportionally smaller if the interval is larger.) 
Solution 

1. Choose two points on the line. In this case, we choose the points labeled 
on the graph: (6.4 s, 2000 m) and (0.50 s, 525 m). (Note, however, that you 
could choose any two points.) 

2. Substitute the x and ¢ values of the chosen points into the equation. 
Remember in calculating change (A) we always use final value minus 
initial value. 

Equation: 


om Az — 2000m—525m 
~ At 64s—0.50s ’ 

yielding 

Equation: 


v = 250 m/s. 


Discussion 

This is an impressively large land speed (900 km/h, or about 560 mi/h): 
much greater than the typical highway speed limit of 60 mi/h (27 m/s or 96 
km/h), but considerably shy of the record of 343 m/s (1234 km/h or 766 
mi/h) set in 1997. 


Graphs of Motion when a is constant but a + 0 


The graphs in [link] below represent the motion of the jet-powered car as it 
accelerates toward its top speed, but only during the time when its 
acceleration is constant. Time starts at zero for this motion (as if measured 
with a stopwatch), and the position and velocity are initially 200 m and 15 
m/s, respectively. 
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Graphs of motion of a jet- 
powered car during the time 
span when its acceleration is 

constant. (a) The slope of an x 
vs. t graph is velocity. This is 


shown at two points, and the 
instantaneous velocities 
obtained are plotted in the next 
graph. Instantaneous velocity at 
any point is the slope of the 
tangent at that point. (b) The 
slope of the v vs. ¢ graph is 
constant for this part of the 
motion, indicating constant 
acceleration. (c) Acceleration 
has the constant value of 
5.0 m/s” over the time interval 
plotted. 


A USS. Air Force jet car speeds 
down a track. (credit: Matt 
Trostle, Flickr) 


The graph of position versus time in [link](a) is a curve rather than a 
straight line. The slope of the curve becomes steeper as time progresses, 


showing that the velocity is increasing over time. The slope at any point on 
a position-versus-time graph is the instantaneous velocity at that point. It is 
found by drawing a straight line tangent to the curve at the point of interest 
and taking the slope of this straight line. Tangent lines are shown for two 
points in [link](a). If this is done at every point on the curve and the values 
are plotted against time, then the graph of velocity versus time shown in 
[link](b) is obtained. Furthermore, the slope of the graph of velocity versus 
time is acceleration, which is shown in [link](c). 


Example: 
Determining Instantaneous Velocity from the Slope at a Point: Jet Car 
Calculate the velocity of the jet car at a time of 25 s by finding the slope of 
the x vs. t graph in the graph below. 
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The slope of an z vs. t graph is 
velocity. This is shown at two 
points. Instantaneous velocity at 
any point is the slope of the 
tangent at that point. 


Strategy 

The slope of a curve at a point is equal to the slope of a straight line 
tangent to the curve at that point. This principle is illustrated in [link], 
where Q is the point at t = 25s. 

Solution 

1. Find the tangent line to the curve at t = 25s. 


2. Determine the endpoints of the tangent. These correspond to a position 
of 1300 m at time 19 s and a position of 3120 m at time 32 s. 

3. Plug these endpoints into the equation to solve for the slope, v. 
Equation: 


Azg  (3120m-— 1300 m) 


Lope = oe = —S = 
pee 82 “Rto (32s — 19s) 
Thus, 
Equation: 
1820 m 
— = 14 
VQ te 0 m/s 
Discussion 


This is the value given in this figure’s table for v at £ = 25 s. The value of 
140 m/s for vq is plotted in [link]. The entire graph of v vs. ¢ can be 
obtained in this fashion. 


Carrying this one step further, we note that the slope of a velocity versus 
time graph is acceleration. Slope is rise divided by run; on a v vs. ¢ graph, 
rise = change in velocity Av and run = change in time At. 


Note: 

The Slope of v vs. t 

The slope of a graph of velocity v vs. time t is acceleration a. 
Equation: 


l —_——— 
slope me a 


Since the velocity versus time graph in [link](b) is a straight line, its slope is 
the same everywhere, implying that acceleration is constant. Acceleration 
versus time is graphed in [link](c). 


Additional general information can be obtained from [link] and the 
expression for a straight line, y= mx + b. 


In this case, the vertical axis y is V, the intercept b is vo, the slope ™ is a, 
and the horizontal axis x is t. Substituting these symbols yields 
Equation: 


V=UVo + at. 


A general relationship for velocity, acceleration, and time has again been 
obtained from a graph. Notice that this equation was also derived 
algebraically from other motion equations in Motion Equations for Constant 
Acceleration in One Dimension. 


It is not accidental that the same equations are obtained by graphical 
analysis as by algebraic techniques. In fact, an important way to discover 
physical relationships is to measure various physical quantities and then 
make graphs of one quantity against another to see if they are correlated in 
any way. Correlations imply physical relationships and might be shown by 
smooth graphs such as those above. From such graphs, mathematical 
relationships can sometimes be postulated. Further experiments are then 
performed to determine the validity of the hypothesized relationships. 


Graphs of Motion Where Acceleration is Not Constant 


Now consider the motion of the jet car as it goes from 165 m/s to its top 
velocity of 250 m/s, graphed in [link]. Time again starts at zero, and the 
initial position and velocity are 2900 m and 165 m/s, respectively. (These 
were the final position and velocity of the car in the motion graphed in 
[link].) Acceleration gradually decreases from 5.0 m/ s” to zero when the 
car hits 250 m/s. The slope of the z vs. ¢ graph increases until t = 55s, 
after which time the slope is constant. Similarly, velocity increases until 55 


s and then becomes constant, since acceleration decreases to zero at 55s 
and remains zero afterward. 


Jet Car Position 
25 


20 


Position, x (km) 


Time, t (s) 


(a) 


Jet Car Velocity 


250 ae 


Velocity, v (m/s) 
So 


10 20 30 40 50 60 70 80 
Time, t (s) 


(b) 


Jet Car Acceleration 


Acceleration, a (m/s?) 
Sa & & we 


0 10 20 30 40 50 60 70 80 
Time, t (s) 


(c) 


Graphs of motion of a jet-powered car 
as it reaches its top velocity. This 
motion begins where the motion in 


[link] ends. (a) The slope of this graph 
is velocity; it is plotted in the next 
graph. (b) The velocity gradually 

approaches its top value. The slope of 

this graph is acceleration; it is plotted 
in the final graph. (c) Acceleration 
gradually declines to zero when 
velocity becomes constant. 


Example: 

Calculating Acceleration from a Graph of Velocity versus Time 
Calculate the acceleration of the jet car at a time of 25 s by finding the 
slope of the v vs. ¢ graph in [link](b). 

Strategy 

The slope of the curve at t = 25 s is equal to the slope of the line tangent 
at that point, as illustrated in [link ](b). 

Solution 

Determine endpoints of the tangent line from the figure, and then plug 
them into the equation to solve for slope, a. 


Equation: 

\ Av (260 m/s — 210 m/s) 

siope = OO EE oar 

At (51s — 1.0s) 
Equation: 
50 
ih ous = 1.0 m/s’. 
50s 

Discussion 


Note that this value for a is consistent with the value plotted in [link ](c) at 
i — 20 5 


A graph of position versus time can be used to generate a graph of velocity 
versus time, and a graph of velocity versus time can be used to generate a 
graph of acceleration versus time. We do this by finding the slope of the 
graphs at every point. If the graph is linear (i.e., a line with a constant 
slope), it is easy to find the slope at any point and you have the slope for 
every point. Graphical analysis of motion can be used to describe both 
specific and general characteristics of kinematics. Graphs can also be used 
for other topics in physics. An important aspect of exploring physical 
relationships is to graph them and look for underlying relationships. 
Exercise: 

Check Your Understanding 


Problem: 
A graph of velocity vs. time of a ship coming into a harbor is shown 


below. (a) Describe the motion of the ship based on the graph. (b)What 
would a graph of the ship’s acceleration look like? 


Solution: 


(a) The ship moves at constant velocity and then begins to decelerate 
at a constant rate. At some point, its deceleration rate decreases. It 
maintains this lower deceleration rate until it stops moving. 


(b) A graph of acceleration vs. time would show zero acceleration in 
the first leg, large and constant negative acceleration in the second leg, 
and constant negative acceleration. 


a 


Section Summary 


¢ Graphs of motion can be used to analyze motion. 

e Graphical solutions yield identical solutions to mathematical methods 
for deriving motion equations. 

¢ The slope of a graph of displacement z vs. time ¢ is velocity v. 

e The slope of a graph of velocity v vs. time t graph is acceleration a. 

e Average velocity, instantaneous velocity, and acceleration can all be 
obtained by analyzing graphs. 


Conceptual Questions 


Exercise: 


Problem: 


(a) Explain how you can use the graph of position versus time in [Link] 
to describe the change in velocity over time. Identify (b) the time (¢q, 
tp, tc, ta, or te) at which the instantaneous velocity is greatest, (c) the 
time at which it is zero, and (d) the time at which it is negative. 


| 


Position x 


Time t 


Exercise: 


Problem: 


(a) Sketch a graph of velocity versus time corresponding to the graph 
of position versus time given in [link]. (b) Identify the time or times ( 
ta, tp, tc, etc.) at which the instantaneous velocity is greatest. (c) At 
which times is it zero? (d) At which times is it negative? 


Position x 


Time t 


Exercise: 
Problem: 
(a) Explain how you can determine the acceleration over time from a 


velocity versus time graph such as the one in [link]. (b) Based on the 
graph, how does acceleration change over time? 


Velocity v 


Time t 


Exercise: 


Problem: 


(a) Sketch a graph of acceleration versus time corresponding to the 
graph of velocity versus time given in [link]. (b) Identify the time or 
times (t,, tp, t-, etc.) at which the acceleration is greatest. (c) At which 
times is it zero? (d) At which times is it negative? 


Velocity v 


Time t 


Exercise: 


Problem: 


Consider the velocity vs. time graph of a person in an elevator shown 
in [link]. Suppose the elevator is initially at rest. It then accelerates for 
3 seconds, maintains that velocity for 15 seconds, then decelerates for 
5 seconds until it stops. The acceleration for the entire trip is not 
constant so we cannot use the equations of motion from Motion 
Equations for Constant Acceleration in One Dimension for the 
complete trip. (We could, however, use them in the three individual 
sections where acceleration is a constant.) Sketch graphs of (a) 
position vs. time and (b) acceleration vs. time for this trip. 


Velocity vs. Time | 


Velocity v (m/s) 


0 5 10 15 20 25 
Time ¢(s) 


Exercise: 


Problem: 


A cylinder is given a push and then rolls up an inclined plane. If the 
origin is the starting point, sketch the position, velocity, and 


acceleration of the cylinder vs. time as it goes up and then down the 
plane. 


Problems & Exercises 


Note: There is always uncertainty in numbers taken from graphs. If your 
answers differ from expected values, examine them to see if they are within 
data extraction uncertainties estimated by you. 

Exercise: 


Problem: 


(a) By taking the slope of the curve in [link], verify that the velocity of 
the jet car is 115 m/s at t = 20 s. (b) By taking the slope of the curve 
at any point in [link], verify that the jet car’s acceleration is 5.0 m/ 57, 


Position vs. Time 


0 5 10 15 20 25 30 35) 
Time (s) 


Velocity vs. Time 


o 
A) 


10 15 20 25 30 35 
Time (s) 


Solution: 
(a) 115 m/s 


(b) 5.0 m/s” 
Exercise: 
Problem: 
Using approximate values, calculate the slope of the curve in [link] to 


verify that the velocity at ¢ = 10.0 s is 0.208 m/s. Assume all values 
are known to 3 significant figures. 


Position vs. Time 


25 
20 


Position (m) 


Exercise: 


Problem: 


Using approximate values, calculate the slope of the curve in [link] to 
verify that the velocity at £ = 30.0 s is approximately 0.24 m/s. 


Solution: 


Equation: 
11.7 — 6.95) x 10° 
v= (11.7 — 6.95) x 10° m — 238 m/s 
(40.0 — 20.0) S 
Exercise: 
Problem: 


By taking the slope of the curve in [link], verify that the acceleration is 
3.2 m/s” att = 10s. 
Velocity vs. Time 
300 
250 


Velocity (m/s) 
= = iw) 
wv oS W So 
aS 6 eo 6 


Exercise: 
Problem: 
Construct the position graph for the subway shuttle train as shown in 
[link](a). Your graph should show the position of the train, in 


kilometers, from t = 0 to 20 s. You will need to use the information on 
acceleration and velocity given in the examples for this figure. 


Solution: 


Position vs. Time 


0 5 10 15 20 A) 
Time (s) 


Exercise: 
Problem: 
(a) Take the slope of the curve in [link] to find the jogger’s velocity at 


t = 2.5 s. (b) Repeat at 7.5 s. These values must be consistent with the 
graph in [link]. 


Position vs. Time 
30 
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5) 
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Acceleration vs. Time 
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Exercise: 
Problem: 


A graph of v(t) is shown for a world-class track sprinter in a 100-m 
race. (See [link]). (a) What is his average velocity for the first 4 s? (b) 
What is his instantaneous velocity at £ = 5 s? (c) What is his average 
acceleration between 0 and 4 s? (d) What is his time for the race? 


Runner Velocity vs. Time 
14 
12 
@ 10 
é 8 
£ 
B 4 
2 
0 
0 3 4 6 8 10 2 
Time (s) 
Solution: 
(a) 6 m/s 
(b) 12 m/s 
2 
(c)3 m/s 


(d) 10s 


Exercise: 


Problem: 


[link] shows the position graph for a particle for 5 s. Draw the 
corresponding velocity and acceleration graphs. 


Position vs. Time 


3 
2 
1 
0 


Position (m) 


Time (s) 


Glossary 


independent variable 


the variable that the dependent variable is measured with respect to; 
usually plotted along the x-axis 


dependent variable 
the variable that is being measured; usually plotted along the y-axis 


slope 


the difference in y-value (the rise) divided by the difference in x-value 
(the run) of two points on a straight line 


y-intercept 
the y-value when x= 0, or when the graph crosses the y-axis 


Introduction to Dynamics: Newton’s Laws of Motion 
class="introduction" 


Newton’ 
s laws of 
motion 
describe 
the 
motion 
of the 
dolphin’s 
path. 
(credit: 
Jin Jang) 


Motion draws our attention. Motion itself can be beautiful, causing us to 
marvel at the forces needed to achieve spectacular motion, such as that of a 


dolphin jumping out of the water, or a pole vaulter, or the flight of a bird, or 
the orbit of a satellite. The study of motion is kinematics, but kinematics 
only describes the way objects move—their velocity and their acceleration. 
Dynamics considers the forces that affect the motion of moving objects and 
systems. Newton’s laws of motion are the foundation of dynamics. These 
laws provide an example of the breadth and simplicity of principles under 
which nature functions. They are also universal laws in that they apply to 
similar situations on Earth as well as in space. 


Isaac Newton’s (1642-1727) laws of motion were just one part of the 
monumental work that has made him legendary. The development of 
Newton’s laws marks the transition from the Renaissance into the modern 
era. This transition was characterized by a revolutionary change in the way 
people thought about the physical universe. For many centuries natural 
philosophers had debated the nature of the universe based largely on certain 
rules of logic with great weight given to the thoughts of earlier classical 
philosophers such as Aristotle (884-322 BC). Among the many great 
thinkers who contributed to this change were Newton and Galileo. 


| PHILOSOPHIE |, 


NATURALIS — |] ° 


Isaac Newton’s 
monumental work, 
Philosophiae 
Naturalis Principia 
Mathematica, was 
published in 1687. It 
proposed scientific 


laws that are still 
used today to 
describe the motion 
of objects. (credit: 
Service commun de 
la documentation de 
l'Université de 
Strasbourg) 


Galileo was instrumental in establishing observation as the absolute 
determinant of truth, rather than “logical” argument. Galileo’s use of the 
telescope was his most notable achievement in demonstrating the 
importance of observation. He discovered moons orbiting Jupiter and made 
other observations that were inconsistent with certain ancient ideas and 
religious dogma. For this reason, and because of the manner in which he 
dealt with those in authority, Galileo was tried by the Inquisition and 
punished. He spent the final years of his life under a form of house arrest. 
Because others before Galileo had also made discoveries by observing the 
nature of the universe, and because repeated observations verified those of 
Galileo, his work could not be suppressed or denied. After his death, his 
work was verified by others, and his ideas were eventually accepted by the 
church and scientific communities. 


Galileo also contributed to the formation of what is now called Newton’s 
first law of motion. Newton made use of the work of his predecessors, 
which enabled him to develop laws of motion, discover the law of gravity, 
invent calculus, and make great contributions to the theories of light and 
color. It is amazing that many of these developments were made with 
Newton working alone, without the benefit of the usual interactions that 
take place among scientists today. 


It was not until the advent of modern physics early in the 20th century that 
it was discovered that Newton’s laws of motion produce a good 
approximation to motion only when the objects are moving at speeds much, 
much less than the speed of light and when those objects are larger than the 


size of most molecules (about 10°? m in diameter). These constraints 
define the realm of classical mechanics, as discussed in Introduction to the 
Nature of Science and Physics. At the beginning of the 20" century, Albert 
Einstein (1879-1955) developed the theory of relativity and, along with 
many other scientists, developed quantum theory. This theory does not have 
the constraints present in classical physics. All of the situations we consider 
in this chapter, and all those preceding the introduction of relativity in 
Special Relativity, are in the realm of classical physics. 


Note: 

Making Connections: Past and Present Philosophy 

The importance of observation and the concept of cause and effect were 
not always so entrenched in human thinking. This realization was a part of 
the evolution of modern physics from natural philosophy. The 
achievements of Galileo, Newton, Einstein, and others were key milestones 
in the history of scientific thought. Most of the scientific theories that are 
described in this book descended from the work of these scientists. 


Development of Force Concept 
e Understand the definition of force. 


Dynamics is the study of the forces that cause objects and systems to move. 
To understand this, we need a working definition of force. Our intuitive 
definition of force—that is, a push or a pull—is a good place to start. We 
know that a push or pull has both magnitude and direction (therefore, it is a 
vector quantity) and can vary considerably in each regard. For example, a 
cannon exerts a strong force on a cannonball that is launched into the air. In 
contrast, Earth exerts only a tiny downward pull on a flea. Our everyday 
experiences also give us a good idea of how multiple forces add. If two 
people push in different directions on a third person, as illustrated in [link], 
we might expect the total force to be in the direction shown. Since force is a 
vector, it adds just like other vectors, as illustrated in [link](a) for two ice 
skaters. Forces, like other vectors, are represented by arrows and can be 
added using the familiar head-to-tail method or by trigonometric methods. 
These ideas were developed in Two-Dimensional Kinematics. 


Free-body diagram 


(a) (b) 


Part (a) shows an overhead view of two ice 
skaters pushing on a third. Forces are 
vectors and add like other vectors, so the 
total force on the third skater is in the 
direction shown. In part (b), we see a free- 
body diagram representing the forces acting 
on the third skater. 


[link](b) is our first example of a free-body diagram, which is a technique 
used to illustrate all the external forces acting on a body. The body is 
represented by a single isolated point (or free body), and only those forces 
acting on the body from the outside (external forces) are shown. (These 
forces are the only ones shown, because only external forces acting on the 
body affect its motion. We can ignore any internal forces within the body.) 
Free-body diagrams are very useful in analyzing forces acting on a system 
and are employed extensively in the study and application of Newton’s laws 
of motion. 


A more quantitative definition of force can be based on some standard 
force, just as distance is measured in units relative to a standard distance. 
One possibility is to stretch a spring a certain fixed distance, as illustrated in 
[link], and use the force it exerts to pull itself back to its relaxed shape— 
called a restoring force—as a standard. The magnitude of all other forces 
can be stated as multiples of this standard unit of force. Many other 
possibilities exist for standard forces. (One that we will encounter in 
Magnetism is the magnetic force between two wires carrying electric 
current.) Some alternative definitions of force will be given later in this 
chapter. 


be 


(a) 
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(c) 


The force exerted by a stretched spring can 
be used as a standard unit of force. (a) This 
spring has a length x when undistorted. (b) 
When stretched a distance Az, the spring 
exerts a restoring force, Fyestore, which is 
reproducible. (c) A spring scale is one 
device that uses a spring to measure force. 
The force Fyestore is exerted on whatever is 
attached to the hook. Here Fyrestore has a 


magnitude of 6 units in the force standard 
being employed. 


Note: 

Take-Home Experiment: Force Standards 

To investigate force standards and cause and effect, get two identical 
rubber bands. Hang one rubber band vertically on a hook. Find a small 
household item that could be attached to the rubber band using a paper 
clip, and use this item as a weight to investigate the stretch of the rubber 
band. Measure the amount of stretch produced in the rubber band with one, 
two, and four of these (identical) items suspended from the rubber band. 
What is the relationship between the number of items and the amount of 
stretch? How large a stretch would you expect for the same number of 
items suspended from two rubber bands? What happens to the amount of 
stretch of the rubber band (with the weights attached) if the weights are 
also pushed to the side with a pencil? 


Section Summary 


e Dynamics is the study of how forces affect the motion of objects. 

e Force is a push or pull that can be defined in terms of various 
standards, and it is a vector having both magnitude and direction. 

e External forces are any outside forces that act on a body. A free-body 
diagram is a drawing of all external forces acting on a body. 


Conceptual Questions 


Exercise: 


Problem: 


Propose a force standard different from the example of a stretched 
spring discussed in the text. Your standard must be capable of 
producing the same force repeatedly. 


Exercise: 


Problem: 


What properties do forces have that allow us to classify them as 
vectors? 


Glossary 


dynamics 
the study of how forces affect the motion of objects and systems 


external force 
a force acting on an object or system that originates outside of the 
object or system 


free-body diagram 
a sketch showing all of the external forces acting on an object or 
system; the system is represented by a dot, and the forces are 
represented by vectors extending outward from the dot 


force 
a push or pull on an object with a specific magnitude and direction; 
can be represented by vectors; can be expressed as a multiple of a 
standard force 


Newton’s First Law of Motion: Inertia 


e Define mass and inertia. 
e Understand Newton's first law of motion. 


Experience suggests that an object at rest will remain at rest if left alone, 
and that an object in motion tends to slow down and stop unless some effort 
is made to keep it moving. What Newton’s first law of motion states, 
however, is the following: 


Note: 

Newton’s First Law of Motion 

A body at rest remains at rest, or, if in motion, remains in motion at a 
constant velocity unless acted on by a net external force. 


Note the repeated use of the verb “remains.” We can think of this law as 
preserving the status quo of motion. 


Rather than contradicting our experience, Newton’s first law of motion 
States that there must be a cause (which is a net external force) for there to 
be any change in velocity (either a change in magnitude or direction). We 
will define net external force in the next section. An object sliding across a 
table or floor slows down due to the net force of friction acting on the 
object. If friction disappeared, would the object still slow down? 


The idea of cause and effect is crucial in accurately describing what 
happens in various situations. For example, consider what happens to an 
object sliding along a rough horizontal surface. The object quickly grinds to 
a halt. If we spray the surface with talcum powder to make the surface 
smoother, the object slides farther. If we make the surface even smoother by 
rubbing lubricating oil on it, the object slides farther yet. Extrapolating to a 
frictionless surface, we can imagine the object sliding in a straight line 
indefinitely. Friction is thus the cause of the slowing (consistent with 
Newton’s first law). The object would not slow down at all if friction were 


completely eliminated. Consider an air hockey table. When the air is turned 
off, the puck slides only a short distance before friction slows it to a stop. 
However, when the air is turned on, it creates a nearly frictionless surface, 
and the puck glides long distances without slowing down. Additionally, if 
we know enough about the friction, we can accurately predict how quickly 
the object will slow down. Friction is an external force. 


Newton’s first law is completely general and can be applied to anything 
from an object sliding on a table to a satellite in orbit to blood pumped from 
the heart. Experiments have thoroughly verified that any change in velocity 
(speed or direction) must be caused by an external force. The idea of 
generally applicable or universal laws is important not only here—it is a 
basic feature of all laws of physics. Identifying these laws is like 
recognizing patterns in nature from which further patterns can be 
discovered. The genius of Galileo, who first developed the idea for the first 
law, and Newton, who clarified it, was to ask the fundamental question, 
“What is the cause?” Thinking in terms of cause and effect is a worldview 
fundamentally different from the typical ancient Greek approach when 
questions such as “Why does a tiger have stripes?” would have been 
answered in Aristotelian fashion, “That is the nature of the beast.” True 
perhaps, but not a useful insight. 


Mass 


The property of a body to remain at rest or to remain in motion with 
constant velocity is called inertia. Newton’s first law is often called the law 
of inertia. As we know from experience, some objects have more inertia 
than others. It is obviously more difficult to change the motion of a large 
boulder than that of a basketball, for example. The inertia of an object is 
measured by its mass. Roughly speaking, mass is a measure of the amount 
of “stuff” (or matter) in something. The quantity or amount of matter in an 
object is determined by the numbers of atoms and molecules of various 
types it contains. Unlike weight, mass does not vary with location. The 
mass of an object is the same on Earth, in orbit, or on the surface of the 
Moon. In practice, it is very difficult to count and identify all of the atoms 
and molecules in an object, so masses are not often determined in this 


manner. Operationally, the masses of objects are determined by comparison 
with the standard kilogram. 

Exercise: 

Check Your Understanding 


Problem: 


Which has more mass: a kilogram of cotton balls or a kilogram of 
gold? 


Solution: 
Answer 


They are equal. A kilogram of one substance is equal in mass to a 
kilogram of another substance. The quantities that might differ 
between them are volume and density. 


Section Summary 


e Newton’s first law of motion states that a body at rest remains at rest, 
or, if in motion, remains in motion at a constant velocity unless acted 
on by a net external force. This is also known as the law of inertia. 

e Inertia is the tendency of an object to remain at rest or remain in 
motion. Inertia is related to an object’s mass. 

e Mass is the quantity of matter in a substance. 


Conceptual Questions 
Exercise: 
Problem: How are inertia and mass related? 


Exercise: 


Problem: 


What is the relationship between weight and mass? Which is an 
intrinsic, unchanging property of a body? 


Glossary 


inertia 
the tendency of an object to remain at rest or remain in motion 


law of inertia 
see Newton’s first law of motion 


mass 
the quantity of matter in a substance; measured in kilograms 


Newton’s first law of motion 
a body at rest remains at rest, or, if in motion, remains in motion at a 
constant velocity unless acted on by a net external force; also known 
as the law of inertia 


Newton’s Second Law of Motion: Concept of a System 


e Define net force, external force, and system. 
e Understand Newton’s second law of motion. 
e Apply Newton’s second law to determine the weight of an object. 


Newton’s second law of motion is closely related to Newton’s first law of 
motion. It mathematically states the cause and effect relationship between 
force and changes in motion. Newton’s second law of motion is more 
quantitative and is used extensively to calculate what happens in situations 
involving a force. Before we can write down Newton’s second law as a 
simple equation giving the exact relationship of force, mass, and 
acceleration, we need to sharpen some ideas that have already been 
mentioned. 


First, what do we mean by a change in motion? The answer is that a change 
in motion is equivalent to a change in velocity. A change in velocity means, 
by definition, that there is an acceleration. Newton’s first law says that a 
net external force causes a change in motion; thus, we see that a net 
external force causes acceleration. 


Another question immediately arises. What do we mean by an external 
force? An intuitive notion of external is correct—an external force acts 
from outside the system of interest. For example, in [link](a) the system of 
interest is the wagon plus the child in it. The two forces exerted by the other 
children are external forces. An internal force acts between elements of the 
system. Again looking at [link](a), the force the child in the wagon exerts to 
hang onto the wagon is an internal force between elements of the system of 
interest. Only external forces affect the motion of a system, according to 
Newton’s first law. (The internal forces actually cancel, as we shall see in 
the next section.) You must define the boundaries of the system before you 
can determine which forces are external. Sometimes the system is obvious, 
whereas other times identifying the boundaries of a system is more subtle. 
The concept of a system is fundamental to many areas of physics, as is the 
correct application of Newton’s laws. This concept will be revisited many 
times on our journey through physics. 


Free-body diagram 


on the system 
adds to produce 


a net force, Fret. 


Each force acting Fret ae 
w/iN 


Different forces exerted on the same mass 
produce different accelerations. (a) Two 
children push a wagon with a child in it. 

Arrows representing all external forces are 

shown. The system of interest is the wagon 
and its rider. The weight w of the system 
and the support of the ground N are also 

shown for completeness and are assumed to 
cancel. The vector f represents the friction 
acting on the wagon, and it acts to the left, 
opposing the motion of the wagon. (b) All of 
the external forces acting on the system add 
together to produce a net force, Fyet. The 
free-body diagram shows all of the forces 
acting on the system of interest. The dot 
represents the center of mass of the system. 
Each force vector extends from this dot. 
Because there are two forces acting to the 
right, we draw the vectors collinearly. (c) A 
larger net external force produces a larger 


acceleration (af > a) when an adult pushes 
the child. 


Now, it seems reasonable that acceleration should be directly proportional 
to and in the same direction as the net (total) external force acting on a 
system. This assumption has been verified experimentally and is illustrated 
in [link]. In part (a), a smaller force causes a smaller acceleration than the 
larger force illustrated in part (c). For completeness, the vertical forces are 
also shown; they are assumed to cancel since there is no acceleration in the 
vertical direction. The vertical forces are the weight w and the support of 
the ground N, and the horizontal force f represents the force of friction. 
These will be discussed in more detail in later sections. For now, we will 
define friction as a force that opposes the motion past each other of objects 
that are touching. [link](b) shows how vectors representing the external 
forces add together to produce a net force, Fret. 


To obtain an equation for Newton’s second law, we first write the 
relationship of acceleration and net external force as the proportionality 
Equation: 


ac Bes 


where the symbol « means “proportional to,” and F yet is the net external 
force. (The net external force is the vector sum of all external forces and 
can be determined graphically, using the head-to-tail method, or 
analytically, using components. The techniques are the same as for the 
addition of other vectors, and are covered in Two-Dimensional Kinematics. ) 
This proportionality states what we have said in words—acceleration is 
directly proportional to the net external force. Once the system of interest is 
chosen, it is important to identify the external forces and ignore the internal 
ones. It is a tremendous simplification not to have to consider the numerous 
internal forces acting between objects within the system, such as muscular 
forces within the child’s body, let alone the myriad of forces between atoms 
in the objects, but by doing so, we can easily solve some very complex 
problems with only minimal error due to our simplification 


Now, it also seems reasonable that acceleration should be inversely 
proportional to the mass of the system. In other words, the larger the mass 
(the inertia), the smaller the acceleration produced by a given force. And 
indeed, as illustrated in [link], the same net external force applied to a car 
produces a much smaller acceleration than when applied to a basketball. 
The proportionality is written as 

Equation: 


where ™ is the mass of the system. Experiments have shown that 
acceleration is exactly inversely proportional to mass, just as it is exactly 
‘tab proportional to the net external force. 


all (eee 


(a) (b) 
The free-body diagrams for both objects are the same. 
i all — 


(c) 


The same force exerted on systems of 
different masses produces different 
accelerations. (a) A basketball player pushes 
on a basketball to make a pass. (The effect 
of gravity on the ball is ignored.) (b) The 
same player exerts an identical force on a 
stalled SUV and produces a far smaller 
acceleration (even if friction is negligible). 
(c) The free-body diagrams are identical, 
permitting direct comparison of the two 
situations. A series of patterns for the free- 
body diagram will emerge as you do more 
problems. 


It has been found that the acceleration of an object depends only on the net 
external force and the mass of the object. Combining the two 
proportionalities just given yields Newton's second law of motion. 


Note: 

Newton’s Second Law of Motion 

The acceleration of a system is directly proportional to and in the same 
direction as the net external force acting on the system, and inversely 
proportional to its mass. 

In equation form, Newton’s second law of motion is 

Equation: 


F net 
m 


aL 


This is often written in the more familiar form 
Equation: 


F net = ma. 


When only the magnitude of force and acceleration are considered, this 
equation is simply 
Equation: 


Pima. 


Although these last two equations are really the same, the first gives more 
insight into what Newton’s second law means. The law is a cause and effect 
relationship among three quantities that is not simply based on their 
definitions. The validity of the second law is completely based on 
experimental verification. 


Units of Force 


F net = ma is used to define the units of force in terms of the three basic 
units for mass, length, and time. The SI unit of force is called the newton 
(abbreviated N) and is the force needed to accelerate a 1-kg system at the 
rate of 1m/ s”. That is, since Fnct = ma, 

Equation: 


1N=1kg-m/s’. 


While almost the entire world uses the newton for the unit of force, in the 
United States the most familiar unit of force is the pound (lb), where 1 N = 
0.225 lb. 


Weight and the Gravitational Force 


When an object is dropped, it accelerates toward the center of Earth. 
Newton’s second law states that a net force on an object is responsible for 
its acceleration. If air resistance is negligible, the net force on a falling 
object is the gravitational force, commonly called its weight w. Weight can 
be denoted as a vector w because it has a direction; down is, by definition, 
the direction of gravity, and hence weight is a downward force. The 
magnitude of weight is denoted as w. Galileo was instrumental in showing 
that, in the absence of air resistance, all objects fall with the same 
acceleration g. Using Galileo’s result and Newton’s second law, we can 
derive an equation for weight. 


Consider an object with mass m falling downward toward Earth. It 
experiences only the downward force of gravity, which has magnitude w. 
Newton’s second law states that the magnitude of the net external force on 
an object is Pye, = ma. 


Since the object experiences only the downward force of gravity, Fret = w. 
We know that the acceleration of an object due to gravity is g, or a = g. 
Substituting these into Newton’s second law gives 


Note: 

Weight 

This is the equation for weight—the gravitational force on a mass ™: 
Equation: 


w = mg. 


Since g = 9.80 m/ s* on Earth, the weight of a 1.0 kg object on Earth is 
9.8 N, as we see: 
Equation: 


w = mg = (1.0 kg)(9.80 m/s”) = 9.8N. 


Recall that g can take a positive or negative value, depending on the 
positive direction in the coordinate system. Be sure to take this into 
consideration when solving problems with weight. 


When the net external force on an object is its weight, we say that it is in 
free-fall. That is, the only force acting on the object is the force of gravity. 
In the real world, when objects fall downward toward Earth, they are never 
truly in free-fall because there is always some upward force from the air 
acting on the object. 


The acceleration due to gravity g varies slightly over the surface of Earth, 
so that the weight of an object depends on location and is not an intrinsic 
property of the object. Weight varies dramatically if one leaves Earth’s 
surface. On the Moon, for example, the acceleration due to gravity is only 
1.67 m/s”. A 1.0-kg mass thus has a weight of 9.8 N on Earth and only 
about 1.7 N on the Moon. 


The broadest definition of weight in this sense is that the weight of an 
object is the gravitational force on it from the nearest large body, such as 
Earth, the Moon, the Sun, and so on. This is the most common and useful 
definition of weight in physics. It differs dramatically, however, from the 
definition of weight used by NASA and the popular media in relation to 
space travel and exploration. When they speak of “weightlessness” and 


“microgravity,” they are really referring to the phenomenon we call “free- 
fall” in physics. We shall use the above definition of weight, and we will 
make careful distinctions between free-fall and actual weightlessness. 


It is important to be aware that weight and mass are very different physical 
quantities, although they are closely related. Mass is the quantity of matter 
(how much “stuff’”) and does not vary in classical physics, whereas weight 
is the gravitational force and does vary depending on gravity. It is tempting 
to equate the two, since most of our examples take place on Earth, where 
the weight of an object only varies a little with the location of the object. 
Furthermore, the terms mass and weight are used interchangeably in 
everyday language; for example, our medical records often show our 
“weight” in kilograms, but never in the correct units of newtons. 


Note: 

Common Misconceptions: Mass vs. Weight 

Mass and weight are often used interchangeably in everyday language. 
However, in science, these terms are distinctly different from one another. 
Mass is a measure of how much matter is in an object. The typical measure 
of mass is the kilogram (or the “slug” in English units). Weight, on the 
other hand, is a measure of the force of gravity acting on an object. Weight 
is equal to the mass of an object (m) multiplied by the acceleration due to 
gravity (g). Like any other force, weight is measured in terms of newtons 
(or pounds in English units). 

Assuming the mass of an object is kept intact, it will remain the same, 
regardless of its location. However, because weight depends on the 
acceleration due to gravity, the weight of an object can change when the 
object enters into a region with stronger or weaker gravity. For example, 
the acceleration due to gravity on the Moon is 1.67 m/ 5° (which is much 


less than the acceleration due to gravity on Earth, 9.80 m/ ay If you 
measured your weight on Earth and then measured your weight on the 
Moon, you would find that you “weigh” much less, even though you do 
not look any skinnier. This is because the force of gravity is weaker on the 
Moon. In fact, when people say that they are “losing weight,” they really 


mean that they are losing “mass” (which in turn causes them to weigh 
less). 


Note: 

Take-Home Experiment: Mass and Weight 

What do bathroom scales measure? When you stand on a bathroom scale, 
what happens to the scale? It depresses slightly. The scale contains springs 
that compress in proportion to your weight—similar to rubber bands 
expanding when pulled. The springs provide a measure of your weight (for 
an object which is not accelerating). This is a force in newtons (or pounds). 
In most countries, the measurement is divided by 9.80 to give a reading in 
mass units of kilograms. The scale measures weight but is calibrated to 
provide information about mass. While standing on a bathroom scale, push 
down on a table next to you. What happens to the reading? Why? Would 
your scale measure the same “mass” on Earth as on the Moon? 


Example: 

What Acceleration Can a Person Produce when Pushing a Lawn 
Mower? 

Suppose that the net external force (push minus friction) exerted on a lawn 
mower is 51 N (about 11 |b) parallel to the ground. The mass of the mower 
is 24 kg. What is its acceleration? 


The net force on a lawn mower is 51 


N to the right. At what rate does the 
lawn mower accelerate to the right? 


Strategy 
Since F’,¢¢ and m are given, the acceleration can be calculated directly 
from Newton’s second law as stated in F,., = ma. 


Solution 
The magnitude of the acceleration a is a = Fae . Entering known values 
gives 
Equation: 

51 N 

= —— 

24 kg 
Substituting the units kg - m/s” for N yields 
Equation: 

51 kg- m/s 
C= dl kg: m/s” — 2.1 m/s”. 
24 kg 

Discussion 


The direction of the acceleration is the same direction as that of the net 
force, which is parallel to the ground. There is no information given in this 
example about the individual external forces acting on the system, but we 
can say something about their relative magnitudes. For example, the force 
exerted by the person pushing the mower must be greater than the friction 
opposing the motion (since we know the mower moves forward), and the 
vertical forces must cancel if there is to be no acceleration in the vertical 
direction (the mower is moving only horizontally). The acceleration found 
is small enough to be reasonable for a person pushing a mower. Such an 
effort would not last too long because the person’s top speed would soon 
be reached. 


Example: 


What Rocket Thrust Accelerates This Sled? 

Prior to manned space flights, rocket sleds were used to test aircraft, 
missile equipment, and physiological effects on human subjects at high 
speeds. They consisted of a platform that was mounted on one or two rails 
and propelled by several rockets. Calculate the magnitude of force exerted 
by each rocket, called its thrust 'T, for the four-rocket propulsion system 
shown in [link]. The sled’s initial acceleration is 49 m/ a the mass of the 
system is 2100 kg, and the force of friction opposing the motion is known 
to be 650 N. 


A sled experiences a rocket thrust 
that accelerates it to the right. Each 
rocket creates an identical thrust T 
. As in other situations where there 
is only horizontal acceleration, the 
vertical forces cancel. The ground 
exerts an upward force N on the 
system that is equal in magnitude 
and opposite in direction to its 
weight, w. The system here is the 
sled, its rockets, and rider, so none 
of the forces between these objects 
are considered. The arrow 
representing friction (f) is drawn 
larger than scale. 


Strategy 

Although there are forces acting vertically and horizontally, we assume the 
vertical forces cancel since there is no vertical acceleration. This leaves us 
with only horizontal forces and a simpler one-dimensional problem. 
Directions are indicated with plus or minus signs, with right taken as the 
positive direction. See the free-body diagram in the figure. 

Solution 

Since acceleration, mass, and the force of friction are given, we start with 
Newton’s second law and look for ways to find the thrust of the engines. 
Since we have defined the direction of the force and acceleration as acting 
“to the right,” we need to consider only the magnitudes of these quantities 
in the calculations. Hence we begin with 

Equation: 


Boe = ma, 


where Fret is the net force along the horizontal direction. We can see from 
[link] that the engine thrusts add, while friction opposes the thrust. In 
equation form, the net external force is 

Equation: 


Fret = 4T — f. 


Substituting this into Newton’s second law gives 
Equation: 


Fyet = ma = 4T — f. 


Using a little algebra, we solve for the total thrust 4T: 
Equation: 


AT = ma¢ f. 


Substituting known values yields 
Equation: 


4T = ma+ f = (2100 kg)(49 m/s”) + 650 N. 


So the total thrust is 
Equation: 


AT = 1.0 x 10°N, 


and the individual thrusts are 
Equation: 


_ 10x 10°N 


—2.6x10°N. 
i 6 x 10 


T 


Discussion 

The numbers are quite large, so the result might surprise you. Experiments 
such as this were performed in the early 1960s to test the limits of human 
endurance and the setup designed to protect human subjects in jet fighter 
emergency ejections. Speeds of 1000 km/h were obtained, with 
accelerations of 45 g's. (Recall that g, the acceleration due to gravity, is 
9.80 m/s”. When we say that an acceleration is 45 g's, it is 459.80 m/s” 


, which is approximately 440 m/ 5”) While living subjects are not used 
any more, land speeds of 10,000 km/h have been obtained with rocket 
sleds. In this example, as in the preceding one, the system of interest is 
obvious. We will see in later examples that choosing the system of interest 
is crucial—and the choice is not always obvious. 

Newton’s second law of motion is more than a definition; it is a 
relationship among acceleration, force, and mass. It can help us make 
predictions. Each of those physical quantities can be defined 
independently, so the second law tells us something basic and universal 
about nature. The next section introduces the third and final law of motion. 


Section Summary 


e Acceleration, a, is defined as a change in velocity, meaning a change 
in its magnitude or direction, or both. 

e An external force is one acting on a system from outside the system, as 
opposed to internal forces, which act between components within the 


system. 

e Newton’s second law of motion states that the acceleration of a system 
is directly proportional to and in the same direction as the net external 
force acting on the system, and inversely proportional to its mass. 

e In equation form, Newton’s second law of motion is a = Fast 


e This is often written in the more familiar form: F,., = ma. 
¢ The weight w of an object is defined as the force of gravity acting on 
an object of mass m. The object experiences an acceleration due to 


gravity g: 
Equation: 


w = mg. 


e If the only force acting on an object is due to gravity, the object is in 
free fall. 

e Friction is a force that opposes the motion past each other of objects 
that are touching. 


Conceptual Questions 


Exercise: 
Problem: 
Which statement is correct? (a) Net force causes motion. (b) Net force 
causes change in motion. Explain your answer and give an example. 
Exercise: 
Problem: 
Why can we neglect forces such as those holding a body together 
when we apply Newton’s second law of motion? 


Exercise: 


Problem: 
Explain how the choice of the “system of interest” affects which forces 
must be considered when applying Newton’s second law of motion. 
Exercise: 
Problem: 
Describe a situation in which the net external force on a system is not 
zero, yet its speed remains constant. 
Exercise: 
Problem: 
A system can have a nonzero velocity while the net external force on it 
is zero. Describe such a situation. 
Exercise: 
Problem: 
A rock is thrown straight up. What is the net external force acting on 
the rock when it is at the top of its trajectory? 
Exercise: 
Problem: 
(a) Give an example of different net external forces acting on the same 
system to produce different accelerations. (b) Give an example of the 
same net external force acting on systems of different masses, 


producing different accelerations. (c) What law accurately describes 
both effects? State it in words and as an equation. 


Exercise: 
Problem: 
If the acceleration of a system is zero, are no external forces acting on 
it? What about internal forces? Explain your answers. 


Exercise: 


Problem: 
If a constant, nonzero force is applied to an object, what can you say 
about the velocity and acceleration of the object? 

Exercise: 
Problem: 
The gravitational force on the basketball in [link] is ignored. When 
gravity is taken into account, what is the direction of the net external 


force on the basketball—above horizontal, below horizontal, or still 
horizontal? 


Problem Exercises 


You may assume data taken from illustrations is accurate to three 
digits. 
Exercise: 


Problem: 


A 63.0-kg sprinter starts a race with an acceleration of 4.20 m/ 3”. 
What is the net external force on him? 


Solution: 


265 N 
Exercise: 
Problem: 
If the sprinter from the previous problem accelerates at that rate for 20 


m, and then maintains that velocity for the remainder of the 100-m 
dash, what will be his time for the race? 


Exercise: 


Problem: 


A cleaner pushes a 4.50-kg laundry cart in such a way that the net 
external force on it is 60.0 N. Calculate the magnitude of its 
acceleration. 


Solution: 


13.3 m/s” 
Exercise: 


Problem: 


Since astronauts in orbit are apparently weightless, a clever method of 
measuring their masses is needed to monitor their mass gains or losses 
to adjust diets. One way to do this is to exert a known force on an 
astronaut and measure the acceleration produced. Suppose a net 
external force of 50.0 N is exerted and the astronaut’s acceleration is 
measured to be 0.893 m/ s*. (a) Calculate her mass. (b) By exerting a 
force on the astronaut, the vehicle in which they orbit experiences an 
equal and opposite force. Discuss how this would affect the 
measurement of the astronaut’s acceleration. Propose a method in 
which recoil of the vehicle is avoided. 


Exercise: 
Problem: 
In [link], the net external force on the 24-kg mower is stated to be 51 
N. If the force of friction opposing the motion is 24 N, what force F’ 
(in newtons) is the person exerting on the mower? Suppose the mower 


is moving at 1.5 m/s when the force F’ is removed. How far will the 
mower go before stopping? 


Exercise: 


Problem: 


The same rocket sled drawn in [link] is decelerated at a rate of 


196 m/ s”. What force is necessary to produce this deceleration? 
Assume that the rockets are off. The mass of the system is 2100 kg. 


Exercise: 


Problem: 


(a) If the rocket sled shown in [link] starts with only one rocket 
burning, what is the magnitude of its acceleration? Assume that the 
mass of the system is 2100 kg, the thrust T is 2.4 x 104 N, and the 
force of friction opposing the motion is known to be 650 N. (b) Why is 
the acceleration not one-fourth of what it is with all rockets burning? 


Solution: 
(a) 12 m/s’. 
(b) The acceleration is not one-fourth of what it was with all rockets 


burning because the frictional force is still as large as it was with all 
rockets burning. 


Exercise: 


Problem: 


What is the deceleration of the rocket sled if it comes to rest in 1.1 s 
from a speed of 1000 km/h? (Such deceleration caused one test subject 
to black out and have temporary blindness.) 


Exercise: 


Problem: 


Suppose two children push horizontally, but in exactly opposite 
directions, on a third child in a wagon. The first child exerts a force of 
75.0 N, the second a force of 90.0 N, friction is 12.0 N, and the mass 
of the third child plus wagon is 23.0 kg. (a) What is the system of 
interest if the acceleration of the child in the wagon is to be calculated? 
(b) Draw a free-body diagram, including all forces acting on the 
system. (c) Calculate the acceleration. (d) What would the acceleration 
be if friction were 15.0 N? 


Solution: 


(a) The system is the child in the wagon plus the wagon. 


(c) a = 0.130 m/ s” in the direction of the second child’s push. 


(d) a = 0.00 m/s? 
Exercise: 


Problem: 


A powerful motorcycle can produce an acceleration of 3.50 m/ 3” 
while traveling at 90.0 km/h. At that speed the forces resisting motion, 
including friction and air resistance, total 400 N. (Air resistance is 
analogous to air friction. It always opposes the motion of an object.) 
What is the magnitude of the force the motorcycle exerts backward on 
the ground to produce its acceleration if the mass of the motorcycle 
with rider is 245 kg? 


Exercise: 
Problem: 
The rocket sled shown in [link] accelerates at a rate of 49.0 m/ 5. Its 
passenger has a mass of 75.0 kg. (a) Calculate the horizontal 
component of the force the seat exerts against his body. Compare this 


with his weight by using a ratio. (b) Calculate the direction and 
magnitude of the total force the seat exerts against his body. 


Solution: 
(a) 3.68 x 10° N.. This force is 5.00 times greater than his weight. 


(b) 3750 N; 11.3° above horizontal 


Exercise: 


Problem: 


Repeat the previous problem for the situation in which the rocket sled 
decelerates at a rate of 201 m/ s”. In this problem, the forces are 
exerted by the seat and restraining belts. 


Exercise: 
Problem: 
The weight of an astronaut plus his space suit on the Moon is only 250 


N. How much do they weigh on Earth? What is the mass on the Moon? 
On Earth? 


Solution: 


1.5 x 10° N, 150 kg, 150 kg 

Exercise: 
Problem: 
Suppose the mass of a fully loaded module in which astronauts take off 
from the Moon is 10,000 kg. The thrust of its engines is 30,000 N. (a) 
Calculate its the magnitude of acceleration in a vertical takeoff from 


the Moon. (b) Could it lift off from Earth? If not, why not? If it could, 
calculate the magnitude of its acceleration. 


Glossary 


acceleration 
the rate at which an object’s velocity changes over a period of time 


free-fall 
a situation in which the only force acting on an object is the force due 
to gravity 


friction 


a force past each other of objects that are touching; examples include 
rough surfaces and air resistance 


net external force 
the vector sum of all external forces acting on an object or system; 
causes a mass to accelerate 


Newton’s second law of motion 
the net external force Fy.4 on an object with mass m is proportional to 
and in the same direction as the acceleration of the object, a, and 
inversely proportional to the mass; defined mathematically as 


a= Fret 
m 


system 
defined by the boundaries of an object or collection of objects being 
observed; all forces originating from outside of the system are 
considered external forces 


weight 
the force wdue to gravity acting on an object of mass m; defined 
mathematically as: w = mg, where g is the magnitude and direction 
of the acceleration due to gravity 


Newton’s Third Law of Motion: Symmetry in Forces 


e Understand Newton's third law of motion. 
e Apply Newton's third law to define systems and solve problems of 
motion. 


There is a passage in the musical Man of la Mancha that relates to 
Newton’s third law of motion. Sancho, in describing a fight with his wife to 
Don Quixote, says, “Of course I hit her back, Your Grace, but she’s a lot 
harder than me and you know what they say, ‘Whether the stone hits the 
pitcher or the pitcher hits the stone, it’s going to be bad for the pitcher.’” 
This is exactly what happens whenever one body exerts a force on another 
—the first also experiences a force (equal in magnitude and opposite in 
direction). Numerous common experiences, such as stubbing a toe or 
throwing a ball, confirm this. It is precisely stated in Newton’s third law of 
motion. 


Note: 

Newton’s Third Law of Motion 

Whenever one body exerts a force on a second body, the first body 
experiences a force that is equal in magnitude and opposite in direction to 
the force that it exerts. 


This law represents a certain symmetry in nature: Forces always occur in 
pairs, and one body cannot exert a force on another without experiencing a 
force itself. We sometimes refer to this law loosely as “action-reaction,” 
where the force exerted is the action and the force experienced as a 
consequence is the reaction. Newton’s third law has practical uses in 
analyzing the origin of forces and understanding which forces are external 
to a system. 


We can readily see Newton’s third law at work by taking a look at how 
people move about. Consider a swimmer pushing off from the side of a 
pool, as illustrated in [link]. She pushes against the pool wall with her feet 


and accelerates in the direction opposite to that of her push. The wall has 
exerted an equal and opposite force back on the swimmer. You might think 
that two equal and opposite forces would cancel, but they do not because 
they act on different systems. In this case, there are two systems that we 
could investigate: the swimmer or the wall. If we select the swimmer to be 
the system of interest, as in the figure, then F wan on feet is an external force 
on this system and affects its motion. The swimmer moves in the direction 
of F wan on feet. IN contrast, the force F feet on wal] acts on the wall and not on 
our system of interest. Thus F fect on wall does not directly affect the motion 
of the system and does not cancel F wat on feet. Note that the swimmer 
pushes in the direction opposite to that in which she wishes to move. The 


reaction to her push is thus in the desired direction. 
System of interest 


Ty) Free-body Diagram 


BF 
F wait on feet e 


Ww 


Direction of 
acceleration 


When the swimmer exerts a force F geet on wal] On the wall, she 
accelerates in the direction opposite to that of her push. This means 
the net external force on her is in the direction opposite to F fect on wall 
. This opposition occurs because, in accordance with Newton’s third 
law of motion, the wall exerts a force F wan on feet on her, equal in 
magnitude but in the direction opposite to the one she exerts on it. 
The line around the swimmer indicates the system of interest. Note 
that F feet on wall does not act on this system (the swimmer) and, thus, 
does not cancel F wat on feet. Thus the free-body diagram shows only 
F wall on feet, W, the gravitational force, and BF, the buoyant force of 
the water supporting the swimmer’s weight. The vertical forces w 
and BF cancel since there is no vertical motion. 


Other examples of Newton’s third law are easy to find. As a professor paces 
in front of a whiteboard, she exerts a force backward on the floor. The floor 
exerts a reaction force forward on the professor that causes her to accelerate 
forward. Similarly, a car accelerates because the ground pushes forward on 
the drive wheels in reaction to the drive wheels pushing backward on the 
ground. You can see evidence of the wheels pushing backward when tires 
Spin on a gravel road and throw rocks backward. In another example, 
rockets move forward by expelling gas backward at high velocity. This 
means the rocket exerts a large backward force on the gas in the rocket 
combustion chamber, and the gas therefore exerts a large reaction force 
forward on the rocket. This reaction force is called thrust. It is a common 
misconception that rockets propel themselves by pushing on the ground or 
on the air behind them. They actually work better in a vacuum, where they 
can more readily expel the exhaust gases. Helicopters similarly create lift 
by pushing air down, thereby experiencing an upward reaction force. Birds 
and airplanes also fly by exerting force on air in a direction opposite to that 
of whatever force they need. For example, the wings of a bird force air 
downward and backward in order to get lift and move forward. An octopus 
propels itself in the water by ejecting water through a funnel from its body, 
similar to a jet ski. In a situation similar to Sancho’s, professional cage 
fighters experience reaction forces when they punch, sometimes breaking 
their hand by hitting an opponent’s body. 


Example: 

Getting Up To Speed: Choosing the Correct System 

A physics professor pushes a cart of demonstration equipment to a lecture 
hall, as seen in [link]. Her mass is 65.0 kg, the cart’s is 12.0 kg, and the 
equipment’s is 7.0 kg. Calculate the acceleration produced when the 
professor exerts a backward force of 150 N on the floor. All forces 
opposing the motion, such as friction on the cart’s wheels and air 
resistance, total 24.0 N. 


Free body diagrams 
N 


f F 
e floor 


w 
System 1 


System 2 


A professor pushes a cart of demonstration equipment. The 
lengths of the arrows are proportional to the magnitudes of the 
forces (except for f, since it is too small to draw to scale). 
Different questions are asked in each example; thus, the system 
of interest must be defined differently for each. System 1 is 
appropriate for this example, since it asks for the acceleration 
of the entire group of objects. Only Foor and f are external 
forces acting on System 1 along the line of motion. All other 
forces either cancel or act on the outside world. System 2 is 
chosen for [link] so that F'>,o¢ will be an external force and 
enter into Newton’s second law. Note that the free-body 
diagrams, which allow us to apply Newton’s second law, vary 
with the system chosen. 


Strategy 

Since they accelerate as a unit, we define the system to be the professor, 
cart, and equipment. This is System 1 in [link]. The professor pushes 
backward with a force F'¢.,4 of 150 N. According to Newton’s third law, 
the floor exerts a forward reaction force F foo, of 150 N on System 1. 
Because all motion is horizontal, we can assume there is no net force in the 
vertical direction. The problem is therefore one-dimensional along the 


horizontal direction. As noted, f opposes the motion and is thus in the 
opposite direction of Foor. Note that we do not include the forces F prof or 
F .art because these are internal forces, and we do not include F 9,4 
because it acts on the floor, not on the system. There are no other 
significant forces acting on System 1. If the net external force can be found 
from all this information, we can use Newton’s second law to find the 
acceleration as requested. See the free-body diagram in the figure. 
Solution 

Newton’s second law is given by 

Equation: 


me er 


m 


The net external force on System 1 is deduced from [link] and the 
discussion above to be 
Equation: 


Ee = Htloor =) — LOU N— 24.0 N— 126 N. 


The mass of System 1 is 
Equation: 


m = (65.0 + 12.0 + 7.0) kg = 84kg. 


These values of Fe, and m produce an acceleration of 


Equation: 
a — LEER , 
m 
— 126N 2 
a= ay 1.5 m/s 
Discussion 


None of the forces between components of System 1, such as between the 
professor’s hands and the cart, contribute to the net external force because 
they are internal to System 1. Another way to look at this is to note that 
forces between components of a system cancel because they are equal in 
magnitude and opposite in direction. For example, the force exerted by the 


professor on the cart results in an equal and opposite force back on her. In 
this case both forces act on the same system and, therefore, cancel. Thus 
internal forces (between components of a system) cancel. Choosing System 
1 was crucial to solving this problem. 


Example: 

Force on the Cart—Choosing a New System 

Calculate the force the professor exerts on the cart in [link] using data from 
the previous example if needed. 

Strategy 

If we now define the system of interest to be the cart plus equipment 
(System 2 in [link]), then the net external force on System 2 is the force the 
professor exerts on the cart minus friction. The force she exerts on the cart, 
F prof, is an external force acting on System 2. F'>,of was internal to System 
1, but it is external to System 2 and will enter Newton’s second law for 
System 2. 

Solution 

Newton’s second law can be used to find F pro¢. Starting with 

Equation: 


F net 


m 


and noting that the magnitude of the net external force on System 2 is 
Equation: 


ins: == iD are — if, 


we solve for Fpror, the desired quantity: 
Equation: 


Nop Irae oP afc 


The value of f is given, so we must calculate net Fy. That can be done 
since both the acceleration and mass of System 2 are known. Using 
Newton’s second law we see that 


Equation: 
Pret = Ma, 


where the mass of System 2 is 19.0 kg (m= 12.0 kg + 7.0 kg) and its 
acceleration was found to be a = 1.5 m/ s” in the previous example. Thus, 
Equation: 


Pret = ma, 
Equation: 
F 4 = (19.0 kg)(1.5 m/s”) = 29 N. 


Now we can find the desired force: 


Equation: 
at het at da 
Equation: 
Fyrop = 29 N + 24.0 N = 53 N. 
Discussion 


It is interesting that this force is significantly less than the 150-N force the 
professor exerted backward on the floor. Not all of that 150-N force is 
transmitted to the cart; some of it accelerates the professor. 

The choice of a system is an important analytical step both in solving 
problems and in thoroughly understanding the physics of the situation 
(which is not necessarily the same thing). 


Note: 

PhET Explorations: Gravity Force Lab 

Visualize the gravitational force that two objects exert on each other. 
Change properties of the objects in order to see how it changes the gravity 
force. 


https://phet.colorado.edu/sims/html/gravity-force-lab/latest/gravity-force- 
lab_en. html 


Section Summary 


¢ Newton’s third law of motion represents a basic symmetry in nature. 
It states: Whenever one body exerts a force on a second body, the first 
body experiences a force that is equal in magnitude and opposite in 
direction to the force that the first body exerts. 

e A thrust is a reaction force that pushes a body forward in response to 
a backward force. Rockets, airplanes, and cars are pushed forward by a 
thrust reaction force. 


Conceptual Questions 


Exercise: 
Problem: 
When you take off in a jet aircraft, there is a sensation of being pushed 
back into the seat. Explain why you move backward in the seat—is 


there really a force backward on you? (The same reasoning explains 
whiplash injuries, in which the head is apparently thrown backward.) 


Exercise: 
Problem: 
A device used since the 1940s to measure the kick or recoil of the 
body due to heart beats is the “ballistocardiograph.” What physics 


principle(s) are involved here to measure the force of cardiac 
contraction? How might we construct such a device? 


Exercise: 


Problem: 


Describe a situation in which one system exerts a force on another and, 
as a consequence, experiences a force that is equal in magnitude and 
opposite in direction. Which of Newton’s laws of motion apply? 


Exercise: 
Problem: 
Why does an ordinary rifle recoil (kick backward) when fired? The 
barrel of a recoilless rifle is open at both ends. Describe how Newton’s 


third law applies when one is fired. Can you safely stand close behind 
one when it is fired? 


Exercise: 
Problem: 
An American football lineman reasons that it is senseless to try to out- 
push the opposing player, since no matter how hard he pushes he will 
experience an equal and opposite force from the other player. Use 
Newton’s laws and draw a free-body diagram of an appropriate system 


to explain how he can still out-push the opposition if he is strong 
enough. 


Exercise: 


Problem: 

Newton’s third law of motion tells us that forces always occur in pairs 
of equal and opposite magnitude. Explain how the choice of the 
“system of interest” affects whether one such pair of forces cancels. 


Problem Exercises 


Exercise: 


Problem: 


What net external force is exerted on a 1100-kg artillery shell fired 
from a battleship if the shell is accelerated at 2.40 x 10* m/ s”? What 
is the magnitude of the force exerted on the ship by the artillery shell? 


Solution: 
Force on shell: 2.64 x 10’ N 


Force exerted on ship = —2.64 x 10’ N, by Newton’s third law 
Exercise: 


Problem: 


A brave but inadequate rugby player is being pushed backward by an 
opposing player who is exerting a force of 800 N on him. The mass of 
the losing player plus equipment is 90.0 kg, and he is accelerating at 
1.20 m/ s” backward. (a) What is the force of friction between the 
losing player’s feet and the grass? (b) What force does the winning 
player exert on the ground to move forward if his mass plus equipment 
is 110 kg? (c) Draw a sketch of the situation showing the system of 
interest used to solve each part. For this situation, draw a free-body 
diagram and write the net force equation. 


Glossary 


Newton’s third law of motion 
whenever one body exerts a force on a second body, the first body 
experiences a force that is equal in magnitude and opposite in direction 
to the force that the first body exerts 


thrust 
a reaction force that pushes a body forward in response to a backward 
force; rockets, airplanes, and cars are pushed forward by a thrust 
reaction force 


Normal, Tension, and Other Examples of Forces 


¢ Define normal and tension forces. 

e Apply Newton's laws of motion to solve problems involving a variety 
of forces. 

e Use trigonometric identities to resolve weight into components. 


Forces are given many names, such as push, pull, thrust, lift, weight, 
friction, and tension. Traditionally, forces have been grouped into several 
categories and given names relating to their source, how they are 
transmitted, or their effects. The most important of these categories are 
discussed in this section, together with some interesting applications. 
Further examples of forces are discussed later in this text. 


Normal Force 


Weight (also called force of gravity) is a pervasive force that acts at all 
times and must be counteracted to keep an object from falling. You 
definitely notice that you must support the weight of a heavy object by 
pushing up on it when you hold it stationary, as illustrated in [link](a). But 
how do inanimate objects like a table support the weight of a mass placed 
on them, such as shown in [link](b)? When the bag of dog food is placed on 
the table, the table actually sags slightly under the load. This would be 
noticeable if the load were placed on a card table, but even rigid objects 
deform when a force is applied to them. Unless the object is deformed 
beyond its limit, it will exert a restoring force much like a deformed spring 
(or trampoline or diving board). The greater the deformation, the greater the 
restoring force. So when the load is placed on the table, the table sags until 
the restoring force becomes as large as the weight of the load. At this point 
the net external force on the load is zero. That is the situation when the load 
is stationary on the table. The table sags quickly, and the sag is slight so we 
do not notice it. But it is similar to the sagging of a trampoline when you 
climb onto it. 


‘7 h 


= 
= 


Free-body diagrams 


(a) The person holding the bag of dog food 
must supply an upward force Fhana equal in 
magnitude and opposite in direction to the 
weight of the food w. (b) The card table 
sags when the dog food is placed on it, much 
like a stiff trampoline. Elastic restoring 
forces in the table grow as it sags until they 
supply a force N equal in magnitude and 
opposite in direction to the weight of the 
load. 


We must conclude that whatever supports a load, be it animate or not, must 
supply an upward force equal to the weight of the load, as we assumed in a 
few of the previous examples. If the force supporting a load is 
perpendicular to the surface of contact between the load and its support, this 
force is defined to be a normal force and here is given the symbol N. (This 
is not the unit for force N.) The word normal means perpendicular to a 


surface. The normal force can be less than the object’s weight if the object 
is on an incline, as you will see in the next example. 


Note: 

Common Misconception: Normal Force (N) vs. Newton (N) 

In this section we have introduced the quantity normal force, which is 
represented by the variable N. This should not be confused with the 
symbol for the newton, which is also represented by the letter N. These 
symbols are particularly important to distinguish because the units of a 
normal force (N) happen to be newtons (N). For example, the normal force 
N that the floor exerts on a chair might be N = 100 N. One important 
difference is that normal force is a vector, while the newton is simply a 
unit. Be careful not to confuse these letters in your calculations! You will 
encounter more similarities among variables and units as you proceed in 
physics. Another example of this is the quantity work (W) and the unit 
watts (W). 


Example: 

Weight on an Incline, a Two-Dimensional Problem 

Consider the skier on a slope shown in [link]. Her mass including 
equipment is 60.0 kg. (a) What is her acceleration if friction is negligible? 
(b) What is her acceleration if friction is known to be 45.0 N? 


Free-body diagram 


Since motion and friction are parallel to the 
slope, it is most convenient to project all 


forces onto a coordinate system where one 
axis is parallel to the slope and the other is 
perpendicular (axes shown to left of skier). 
N is perpendicular to the slope and f is 
parallel to the slope, but w has components 
along both axes, namely w, and Ww). N is 
equal in magnitude to w _, so that there is 
no motion perpendicular to the slope, but f 
is less than wy, so that there is a downslope 


acceleration (along the parallel axis). 


Strategy 

This is a two-dimensional problem, since the forces on the skier (the 
system of interest) are not parallel. The approach we have used in two- 
dimensional kinematics also works very well here. Choose a convenient 
coordinate system and project the vectors onto its axes, creating two 
connected one-dimensional problems to solve. The most convenient 
coordinate system for motion on an incline is one that has one coordinate 
parallel to the slope and one perpendicular to the slope. (Remember that 
motions along mutually perpendicular axes are independent.) We use the 
symbols | and || to represent perpendicular and parallel, respectively. This 
choice of axes simplifies this type of problem, because there is no motion 
perpendicular to the slope and because friction is always parallel to the 
surface between two objects. The only external forces acting on the system 
are the skier’s weight, friction, and the support of the slope, respectively 
labeled w, f, and N in [link]. N is always perpendicular to the slope, and 
f is parallel to it. But w is not in the direction of either axis, and so the first 
step we take is to project it into components along the chosen axes, 
defining wy) to be the component of weight parallel to the slope and w_ the 
component of weight perpendicular to the slope. Once this is done, we can 
consider the two separate problems of forces parallel to the slope and 
forces perpendicular to the slope. 

Solution 

The magnitude of the component of the weight parallel to the slope is 

Ww), = w sin (25°) = mg sin (25°), and the magnitude of the component of 


the weight perpendicular to the slope is 

(i= COS (Doni eg es (oo. 

(a) Neglecting friction. Since the acceleration is parallel to the slope, we 
need only consider forces parallel to the slope. (Forces perpendicular to the 
slope add to zero, since there is no acceleration in that direction.) The 
forces parallel to the slope are the amount of the skier’s weight parallel to 
the slope wy and friction f. Using Newton’s second law, with subscripts to 


denote quantities parallel to the slope, 
Equation: 


Beh 


m 


lees 


where Fyyet|) = wi = mg sin (25°), assuming no friction for this part, so 
that 
Equation: 


Fy, mg sin (25° 
a) = — = ee = g sin (25°) 


Equation: 
(9.80 m/s”) (0.4226) = 4.14 m/s” 


is the acceleration. 

(b) Including friction. We now have a given value for friction, and we 
know its direction is parallel to the slope and it opposes motion between 
surfaces in contact. So the net external force is now 


Equation: 
Fret = WU] — ifs 
° ° ° ° Joes . 
and substituting this into Newton’s second law, ay = —t gives 
Equation: 


ay = cell § WF __ mg sin (25°) — f 
ee nn mm . m 


We substitute known values to obtain 


Equation: 
_ 60.0 kg 
which yields 
Equation: 
a = 3.39 m/s”, 


which is the acceleration parallel to the incline when there is 45.0 N of 
opposing friction. 

Discussion 

Since friction always opposes motion between surfaces, the acceleration is 
smaller when there is friction than when there is none. In fact, it is a 
general result that if friction on an incline is negligible, then the 
acceleration down the incline is a = g sin0, regardless of mass. This is 
related to the previously discussed fact that all objects fall with the same 
acceleration in the absence of air resistance. Similarly, all objects, 
regardless of mass, slide down a frictionless incline with the same 
acceleration (if the angle is the same). 


Note: 
Resolving Weight into Components 


w = wsin(@) = mg sin(@) 
w, = wcos(@) = mgcos(é@) 


An object rests on an incline that makes an 


angle 8 with the horizontal. 


When an object rests on an incline that makes an angle 8 with the 
horizontal, the force of gravity acting on the object is divided into two 
components: a force acting perpendicular to the plane, w__, and a force 
acting parallel to the plane, w). The perpendicular force of weight, w _, is 
typically equal in magnitude and opposite in direction to the normal force, 
N. The force acting parallel to the plane, Ww), Causes the object to 


accelerate down the incline. The force of friction, f, opposes the motion of 
the object, so it acts upward along the plane. 

It is important to be careful when resolving the weight of the object into 
components. If the angle of the incline is at an angle @ to the horizontal, 
then the magnitudes of the weight components are 

Equation: 


w) = w sin (@) = mg sin (8) 


and 
Equation: 


w_ = wos (0) = mg cos (8). 


Instead of memorizing these equations, it is helpful to be able to determine 
them from reason. To do this, draw the right triangle formed by the three 
weight vectors. Notice that the angle @ of the incline is the same as the 
angle formed between w and w _. Knowing this property, you can use 
trigonometry to determine the magnitude of the weight components: 
Equation: 


cos(#) = = 

WL = wcos (6) = mg cos (6) 
Equation: 

sin(9) = al 


w sin (0) = mg sin (0) 


& 
| 


Note: 

Take-Home Experiment: Force Parallel 

To investigate how a force parallel to an inclined plane changes, find a 
rubber band, some objects to hang from the end of the rubber band, and a 
board you can position at different angles. How much does the rubber band 
stretch when you hang the object from the end of the board? Now place the 
board at an angle so that the object slides off when placed on the board. 
How much does the rubber band extend if it is lined up parallel to the 
board and used to hold the object stationary on the board? Try two more 
angles. What does this show? 


Tension 


A tension is a force along the length of a medium, especially a force carried 
by a flexible medium, such as a rope or cable. The word “tension” comes 
from a Latin word meaning “to stretch.” Not coincidentally, the flexible 
cords that carry muscle forces to other parts of the body are called tendons. 
Any flexible connector, such as a string, rope, chain, wire, or cable, can 
exert pulls only parallel to its length; thus, a force carried by a flexible 
connector is a tension with direction parallel to the connector. It is 
important to understand that tension is a pull in a connector. In contrast, 
consider the phrase: “You can’t push a rope.” The tension force pulls 
outward along the two ends of a rope. 


Consider a person holding a mass on a rope as shown in [Link]. 


Free-body diagram 


Ng 


When a perfectly 
flexible connector 
(one requiring no 
force to bend it) 
such as this rope 
transmits a force T, 
that force must be 
parallel to the 
length of the rope, 
as shown. The pull 
such a flexible 
connector exerts is 
a tension. Note that 
the rope pulls with 
equal force but in 
opposite directions 
on the hand and the 
supported mass 
(neglecting the 
weight of the rope). 
This is an example 
of Newton’s third 
law. The rope is the 
medium that carries 


the equal and 
opposite forces 
between the two 
objects. The 
tension anywhere 
in the rope between 
the hand and the 
mass is equal. Once 
you have 
determined the 
tension in one 
location, you have 
determined the 
tension at all 
locations along the 
rope. 


Tension in the rope must equal the weight of the supported mass, as we can 
prove using Newton’s second law. If the 5.00-kg mass in the figure is 
stationary, then its acceleration is zero, and thus F,., = 0. The only 
external forces acting on the mass are its weight w and the tension T 
supplied by the rope. Thus, 

Equation: 


Prt = T — w= 0, 
where 7’ and w are the magnitudes of the tension and weight and their signs 
indicate direction, with up being positive here. Thus, just as you would 


expect, the tension equals the weight of the supported mass: 
Equation: 


T= w= mg. 


For a 5.00-kg mass, then (neglecting the mass of the rope) we see that 


Equation: 


T = mg = (5.00 kg)(9.80 m/s”) = 49.0 N. 


If we cut the rope and insert a spring, the spring would extend a length 
corresponding to a force of 49.0 N, providing a direct observation and 
measure of the tension force in the rope. 


Flexible connectors are often used to transmit forces around corners, such 
as in a hospital traction system, a finger joint, or a bicycle brake cable. If 
there is no friction, the tension is transmitted undiminished. Only its 
direction changes, and it is always parallel to the flexible connector. This is 
illustrated in [link] (a) and (b). 


Extensor muscle 
Tendon 


(a) Flexor muscle Tendon 


(a) Tendons in the finger 
carry force T from the 
muscles to other parts of 
the finger, usually changing 
the force’s direction, but 
not its magnitude (the 


tendons are relatively 
friction free). (b) The brake 
cable on a bicycle carries 
the tension T from the 
handlebars to the brake 
mechanism. Again, the 
direction but not the 
magnitude of T is changed. 


Example: 

What Is the Tension in a Tightrope? 

Calculate the tension in the wire supporting the 70.0-kg tightrope walker 
shown in [link]. 


of 


The weight of a tightrope walker causes a 
wire to sag by 5.0 degrees. The system of 
interest here is the point in the wire at 
which the tightrope walker is standing. 


Strategy 

As you can see in the figure, the wire is not perfectly horizontal (it cannot 
be!), but is bent under the person’s weight. Thus, the tension on either side 
of the person has an upward component that can support his weight. As 


usual, forces are vectors represented pictorially by arrows having the same 
directions as the forces and lengths proportional to their magnitudes. The 
system is the tightrope walker, and the only external forces acting on him 
are his weight w and the two tensions Ty, (left tension) and TR (right 
tension), as illustrated. It is reasonable to neglect the weight of the wire 
itself. The net external force is zero since the system is stationary. A little 
trigonometry can now be used to find the tensions. One conclusion is 
possible at the outset—we can see from part (b) of the figure that the 
magnitudes of the tensions 7}, and TR must be equal. This is because there 
is no horizontal acceleration in the rope, and the only forces acting to the 
left and right are Ty, and Tr. Thus, the magnitude of those forces must be 
equal so that they cancel each other out. 

Whenever we have two-dimensional vector problems in which no two 
vectors are parallel, the easiest method of solution is to pick a convenient 
coordinate system and project the vectors onto its axes. In this case the best 
coordinate system has one axis horizontal and the other vertical. We call 
the horizontal the x-axis and the vertical the y-axis. 

Solution 


First, we need to resolve the tension vectors into their horizontal and 
vertical components. It helps to draw a new free-body diagram showing all 
of the horizontal and vertical components of each force acting on the 
system. 


Free-body diagram 


When the vectors are projected onto vertical and 
horizontal axes, their components along those axes 
must add to zero, since the tightrope walker is 
stationary. The small angle results in T’ being 
much greater than w. 


Consider the horizontal components of the forces (denoted with a subscript 
ce) 
Equation: 


eee a ote a TRe- 


The net external horizontal force Petz = 0, since the person is stationary. 
Thus, 
Equation: 


Pete = ig oo TRe 
Tie = Te 


Now, observe [link]. You can use trigonometry to determine the magnitude 
of Ty, and TR. Notice that: 


Equation: 

cos (5.0°) = ais 

The = Ty, cos (5.0°) 

oO = T Z. 

cos (5.0°) = a 

Tae = Trp cos (5.0°). 
Equating Tj, and TR;: 
Equation: 

Ty, cos (5.0°) = Tr cos (5.0°). 
Thus, 
Equation: 


T, =Ta =T, 


as predicted. Now, considering the vertical components (denoted by a 
subscript y), we can solve for T’. Again, since the person is stationary, 
Newton’s second law implies that net fF’, = 0. Thus, as illustrated in the 
free-body diagram in [link], 

Equation: 


ae = iN SIE TRy —w= 0. 


Observing [link], we can use trigonometry to determine the relationship 
between 71, Z’R,, and T’. As we determined from the analysis in the 
horizontal direction, 7], = TR = T: 


Equation: 
sin (5.0°) = > 
Tty = Ty sin (5.0°) = T sin (5.0°) 
sin (5.0°) = at 


Try = Tr sin (5.0°) = T sin (5.0). 


Now, we can substitute the values for Ty, and TR,,, into the net force 
equation in the vertical direction: 


Equation: 
Prety = Try + TRy — w= 0 
Eines = Tsin (5.0°) + T sin (5.0°) — w = 0 
2T sin (5.0°) -w = 0 
2 T sin (5.0°) = 
and 
Equation: 
_ Ww _ mg 
2sin (5.0°)  2sin (5.0°) ’ 
so that 
Equation: 


(70.0 kg) (9.80 m/s”) 
2(0.0872) 


and the tension is 
Equation: 


T' = 3900 N. 


Discussion 

Note that the vertical tension in the wire acts as a normal force that 
supports the weight of the tightrope walker. The tension is almost six times 
the 686-N weight of the tightrope walker. Since the wire is nearly 
horizontal, the vertical component of its tension is only a small fraction of 
the tension in the wire. The large horizontal components are in opposite 
directions and cancel, and so most of the tension in the wire is not used to 
support the weight of the tightrope walker. 


If we wish to create a very large tension, all we have to do is exert a force 
perpendicular to a flexible connector, as illustrated in [link]. As we saw in 
the last example, the weight of the tightrope walker acted as a force 
perpendicular to the rope. We saw that the tension in the roped related to the 
weight of the tightrope walker in the following way: 

Equation: 


_ w 
~ Qsin (6) ” 


We can extend this expression to describe the tension 7’ created when a 
perpendicular force (F | ) is exerted at the middle of a flexible connector: 
Equation: 


~ 2Qsin (6) ” 


Note that 6 is the angle between the horizontal and the bent connector. In 
this case, T’ becomes very large as 8 approaches zero. Even the relatively 
small weight of any flexible connector will cause it to sag, since an infinite 
tension would result if it were horizontal (i.e., 9 = 0 and sin 0 = 0). (See 
[link].) 


We can create a very large tension in the chain by pushing on it 
perpendicular to its length, as shown. Suppose we wish to pull 
a car out of the mud when no tow truck is available. Each time 
the car moves forward, the chain is tightened to keep it as 
nearly straight as possible. The tension in the chain is given by 
i oan ; since 0 is small, T is very large. This situation is 


analogous to the tightrope walker shown in [link], except that 
the tensions shown here are those transmitted to the car and the 
tree rather than those acting at the point where F , is applied. 


A | oD 
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Unless an infinite tension is 
exerted, any flexible connector— 
such as the chain at the bottom of 

the picture—will sag under its own 
weight, giving a characteristic 
curve when the weight is evenly 


distributed along the length. 
Suspension bridges—such as the 
Golden Gate Bridge shown in this 
image—are essentially very heavy 
flexible connectors. The weight of 
the bridge is evenly distributed 
along the length of flexible 
connectors, usually cables, which 
take on the characteristic shape. 
(credit: Leaflet, Wikimedia 
Commons) 


Extended Topic: Real Forces and Inertial Frames 


There is another distinction among forces in addition to the types already 
mentioned. Some forces are real, whereas others are not. Real forces are 
those that have some physical origin, such as the gravitational pull. 
Contrastingly, fictitious forces are those that arise simply because an 
observer is in an accelerating frame of reference, such as one that rotates 
(like a merry-go-round) or undergoes linear acceleration (like a car slowing 
down). For example, if a satellite is heading due north above Earth’s 
northern hemisphere, then to an observer on Earth it will appear to 
experience a force to the west that has no physical origin. Of course, what is 
happening here is that Earth is rotating toward the east and moves east 
under the satellite. In Earth’s frame this looks like a westward force on the 
satellite, or it can be interpreted as a violation of Newton’s first law (the law 
of inertia). An inertial frame of reference is one in which all forces are 
real and, equivalently, one in which Newton’s laws have the simple forms 
given in this chapter. 


Earth’s rotation is slow enough that Earth is nearly an inertial frame. You 
ordinarily must perform precise experiments to observe fictitious forces and 
the slight departures from Newton’s laws, such as the effect just described. 
On the large scale, such as for the rotation of weather systems and ocean 
currents, the effects can be easily observed. 


The crucial factor in determining whether a frame of reference is inertial is 
whether it accelerates or rotates relative to a known inertial frame. Unless 
stated otherwise, all phenomena discussed in this text are considered in 
inertial frames. 


All the forces discussed in this section are real forces, but there are a 
number of other real forces, such as lift and thrust, that are not discussed in 
this section. They are more specialized, and it is not necessary to discuss 
every type of force. It is natural, however, to ask where the basic simplicity 
we seek to find in physics is in the long list of forces. Are some more basic 
than others? Are some different manifestations of the same underlying 
force? The answer to both questions is yes, as will be seen in the next 
(extended) section and in the treatment of modern physics later in the text. 


Note: 

PhET Explorations: Forces in 1 Dimension 

Explore the forces at work when you try to push a filing cabinet. Create an 
applied force and see the resulting friction force and total force acting on 
the cabinet. Charts show the forces, position, velocity, and acceleration vs. 
time. View a free-body diagram of all the forces (including gravitational 
and normal forces). 


Forces in 
1 
Dimensio 
n 


Section Summary 


e¢ When objects rest on a surface, the surface applies a force to the object 
that supports the weight of the object. This supporting force acts 


perpendicular to and away from the surface. It is called a normal force, 
N. 

e When objects rest on a non-accelerating horizontal surface, the 
magnitude of the normal force is equal to the weight of the object: 
Equation: 


N = mg. 


e When objects rest on an inclined plane that makes an angle @ with the 
horizontal surface, the weight of the object can be resolved into 
components that act perpendicular (w , ) and parallel (w))) to the 


surface of the plane. These components can be calculated using: 
Equation: 


w| = w sin (6) = mg sin (6) 
Equation: 
w, = wcos (0) = mg cos (6). 


e The pulling force that acts along a stretched flexible connector, such as 
a rope or cable, is called tension, 'T. When a rope supports the weight 
of an object that is at rest, the tension in the rope is equal to the weight 
of the object: 

Equation: 


me; 
e In any inertial frame of reference (one that is not accelerated or 


rotated), Newton’s laws have the simple forms given in this chapter 
and all forces are real forces having a physical origin. 


Conceptual Questions 


Exercise: 


Problem: 


If a leg is suspended by a traction setup as shown in [link], what is the 
tension in the rope? 


A leg is suspended by 
a traction system in 
which wires are used 
to transmit forces. 
Frictionless pulleys 
change the direction of 
the force T without 
changing its 
magnitude. 


Exercise: 


Problem: 


In a traction setup for a broken bone, with pulleys and rope available, 
how might we be able to increase the force along the tibia using the 
same weight? (See [link].) (Note that the tibia is the shin bone shown 
in this image.) 


Problem Exercises 


Exercise: 


Problem: 


Two teams of nine members each engage in a tug of war. Each of the 
first team’s members has an average mass of 68 kg and exerts an 
average force of 1350 N horizontally. Each of the second team’s 
members has an average mass of 73 kg and exerts an average force of 
1365 N horizontally. (a) What is magnitude of the acceleration of the 
two teams? (b) What is the tension in the section of rope between the 
teams? 


Solution: 


a. 0.11 m/s” 
b. 1.2 x 104N 


Exercise: 


Problem: 


What force does a trampoline have to apply to a 45.0-kg gymnast to 
accelerate her straight up at 7.50 m/ s”? Note that the answer is 
independent of the velocity of the gymnast—she can be moving either 
up or down, or be stationary. 


Exercise: 


Problem: 


(a) Calculate the tension in a vertical strand of spider web if a spider of 
mass 8.00 x 10~° kg hangs motionless on it. (b) Calculate the tension 
in a horizontal strand of spider web if the same spider sits motionless 
in the middle of it much like the tightrope walker in [link]. The strand 
sags at an angle of 12° below the horizontal. Compare this with the 
tension in the vertical strand (find their ratio). 


Solution: 
(a) 7.84 x 10°-4N 
(b) 1.89 x 10° N.. This is 2.41 times the tension in the vertical 
strand. 
Exercise: 
Problem: 


Suppose a 60.0-kg gymnast climbs a rope. (a) What is the tension in 
the rope if he climbs at a constant speed? (b) What is the tension in the 


rope if he accelerates upward at a rate of 1.50 m/ $7? 
Exercise: 


Problem: 


Show that, as stated in the text, a force F , exerted on a flexible 
medium at its center and perpendicular to its length (such as on the 
tightrope wire in [link]) gives rise to a tension of magnitude 


a F 
T=35 sin 0) 


Solution: 


Newton’s second law applied in vertical direction gives 
Equation: 


Fy, = F —2T sin? =0 
Equation: 
F = 2T sin 6 


Equation: 


Exercise: 


Problem: 


Consider the baby being weighed in [link]. (a) What is the mass of the 
child and basket if a scale reading of 55 N is observed? (b) What is the 
tension 7, in the cord attaching the baby to the scale? (c) What is the 
tension 7° in the cord attaching the scale to the ceiling, if the scale has 
a mass of 0.500 kg? (d) Draw a sketch of the situation indicating the 
system of interest used to solve each part. The masses of the cords are 
negligible. 


A baby is weighed 
using a spring 
scale. 


Glossary 


inertial frame of reference 
a coordinate system that is not accelerating; all forces acting in an 
inertial frame of reference are real forces, as opposed to fictitious 
forces that are observed due to an accelerating frame of reference 


normal force 
the force that a surface applies to an object to support the weight of the 
object; acts perpendicular to the surface on which the object rests 


tension 
the pulling force that acts along a medium, especially a stretched 
flexible connector, such as a rope or cable; when a rope supports the 
weight of an object, the force on the object due to the rope is called a 
tension force 


Problem-Solving Strategies 
e Understand and apply a problem-solving procedure to solve problems using Newton's laws of motion. 


Success in problem solving is obviously necessary to understand and apply physical principles, not to mention the 
more immediate need of passing exams. The basics of problem solving, presented earlier in this text, are followed 
here, but specific strategies useful in applying Newton’s laws of motion are emphasized. These techniques also 
reinforce concepts that are useful in many other areas of physics. Many problem-solving strategies are stated 
outright in the worked examples, and so the following techniques should reinforce skills you have already begun to 
develop. 


Problem-Solving Strategy for Newton’s Laws of Motion 


Step 1. As usual, it is first necessary to identify the physical principles involved. Once it is determined that 
Newton’s laws of motion are involved (if the problem involves forces), it is particularly important to draw a careful 
sketch of the situation. Such a sketch is shown in [link](a). Then, as in [link](b), use arrows to represent all forces, 
label them carefully, and make their lengths and directions correspond to the forces they represent (whenever 
sufficient information exists). 


T 
This force is not a force on 
the system of interest since it 
is exerted on the outside world. 
= It must be omitted from the 
free-body diagram 
Fr 
f 
System of interest 
w 
5 These f t be | 
Free-body ese forces must be equal 
di and opposite since the 
iagram > 
net external force is zero. 
w Thus T = -w 
(a) (b) (c) (d) 
Sketch Identify forces Define system of interest Add forces 


(a) A sketch of Tarzan hanging from a vine. (b) Arrows are 
used to represent all forces. T is the tension in the vine above 
Tarzan, F'y is the force he exerts on the vine, and w is his 
weight. All other forces, such as the nudge of a breeze, are 
assumed negligible. (c) Suppose we are given the ape man’s 
mass and asked to find the tension in the vine. We then define 
the system of interest as shown and draw a free-body diagram. 
F 7 is no longer shown, because it is not a force acting on the 
system of interest; rather, Fy acts on the outside world. (d) 
Showing only the arrows, the head-to-tail method of addition is 
used. It is apparent that T = —w, if Tarzan is stationary. 


Step 2. Identify what needs to be determined and what is known or can be inferred from the problem as stated. 
That is, make a list of knowns and unknowns. Then carefully determine the system of interest. This decision is a 
crucial step, since Newton’s second law involves only external forces. Once the system of interest has been 
identified, it becomes possible to determine which forces are external and which are internal, a necessary step to 


employ Newton’s second law. (See [link](c).) Newton’s third law may be used to identify whether forces are 
exerted between components of a system (internal) or between the system and something outside (external). As 
illustrated earlier in this chapter, the system of interest depends on what question we need to answer. This choice 
becomes easier with practice, eventually developing into an almost unconscious process. Skill in clearly defining 
systems will be beneficial in later chapters as well. 


A diagram showing the system of interest and all of the external forces is called a free-body diagram. Only forces 
are shown on free-body diagrams, not acceleration or velocity. We have drawn several of these in worked 
examples. [link](c) shows a free-body diagram for the system of interest. Note that no internal forces are shown in 
a free-body diagram. 


Step 3. Once a free-body diagram is drawn, Newton’s second law can be applied to solve the problem. This is done 
in [link](d) for a particular situation. In general, once external forces are clearly identified in free-body diagrams, it 
should be a straightforward task to put them into equation form and solve for the unknown, as done in all previous 
examples. If the problem is one-dimensional—that is, if all forces are parallel—then they add like scalars. If the 
problem is two-dimensional, then it must be broken down into a pair of one-dimensional problems. This is done by 
projecting the force vectors onto a set of axes chosen for convenience. As seen in previous examples, the choice of 
axes can simplify the problem. For example, when an incline is involved, a set of axes with one axis parallel to the 
incline and one perpendicular to it is most convenient. It is almost always convenient to make one axis parallel to 
the direction of motion, if this is known. 


Note: 

Applying Newton’s Second Law 

Before you write net force equations, it is critical to determine whether the system is accelerating in a particular 
direction. If the acceleration is zero in a particular direction, then the net force is zero in that direction. Similarly, 
if the acceleration is nonzero in a particular direction, then the net force is described by the equation: Fy, = ma. 
For example, if the system is accelerating in the horizontal direction, but it is not accelerating in the vertical 
direction, then you will have the following conclusions: 

Equation: 


Pret x — Ma, 
Equation: 
Biren = Oe 


You will need this information in order to determine unknown forces acting in a system. 


Step 4. As always, check the solution to see whether it is reasonable. In some cases, this is obvious. For example, 
it is reasonable to find that friction causes an object to slide down an incline more slowly than when no friction 
exists. In practice, intuition develops gradually through problem solving, and with experience it becomes 
progressively easier to judge whether an answer is reasonable. Another way to check your solution is to check the 
units. If you are solving for force and end up with units of m/s, then you have made a mistake. 


Section Summary 
¢ To solve problems involving Newton’s laws of motion, follow the procedure described: 


1. Draw a sketch of the problem. 

2. Identify known and unknown quantities, and identify the system of interest. Draw a free-body diagram, 
which is a sketch showing all of the forces acting on an object. The object is represented by a dot, and 
the forces are represented by vectors extending in different directions from the dot. If vectors act in 


directions that are not horizontal or vertical, resolve the vectors into horizontal and vertical components 
and draw them on the free-body diagram. 

3. Write Newton’s second law in the horizontal and vertical directions and add the forces acting on the 
object. If the object does not accelerate in a particular direction (for example, the x-direction) then 
Fret x = 0. If the object does accelerate in that direction, Fret ¢ = ma. 

4. Check your answer. Is the answer reasonable? Are the units correct? 


Problem Exercises 


Exercise: 
Problem: 
A 5.00 x 10°-kg rocket is accelerating straight up. Its engines produce 1.250 x 10” N of thrust, and air 
resistance is 4.50 x 10° N. What is the rocket’s acceleration? Explicitly show how you follow the steps in the 


Problem-Solving Strategy for Newton’s laws of motion. 


Solution: 
T 


{m3 


Using the free-body diagram: 


Fret = T — f — mg = ma, 


so that 
T-f- 1.250x 107 N—4.50x 10° N—(5.00x 105 kg)(9.80 m/s” 2 
q = ime — ( )(9-80m/s) _ 620 m/s’. 
m 5.00x10° ke 
Exercise: 
Problem: 


The wheels of a midsize car exert a force of 2100 N backward on the road to accelerate the car in the forward 
direction. If the force of friction including air resistance is 250 N and the acceleration of the car is 1.80 m/ s’, 
what is the mass of the car plus its occupants? Explicitly show how you follow the steps in the Problem- 
Solving Strategy for Newton’s laws of motion. For this situation, draw a free-body diagram and write the net 
force equation. 

Exercise: 


Problem: 

Calculate the force a 70.0-kg high jumper must exert on the ground to produce an upward acceleration 4.00 
times the acceleration due to gravity. Explicitly show how you follow the steps in the Problem-Solving 
Strategy for Newton’s laws of motion. 


Solution: 


Use Newton’s laws of motion. 


Ww 


Given :a = 4.00g = (4.00)(9.80 m/s”) = 39.2 m/s”; ™ = 70.0 kg, 
Find: F’. 
SO F=4+F-w=maso F=ma+w=ma+mg=m(a-+ g). 
that F = (70.0 kg) [(39.2 m/s”) + (9.80 m/s” 

= 3.43 x 10°N. The force exerted by the 
high-jumper is actually down on the ground, 
but Fis up from the ground and makes him 
jump. 

This result is reasonable, since it is quite possible for a person to exert a force of the magnitude of 10° N. 


Exercise: 


Problem: 


When landing after a spectacular somersault, a 40.0-kg gymnast decelerates by pushing straight down on the 
mat. Calculate the force she must exert if her deceleration is 7.00 times the acceleration due to gravity. 
Explicitly show how you follow the steps in the Problem-Solving Strategy for Newton’s laws of motion. 


Exercise: 


Problem: 


A freight train consists of two 8.00 x 10*-kg engines and 45 cars with average masses of 5.50 x 104 kg . (a) 
What force must each engine exert backward on the track to accelerate the train at a rate of 5.00 x 10°? m/ 8° 
if the force of friction is 7.50 x 10° N, assuming the engines exert identical forces? This is not a large 
frictional force for such a massive system. Rolling friction for trains is small, and consequently trains are very 
energy-efficient transportation systems. (b) What is the force in the coupling between the 37th and 38th cars 
(this is the force each exerts on the other), assuming all cars have the same mass and that friction is evenly 
distributed among all of the cars and engines? 


Solution: 
(a) 4.41 x 10° N 


(b) 1.50 x 10° N 
Exercise: 


Problem: 


Commercial airplanes are sometimes pushed out of the passenger loading area by a tractor. (a) An 1800-kg 
tractor exerts a force of 1.75 x 10+ N backward on the pavement, and the system experiences forces resisting 
motion that total 2400 N. If the acceleration is 0.150 m/ 4 what is the mass of the airplane? (b) Calculate the 
force exerted by the tractor on the airplane, assuming 2200 N of the friction is experienced by the airplane. (c) 
Draw two sketches showing the systems of interest used to solve each part, including the free-body diagrams 
for each. 


Exercise: 


Problem: 


A 1100-kg car pulls a boat on a trailer. (a) What total force resists the motion of the car, boat, and trailer, if the 
car exerts a 1900-N force on the road and produces an acceleration of 0.550 m/ s°? The mass of the boat plus 
trailer is 700 kg. (b) What is the force in the hitch between the car and the trailer if 80% of the resisting forces 
are experienced by the boat and trailer? 


Solution: 
(a) 910 N 


(b) 1.11 x 10° N 
Exercise: 


Problem: 


(a) Find the magnitudes of the forces F; and F 2 that add to give the total force F,., shown in [link]. This 
may be done either graphically or by using trigonometry. (b) Show graphically that the same total force is 
obtained independent of the order of addition of F,; and F3. (c) Find the direction and magnitude of some 
other pair of vectors that add to give F;,;. Draw these to scale on the same drawing used in part (b) or a 
similar picture. 


Fp Free-body diagram 


Exercise: 
Problem: 
Two children pull a third child on a snow saucer sled exerting forces F; and F2 as shown from above in 
[link]. Find the acceleration of the 49.00-kg sled and child system. Note that the direction of the frictional 
force is unspecified; it will be in the opposite direction of the sum of Fy and F». 


Solution: 


a = 0.139 m/s, 6 = 12.4° north of east 


An overhead view of the 
horizontal forces acting on a 


child’s snow saucer sled. 


Exercise: 


Problem: 


Suppose your car was mired deeply in the mud and you wanted to use the method illustrated in [link] to pull it 
out. (a) What force would you have to exert perpendicular to the center of the rope to produce a force of 
12,000 N on the car if the angle is 2.00°? In this part, explicitly show how you follow the steps in the 
Problem-Solving Strategy for Newton’s laws of motion. (b) Real ropes stretch under such forces. What force 
would be exerted on the car if the angle increases to 7.00° and you still apply the force found in part (a) to its 
center? 


Exercise: 
Problem: 
What force is exerted on the tooth in [link] if the tension in the wire is 25.0 N? Note that the force applied to 
the tooth is smaller than the tension in the wire, but this is necessitated by practical considerations of how 


force can be applied in the mouth. Explicitly show how you follow steps in the Problem-Solving Strategy for 
Newton’s laws of motion. 


Solution: 


Use Newton’s laws since we are looking for forces. 
Draw a free-body diagram: 


The T= 25.0N.FindFapp-Using & Fy = 0,so that y- Fipp = 2 T sin® = 2(25.0 N)sin(15°) = 
tension Newton’s applied components 

is laws force of the two 

given gives: is due tensions: 

as to the 


This seems reasonable, since the applied tensions should be greater than the force applied to the tooth. 


Braces are used to 
apply forces to teeth to 
realign them. Shown 
in this figure are the 
tensions applied by the 
wire to the protruding 
tooth. The total force 
applied to the tooth by 
the wire, Fapp, points 
straight toward the 
back of the mouth. 


Exercise: 


Problem: 


[link] shows Superhero and Trusty Sidekick hanging motionless from a rope. Superhero’s mass is 90.0 kg, 
while Trusty Sidekick’s is 55.0 kg, and the mass of the rope is negligible. (a) Draw a free-body diagram of the 
situation showing all forces acting on Superhero, Trusty Sidekick, and the rope. (b) Find the tension in the 
rope above Superhero. (c) Find the tension in the rope between Superhero and Trusty Sidekick. Indicate on 
your free-body diagram the system of interest used to solve each part. 


Superhero 
and Trusty 
Sidekick 
hang 
motionless 
on a rope 
as they try 
to figure 
out what to 
do next. 
Will the 
tension be 
the same 
everywher 
e in the 
rope? 


Exercise: 


Problem: 


A nurse pushes a cart by exerting a force on the handle at a downward angle 35.0° below the horizontal. The 
loaded cart has a mass of 28.0 kg, and the force of friction is 60.0 N. (a) Draw a free-body diagram for the 
system of interest. (b) What force must the nurse exert to move at a constant velocity? 


Exercise: 


Problem: 


Construct Your Own Problem Consider the tension in an elevator cable during the time the elevator starts 
from rest and accelerates its load upward to some cruising velocity. Taking the elevator and its load to be the 
system of interest, draw a free-body diagram. Then calculate the tension in the cable. Among the things to 
consider are the mass of the elevator and its load, the final velocity, and the time taken to reach that velocity. 


Exercise: 


Problem: 


Construct Your Own Problem Consider two people pushing a toboggan with four children on it up a snow- 
covered slope. Construct a problem in which you calculate the acceleration of the toboggan and its load. 
Include a free-body diagram of the appropriate system of interest as the basis for your analysis. Show vector 
forces and their components and explain the choice of coordinates. Among the things to be considered are the 
forces exerted by those pushing, the angle of the slope, and the masses of the toboggan and children. 


Exercise: 


Problem: 


Unreasonable Results (a) Repeat [link], but assume an acceleration of 1.20 m/ s” is produced. (b) What is 
unreasonable about the result? (c) Which premise is unreasonable, and why is it unreasonable? 


Exercise: 


Problem: 


Unreasonable Results (a) What is the initial acceleration of a rocket that has a mass of 1.50 x 10° kg at 
takeoff, the engines of which produce a thrust of 2.00 x 10° N? Do not neglect gravity. (b) What is 
unreasonable about the result? (This result has been unintentionally achieved by several real rockets.) (c) 
Which premise is unreasonable, or which premises are inconsistent? (You may find it useful to compare this 
problem to the rocket problem earlier in this section.) 


Further Applications of Newton’s Laws of Motion 


e Apply problem-solving techniques to solve for quantities in more complex 
systems of forces. 

e Integrate concepts from kinematics to solve problems using Newton's laws of 
motion. 


There are many interesting applications of Newton’s laws of motion, a few more of 
which are presented in this section. These serve also to illustrate some further subtleties 
of physics and to help build problem-solving skills. 


Example: 

Drag Force on a Barge 

Suppose two tugboats push on a barge at different angles, as shown in [link]. The first 
tugboat exerts a force of 2.7 x 10° N in the x-direction, and the second tugboat exerts 
a force of 3.6 x 10° N in the y-direction. 


f 
Lit -- 
= 

ty 


es 
= 
F, = 2.7 x 10° N 


F, = 3.6 x 10°N 


(a) (b) 


(a) A view from above of two tugboats pushing on a barge. (b) The free- 
body diagram for the ship contains only forces acting in the plane of the 
water. It omits the two vertical forces—the weight of the barge and the 
buoyant force of the water supporting it cancel and are not shown. Since the 
applied forces are perpendicular, the x- and y-axes are in the same direction 
as F,, and F,. The problem quickly becomes a one-dimensional problem 
along the direction of F,p, since friction is in the direction opposite to 


app: 


If the mass of the barge is 5.0 x 10° kg and its acceleration is observed to be 
oe 0me an i, s” in the direction shown, what is the drag force of the water on the 


barge resisting the motion? (Note: drag force is a frictional force exerted by fluids, 
such as air or water. The drag force opposes the motion of the object.) 

Strategy 

The directions and magnitudes of acceleration and the applied forces are given in 
[link](a). We will define the total force of the tugboats on the barge as Fp, so that: 
Equation: 


Papp=F, + F, 


Since the barge is flat bottomed, the drag of the water Fp will be in the direction 
opposite to F',,), as shown in the free-body diagram in [link](b). The system of 
interest here is the barge, since the forces on it are given as well as its acceleration. 
Our strategy is to find the magnitude and direction of the net applied force Fpp, and 
then apply Newton’s second law to solve for the drag force Fp. 

Solution 

Since F, and F, are perpendicular, the magnitude and direction of F app are easily 
found. First, the resultant magnitude is given by the Pythagorean theorem: 
Equation: 


Boop BEES 


Papp = V2.7 x 10° N)?2 + (3.6 x 10°N)? = 4.5 x 10°N. 
The angle is given by 
Equation: 


= ea (+) 


—1{ 3.6x10°N \ _ rao 
eh (Sean) = 53°, 


S 
| 


which we know, because of Newton’s first law, is the same direction as the 
acceleration. Fp is in the opposite direction of F ,,), since it acts to slow down the 
acceleration. Therefore, the net external force is in the same direction as F'pp, but its 
magnitude is slightly less than F',)). The problem is now one-dimensional. From 
[link](b), we can see that 

Equation: 


Fret = Fapp — Fp. 


But Newton’s second law states that 
Equation: 


yee = Mma. 


Thus, 
Equation: 


Papp — Fp = ma. 


This can be solved for the magnitude of the drag force of the water F’p in terms of 
known quantities: 
Equation: 


Fp = Fapp — ma. 


Substituting known values gives 
Equation: 


Fp = (4.5 x 10° N) — (5.0 x 10° kg)(7.5 x 10° m/s”) = 7.5 x 10N. 


The direction of Fp has already been determined to be in the direction opposite to 

F app, or at an angle of 53° south of west. 

Discussion 

The numbers used in this example are reasonable for a moderately large barge. It is 
certainly difficult to obtain larger accelerations with tugboats, and small speeds are 
desirable to avoid running the barge into the docks. Drag is relatively small for a well- 
designed hull at low speeds, consistent with the answer to this example, where F’p is 
less than 1/600th of the weight of the ship. 


In the earlier example of a tightrope walker we noted that the tensions in wires 
supporting a mass were equal only because the angles on either side were equal. 
Consider the following example, where the angles are not equal; slightly more 
trigonometry is involved. 


Example: 

Different Tensions at Different Angles 

Consider the traffic light (mass 15.0 kg) suspended from two wires as shown in [link]. 
Find the tension in each wire, neglecting the masses of the wires. 


Just some of the forces are 


Sketch shown here. T 
2 


Only forces on the system 
are shown. 


y T. 
T; Ne 
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(c) (d) 
Free-body diagram 
a The net vertical 
i force is zero, so 
1 Ty Ty + Toy =-wW 
wane bce Tp, 
Tx — The net horizontal 
force is zero, so 
w Tix = -Thx 


(e) 


A traffic light is suspended from two wires. (b) Some 
of the forces involved. (c) Only forces acting on the 
system are shown here. The free-body diagram for the 
traffic light is also shown. (d) The forces projected 
onto vertical (y) and horizontal (x) axes. The 
horizontal components of the tensions must cancel, 
and the sum of the vertical components of the 
tensions must equal the weight of the traffic light. (e) 
The free-body diagram shows the vertical and 
horizontal forces acting on the traffic light. 


Strategy 


The system of interest is the traffic light, and its free-body diagram is shown in [link] 
(c). The three forces involved are not parallel, and so they must be projected onto a 
coordinate system. The most convenient coordinate system has one axis vertical and 
one horizontal, and the vector projections on it are shown in part (d) of the figure. 
There are two unknowns in this problem (7, and 7»), so two equations are needed to 
find them. These two equations come from applying Newton’s second law along the 
vertical and horizontal axes, noting that the net external force is zero along each axis 
because acceleration is zero. 

Solution 

First consider the horizontal or x-axis: 

Equation: 


elastics = Tox = Ti; = 0. 


Thus, as you might expect, 


Equation: 
Tie = Toe. 
This gives us the following relationship between T; and T%: 
Equation: 
T; cos (30°) = T> cos (45°). 
Thus, 
Equation: 


T, = (1.225)T}. 


Note that 7 and 7% are not equal in this case, because the angles on either side are not 
equal. It is reasonable that T> ends up being greater than T;, because it is exerted more 
vertically than T}. 

Now consider the force components along the vertical or y-axis: 

Equation: 


Pret y = Ty + Ta —w= 0. 


This implies 
Equation: 


Diy + Toy = Ww. 


Substituting the expressions for the vertical components gives 
Equation: 


T, sin (30°) + T> sin (45°) = w. 


There are two unknowns in this equation, but substituting the expression for T in 
terms of T; reduces this to one equation with one unknown: 
Equation: 


T, (0.500) + (1.2257)(0.707) = w = mg, 


which yields 
Equation: 


(1.366)T, = (15.0 kg)(9.80 m/s”). 


Solving this last equation gives the magnitude of 7), to be 
Equation: 


T; = 108N. 


Finally, the magnitude of T> is determined using the relationship between them, 72 = 
1.225 T;, found above. Thus we obtain 
Equation: 


ti — 132, N. 


Discussion 

Both tensions would be larger if both wires were more horizontal, and they will be 
equal if and only if the angles on either side are the same (as they were in the earlier 
example of a tightrope walker). 


The bathroom scale is an excellent example of a normal force acting on a body. It 
provides a quantitative reading of how much it must push upward to support the weight 
of an object. But can you predict what you would see on the dial of a bathroom scale if 
you stood on it during an elevator ride? Will you see a value greater than your weight 
when the elevator starts up? What about when the elevator moves upward at a constant 
speed: will the scale still read more than your weight at rest? Consider the following 
example. 


Example: 

What Does the Bathroom Scale Read in an Elevator? 

[link] shows a 75.0-kg man (weight of about 165 lb) standing on a bathroom scale in 
an elevator. Calculate the scale reading: (a) if the elevator accelerates upward at a rate 


of 1.20 m/ 3”, and (b) if the elevator moves upward at a constant speed of 1 m/s. 


System of Interest 


Free-body diagram 
F, 


— iS 
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b) 


(b) 


(a) The various forces acting when a person stands on a 
bathroom scale in an elevator. The arrows are approximately 
correct for when the elevator is accelerating upward—broken 

arrows represent forces too large to be drawn to scale. T is the 
tension in the supporting cable, w is the weight of the person, 
w; is the weight of the scale, w, is the weight of the elevator, 
F, is the force of the scale on the person, Fy is the force of the 
person on the scale, F; is the force of the scale on the floor of 
the elevator, and N is the force of the floor upward on the 
scale. (b) The free-body diagram shows only the external 
forces acting on the designated system of interest—the person. 


Strategy 

If the scale is accurate, its reading will equal F,,, the magnitude of the force the person 
exerts downward on it. [link](a) shows the numerous forces acting on the elevator, 
scale, and person. It makes this one-dimensional problem look much more formidable 
than if the person is chosen to be the system of interest and a free-body diagram is 
drawn as in [link](b). Analysis of the free-body diagram using Newton’s laws can 
produce answers to both parts (a) and (b) of this example, as well as some other 
questions that might arise. The only forces acting on the person are his weight w and 
the upward force of the scale F. According to Newton’s third law F,, and F, are 


equal in magnitude and opposite in direction, so that we need to find F, in order to 
find what the scale reads. We can do this, as usual, by applying Newton’s second law, 
Equation: 


Fone = Mma. 


From the free-body diagram we see that Fn, = Fs — w, so that 
Equation: 


Fi, — w= ma. 


Solving for F; gives an equation with only one unknown: 
Equation: 


Fo =ma+ uw, 


or, because w = mg, simply 
Equation: 


F, = ma-+ mg. 


No assumptions were made about the acceleration, and so this solution should be valid 
for a variety of accelerations in addition to the ones in this exercise. 

Solution for (a) 

In this part of the problem, a = 1.20 m/s’, so that 

Equation: 


F, = (75.0 kg)(1.20 m/s”) + (75.0 kg)(9.80 m/s”), 


yielding 
Equation: 


F, = 825N. 


Discussion for (a) 

This is about 185 lb. What would the scale have read if he were stationary? Since his 
acceleration would be zero, the force of the scale would be equal to his weight: 
Equation: 


Eee = ine = Ut 
1 

(75.0 kg)(9.80 m/s”) 
735 N. 


eles 
| 


So, the scale reading in the elevator is greater than his 735-N (165 |b) weight. This 
means that the scale is pushing up on the person with a force greater than his weight, 
as it must in order to accelerate him upward. Clearly, the greater the acceleration of the 
elevator, the greater the scale reading, consistent with what you feel in rapidly 
accelerating versus slowly accelerating elevators. 

Solution for (b) 

Now, what happens when the elevator reaches a constant upward velocity? Will the 
scale still read more than his weight? For any constant velocity—up, down, or 
stationary—acceleration is zero because a = x, and Av = 0. 

Thus, 

Equation: 


Fo=ma+mg=0+mg. 


Now 
Equation: 


F, = (75.0 kg)(9.80 m/s’), 


which gives 
Equation: 


F, = 735 N. 


Discussion for (b) 

The scale reading is 735 N, which equals the person’s weight. This will be the case 
whenever the elevator has a constant velocity—moving up, moving down, or 
stationary. 


The solution to the previous example also applies to an elevator accelerating 
downward, as mentioned. When an elevator accelerates downward, a is negative, and 
the scale reading is less than the weight of the person, until a constant downward 
velocity is reached, at which time the scale reading again becomes equal to the person’s 
weight. If the elevator is in free-fall and accelerating downward at g, then the scale 
reading will be zero and the person will appear to be weightless. 


Integrating Concepts: Newton’s Laws of Motion and Kinematics 


Physics is most interesting and most powerful when applied to general situations that 
involve more than a narrow set of physical principles. Newton’s laws of motion can 
also be integrated with other concepts that have been discussed previously in this text to 


solve problems of motion. For example, forces produce accelerations, a topic of 
kinematics, and hence the relevance of earlier chapters. When approaching problems 
that involve various types of forces, acceleration, velocity, and/or position, use the 
following steps to approach the problem: 


Problem-Solving Strategy 


Step 1. Identify which physical principles are involved. Listing the givens and the 
quantities to be calculated will allow you to identify the principles involved. 

Step 2. Solve the problem using strategies outlined in the text. If these are available for 
the specific topic, you should refer to them. You should also refer to the sections of the 
text that deal with a particular topic. The following worked example illustrates how 
these strategies are applied to an integrated concept problem. 


Example: 

What Force Must a Soccer Player Exert to Reach Top Speed? 

A soccer player starts from rest and accelerates forward, reaching a velocity of 8.00 
m/s in 2.50 s. (a) What was his average acceleration? (b) What average force did he 
exert backward on the ground to achieve this acceleration? The player’s mass is 70.0 
kg, and air resistance is negligible. 

Strategy 
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The following solutions to each part of the example illustrate how the specific 


problem-solving strategies are applied. These involve identifying knowns and 
unknowns, checking to see if the answer is reasonable, and so forth. 


Solution for (a) 

We are given the initial and final velocities (zero and 8.00 m/s forward); thus, the 
change in velocity is Av = 8.00 m/s. We are given the elapsed time, and so 

At = 2.50 s. The unknown is acceleration, which can be found from its definition: 
Equation: 


Av 
= —, 
At 
Substituting the known values yields 
Equation: 
8.00 m/s 
e 2.50 
= Sylinyee 


Discussion for (a) 

This is an attainable acceleration for an athlete in good condition. 

Solution for (b) 

Here we are asked to find the average force the player exerts backward to achieve this 
forward acceleration. Neglecting air resistance, this would be equal in magnitude to the 
net external force on the player, since this force causes his acceleration. Since we now 
know the player’s acceleration and are given his mass, we can use Newton’s second 
law to find the force exerted. That is, 

Equation: 


Pc = Mma. 


Substituting the known values of m and a gives 
Equation: 


1g net 


(70.0 kg)(3.20 m/s”) 
= 224N. 


Discussion for (b) 

This is about 50 pounds, a reasonable average force. 

This worked example illustrates how to apply problem-solving strategies to situations 
that include topics from different chapters. The first step is to identify the physical 
principles involved in the problem. The second step is to solve for the unknown using 
familiar problem-solving strategies. These strategies are found throughout the text, and 
many worked examples show how to use them for single topics. You will find these 


techniques for integrated concept problems useful in applications of physics outside of 
a physics course, such as in your profession, in other science disciplines, and in 
everyday life. The following problems will build your skills in the broad application of 
physical principles. 


Summary 


e Newton’s laws of motion can be applied in numerous situations to solve problems 
of motion. 

¢ Some problems will contain multiple force vectors acting in different directions on 
an object. Be sure to draw diagrams, resolve all force vectors into horizontal and 
vertical components, and draw a free-body diagram. Always analyze the direction 
in which an object accelerates so that you can determine whether Fyre = ma or 
F, net = 0. 

e The normal force on an object is not always equal in magnitude to the weight of 
the object. If an object is accelerating, the normal force will be less than or greater 
than the weight of the object. Also, if the object is on an inclined plane, the normal 
force will always be less than the full weight of the object. 

¢ Some problems will contain various physical quantities, such as forces, 
acceleration, velocity, or position. You can apply concepts from kinematics and 
dynamics in order to solve these problems of motion. 


Conceptual Questions 


Exercise: 


Problem: 


To simulate the apparent weightlessness of space orbit, astronauts are trained in 
the hold of a cargo aircraft that is accelerating downward at g. Why will they 
appear to be weightless, as measured by standing on a bathroom scale, in this 
accelerated frame of reference? Is there any difference between their apparent 
weightlessness in orbit and in the aircraft? 


Exercise: 
Problem: 
A cartoon shows the toupee coming off the head of an elevator passenger when the 


elevator rapidly stops during an upward ride. Can this really happen without the 
person being tied to the floor of the elevator? Explain your answer. 


Problem Exercises 


Exercise: 
Problem: 
A flea jumps by exerting a force of 1.20 x 10°°N straight down on the ground. A 
breeze blowing on the flea parallel to the ground exerts a force of 0.500 x 10-°N 


on the flea. Find the direction and magnitude of the acceleration of the flea if its 
mass is 6.00 x 107’ kg. Do not neglect the gravitational force. 


Solution: 


10.2 m/s”, 4.67° from vertical 
Exercise: 
Problem: 
Two muscles in the back of the leg pull upward on the Achilles tendon, as shown 
in [link]. (These muscles are called the medial and lateral heads of the 


gastrocnemius muscle.) Find the magnitude and direction of the total force on the 
Achilles tendon. What type of movement could be caused by this force? 


F,(200 N) F,(200 N) 


Achilles 
tendon 


Exercise: 


Problem: 


A 76.0-kg person is being pulled away from a burning building as shown in [link]. 
Calculate the tension in the two ropes if the person is momentarily motionless. 
Include a free-body diagram in your solution. 


Solution: 


The force T) needed to 
hold steady the person 
being rescued from the 
fire is less than her 
weight and less than the 
force T in the other 
rope, since the more 


vertical rope supports a 
greater part of her weight 
(a vertical force). 


Exercise: 
Problem: 
Integrated Concepts A 35.0-kg dolphin decelerates from 12.0 to 7.50 m/s in 2.30 
s to join another dolphin in play. What average force was exerted to slow him if he 


was moving horizontally? (The gravitational force is balanced by the buoyant 
force of the water.) 


Exercise: 
Problem: 
Integrated Concepts When starting a foot race, a 70.0-kg sprinter exerts an 


average force of 650 N backward on the ground for 0.800 s. (a) What is his final 
speed? (b) How far does he travel? 


Solution: 
(a) 7.43 m/s 


(b) 2.97 m 
Exercise: 


Problem: 


Integrated Concepts A large rocket has a mass of 2.00 x 10° kg at takeoff, and 
its engines produce a thrust of 3.50 x 10’ N. (a) Find its initial acceleration if it 
takes off vertically. (b) How long does it take to reach a velocity of 120 km/h 
straight up, assuming constant mass and thrust? (c) In reality, the mass of a rocket 
decreases significantly as its fuel is consumed. Describe qualitatively how this 
affects the acceleration and time for this motion. 


Exercise: 


Problem: 


Integrated Concepts A basketball player jumps straight up for a ball. To do this, 
he lowers his body 0.300 m and then accelerates through this distance by 
forcefully straightening his legs. This player leaves the floor with a vertical 
velocity sufficient to carry him 0.900 m above the floor. (a) Calculate his velocity 
when he leaves the floor. (b) Calculate his acceleration while he is straightening 
his legs. He goes from zero to the velocity found in part (a) in a distance of 0.300 
m. (c) Calculate the force he exerts on the floor to do this, given that his mass is 
110 kg. 


Solution: 
(a) 4.20 m/s 
(b) 29.4 m/s” 


(c) 4.31 x 10° N 

Exercise: 
Problem: 
Integrated Concepts A 2.50-kg fireworks shell is fired straight up from a mortar 
and reaches a height of 110 m. (a) Neglecting air resistance (a poor assumption, 
but we will make it for this example), calculate the shell’s velocity when it leaves 
the mortar. (b) The mortar itself is a tube 0.450 m long. Calculate the average 
acceleration of the shell in the tube as it goes from zero to the velocity found in 


(a). (c) What is the average force on the shell in the mortar? Express your answer 
in newtons and as a ratio to the weight of the shell. 


Exercise: 


Problem: 


Integrated Concepts Repeat [link] for a shell fired at an angle 10.0° from the 
vertical. 


Solution: 
(a) 47.1 m/s 
(b) 2.47 x 10° m/s?” 


(c) 6.18 x 10° N . The average force is 252 times the shell’s weight. 


Exercise: 


Problem: 


Integrated Concepts An elevator filled with passengers has a mass of 1700 kg. 
(a) The elevator accelerates upward from rest at a rate of 1.20 m/ s” for 1.50. 
Calculate the tension in the cable supporting the elevator. (b) The elevator 
continues upward at constant velocity for 8.50 s. What is the tension in the cable 
during this time? (c) The elevator decelerates at a rate of 0.600 m/ s” for 3.00 s. 
What is the tension in the cable during deceleration? (d) How high has the elevator 
moved above its original starting point, and what is its final velocity? 


Exercise: 


Problem: 


Unreasonable Results (a) What is the final velocity of a car originally traveling at 
50.0 km/h that decelerates at a rate of 0.400 m/ s* for 50.0 s? (b) What is 
unreasonable about the result? (c) Which premise is unreasonable, or which 
premises are inconsistent? 


Exercise: 


Problem: 


Unreasonable Results A 75.0-kg man stands on a bathroom scale in an elevator 
that accelerates from rest to 30.0 m/s in 2.00 s. (a) Calculate the scale reading in 
newtons and compare it with his weight. (The scale exerts an upward force on him 
equal to its reading.) (b) What is unreasonable about the result? (c) Which premise 
is unreasonable, or which premises are inconsistent? 


Extended Topic: The Four Basic Forces—An Introduction 
e Understand the four basic forces that underlie the processes in nature. 


One of the most remarkable simplifications in physics is that only four distinct forces account 
for all known phenomena. In fact, nearly all of the forces we experience directly are due to only 
one basic force, called the electromagnetic force. (The gravitational force is the only force we 
experience directly that is not electromagnetic.) This is a tremendous simplification of the 
myriad of apparently different forces we can list, only a few of which were discussed in the 
previous section. As we will see, the basic forces are all thought to act through the exchange of 
microscopic carrier particles, and the characteristics of the basic forces are determined by the 
types of particles exchanged. Action at a distance, such as the gravitational force of Earth on the 
Moon, is explained by the existence of a force field rather than by “physical contact.” 


The four basic forces are the gravitational force, the electromagnetic force, the weak nuclear 
force, and the strong nuclear force. Their properties are summarized in [link]. Since the weak 
and strong nuclear forces act over an extremely short range, the size of a nucleus or less, we do 
not experience them directly, although they are crucial to the very structure of matter. These 
forces determine which nuclei are stable and which decay, and they are the basis of the release 
of energy in certain nuclear reactions. Nuclear forces determine not only the stability of nuclei, 
but also the relative abundance of elements in nature. The properties of the nucleus of an atom 
determine the number of electrons it has and, thus, indirectly determine the chemistry of the 
atom. More will be said of all of these topics in later chapters. 


Note: 

Concept Connections: The Four Basic Forces 

The four basic forces will be encountered in more detail as you progress through the text. The 
gravitational force is defined in Uniform Circular Motion and Gravitation, electric force in 
Electric Charge and Electric Field, magnetic force in Magnetism, and nuclear forces in 
Radioactivity and Nuclear Physics. On a macroscopic scale, electromagnetism and gravity are 
the basis for all forces. The nuclear forces are vital to the substructure of matter, but they are 
not directly experienced on the macroscopic scale. 
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Properties of the Four Basic Forces| footnote | 

The graviton is a proposed particle, though it has not yet been observed by scientists. See the 
discussion of gravitational waves later in this section. The particles W*, W_, and Z° are called 
vector bosons; these were predicted by theory and first observed in 1983. There are eight types 
of gluons proposed by scientists, and their existence is indicated by meson exchange in the 
nuclei of atoms. 


The gravitational force is surprisingly weak—it is only because gravity is always attractive that 
we notice it at all. Our weight is the gravitational force due to the entire Earth acting on us. On 
the very large scale, as in astronomical systems, the gravitational force is the dominant force 
determining the motions of moons, planets, stars, and galaxies. The gravitational force also 
affects the nature of space and time. As we shall see later in the study of general relativity, 
space is curved in the vicinity of very massive bodies, such as the Sun, and time actually slows 
down near massive bodies. 


Electromagnetic forces can be either attractive or repulsive. They are long-range forces, which 
act over extremely large distances, and they nearly cancel for macroscopic objects. (Remember 
that it is the net external force that is important.) If they did not cancel, electromagnetic forces 
would completely overwhelm the gravitational force. The electromagnetic force is a 
combination of electrical forces (such as those that cause static electricity) and magnetic forces 
(such as those that affect a compass needle). These two forces were thought to be quite distinct 
until early in the 19th century, when scientists began to discover that they are different 
manifestations of the same force. This discovery is a classical case of the unification of forces. 
Similarly, friction, tension, and all of the other classes of forces we experience directly (except 
gravity, of course) are due to electromagnetic interactions of atoms and molecules. It is still 
convenient to consider these forces separately in specific applications, however, because of the 
ways they manifest themselves. 


Note: 
Concept Connections: Unifying Forces 


Attempts to unify the four basic forces are discussed in relation to elementary particles later in 
this text. By “unify” we mean finding connections between the forces that show that they are 
different manifestations of a single force. Even if such unification is achieved, the forces will 
retain their separate characteristics on the macroscopic scale and may be identical only under 
extreme conditions such as those existing in the early universe. 


Physicists are now exploring whether the four basic forces are in some way related. Attempts to 
unify all forces into one come under the rubric of Grand Unified Theories (GUTs), with which 
there has been some success in recent years. It is now known that under conditions of extremely 
high density and temperature, such as existed in the early universe, the electromagnetic and 
weak nuclear forces are indistinguishable. They can now be considered to be different 
manifestations of one force, called the electroweak force. So the list of four has been reduced in 
a sense to only three. Further progress in unifying all forces is proving difficult—especially the 
inclusion of the gravitational force, which has the special characteristics of affecting the space 
and time in which the other forces exist. 


While the unification of forces will not affect how we discuss forces in this text, it is fascinating 
that such underlying simplicity exists in the face of the overt complexity of the universe. There 
is no reason that nature must be simple—it simply is. 


Action at a Distance: Concept of a Field 


All forces act at a distance. This is obvious for the gravitational force. Earth and the Moon, for 
example, interact without coming into contact. It is also true for all other forces. Friction, for 
example, is an electromagnetic force between atoms that may not actually touch. What is it that 
carries forces between objects? One way to answer this question is to imagine that a force field 
surrounds whatever object creates the force. A second object (often called a test object) placed 
in this field will experience a force that is a function of location and other variables. The field 
itself is the “thing” that carries the force from one object to another. The field is defined so as to 
be a characteristic of the object creating it; the field does not depend on the test object placed in 
it. Earth’s gravitational field, for example, is a function of the mass of Earth and the distance 
from its center, independent of the presence of other masses. The concept of a field is useful 
because equations can be written for force fields surrounding objects (for gravity, this yields 

w = mg at Earth’s surface), and motions can be calculated from these equations. (See [link].) 


The electric force field 
between a positively 
charged particle and a 
negatively charged 
particle. When a positive 
test charge is placed in 
the field, the charge will 
experience a force in the 
direction of the force field 
lines. 


Note: 

Concept Connections: Force Fields 

The concept of a force field is also used in connection with electric charge and is presented in 
Electric Charge and Electric Field. It is also a useful idea for all the basic forces, as will be 
seen in Particle Physics. Fields help us to visualize forces and how they are transmitted, as well 
as to describe them with precision and to link forces with subatomic carrier particles. 


The field concept has been applied very successfully; we can calculate motions and describe 
nature to high precision using field equations. As useful as the field concept is, however, it 
leaves unanswered the question of what carries the force. It has been proposed in recent 
decades, starting in 1935 with Hideki Yukawa’s (1907-1981) work on the strong nuclear force, 
that all forces are transmitted by the exchange of elementary particles. We can visualize particle 
exchange as analogous to macroscopic phenomena such as two people passing a basketball 
back and forth, thereby exerting a repulsive force without touching one another. (See [link].) 
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The exchange of masses 
resulting in repulsive forces. 
(a) The person throwing the 
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This idea of particle exchange deepens rather than contradicts field concepts. It is more 
satisfying philosophically to think of something physical actually moving between objects 
acting at a distance. [link] lists the exchange or carrier particles, both observed and proposed, 
that carry the four forces. But the real fruit of the particle-exchange proposal is that searches for 
Yukawa’s proposed particle found it and a number of others that were completely unexpected, 
stimulating yet more research. All of this research eventually led to the proposal of quarks as 
the underlying substructure of matter, which is a basic tenet of GUTs. If successful, these 
theories will explain not only forces, but also the structure of matter itself. Yet physics is an 
experimental science, so the test of these theories must lie in the domain of the real world. As of 
this writing, scientists at the CERN laboratory in Switzerland are starting to test these theories 
using the world’s largest particle accelerator: the Large Hadron Collider. This accelerator (27 
km in circumference) allows two high-energy proton beams, traveling in opposite directions, to 
collide. An energy of 14 trillion electron volts will be available. It is anticipated that some new 
particles, possibly force carrier particles, will be found. (See [link].) One of the force carriers of 
high interest that researchers hope to detect is the Higgs boson. The observation of its properties 
might tell us why different particles have different masses. 


The world’s largest particle accelerator 
spans the border between Switzerland and 
France. Two beams, traveling in opposite 

directions close to the speed of light, collide 
in a tube similar to the central tube shown 

here. External magnets determine the beam’s 

path. Special detectors will analyze particles 

created in these collisions. Questions as 

broad as what is the origin of mass and what 
was matter like the first few seconds of our 
universe will be explored. This accelerator 

began preliminary operation in 2008. (credit: 

Frank Hommes) 


Tiny particles also have wave-like behavior, something we will explore more in a later chapter. 
To better understand force-carrier particles from another perspective, let us consider gravity. 
The search for gravitational waves has been going on for a number of years. Almost 100 years 


ago, Einstein predicted the existence of these waves as part of his general theory of relativity. 
Gravitational waves are created during the collision of massive stars, in black holes, or in 
supernova explosions—like shock waves. These gravitational waves will travel through space 
from such sites much like a pebble dropped into a pond sends out ripples—except these waves 
move at the speed of light. A detector apparatus has been built in the U.S., consisting of two 
large installations nearly 3000 km apart—one in Washington state and one in Louisiana! The 
facility is called the Laser Interferometer Gravitational-Wave Observatory (LIGO). Each 
installation is designed to use optical lasers to examine any slight shift in the relative positions 
of two masses due to the effect of gravity waves. The two sites allow simultaneous 
measurements of these small effects to be separated from other natural phenomena, such as 
earthquakes. Initial operation of the detectors began in 2002, and work is proceeding on 
increasing their sensitivity. Similar installations have been built in Italy (VIRGO), Germany 
(GEO600), and Japan (TAMA300) to provide a worldwide network of gravitational wave 
detectors. 


International collaboration in this area is moving into space with the joint EU/US project LISA 
(Laser Interferometer Space Antenna). Earthquakes and other Earthly noises will be no problem 
for these monitoring spacecraft. LISA will complement LIGO by looking at much more 
massive black holes through the observation of gravitational-wave sources emitting much larger 
wavelengths. Three satellites will be placed in space above Earth in an equilateral triangle (with 
5,000,000-km sides) ([link]). The system will measure the relative positions of each satellite to 
detect passing gravitational waves. Accuracy to within 10% of the size of an atom will be 
needed to detect any waves. The launch of this project might be as early as 2018. 


“I’m sure LIGO will tell us something about the universe that we didn’t know before. The 
history of science tells us that any time you go where you haven't been before, you usually find 
something that really shakes the scientific paradigms of the day. Whether gravitational wave 
astrophysics will do that, only time will tell.” —David Reitze, LIGO Input Optics Manager, 
University of Florida 


Space-based future experiments for 
the measurement of gravitational 
waves. Shown here is a drawing of 


LISA’s orbit. Each satellite of LISA 
will consist of a laser source and a 
mass. The lasers will transmit a signal 
to measure the distance between each 
satellite’s test mass. The relative 
motion of these masses will provide 
information about passing 
gravitational waves. (credit: NASA) 


The ideas presented in this section are but a glimpse into topics of modern physics that will be 
covered in much greater depth in later chapters. 


Summary 


¢ The various types of forces that are categorized for use in many applications are all 
manifestations of the four basic forces in nature. 

e The properties of these forces are summarized in [link]. 

e Everything we experience directly without sensitive instruments is due to either 
electromagnetic forces or gravitational forces. The nuclear forces are responsible for the 
submicroscopic structure of matter, but they are not directly sensed because of their short 
ranges. Attempts are being made to show all four forces are different manifestations of a 
single unified force. 

e A force field surrounds an object creating a force and is the carrier of that force. 


Conceptual Questions 


Exercise: 
Problem: 
Explain, in terms of the properties of the four basic forces, why people notice the 
gravitational force acting on their bodies if it is such a comparatively weak force. 
Exercise: 
Problem: 
What is the dominant force between astronomical objects? Why are the other three basic 
forces less significant over these very large distances? 
Exercise: 
Problem: 


Give a detailed example of how the exchange of a particle can result in an attractive force. 
(For example, consider one child pulling a toy out of the hands of another.) 


Problem Exercises 


Exercise: 
Problem: 
(a) What is the strength of the weak nuclear force relative to the strong nuclear force? (b) 
What is the strength of the weak nuclear force relative to the electromagnetic force? Since 
the weak nuclear force acts at only very short distances, such as inside nuclei, where the 
strong and electromagnetic forces also act, it might seem surprising that we have any 


knowledge of it at all. We have such knowledge because the weak nuclear force is 
responsible for beta decay, a type of nuclear decay not explained by other forces. 


Solution: 
(ay eld 


(b)1 x 10°" 

Exercise: 
Problem: 
(a) What is the ratio of the strength of the gravitational force to that of the strong nuclear 
force? (b) What is the ratio of the strength of the gravitational force to that of the weak 
nuclear force? (c) What is the ratio of the strength of the gravitational force to that of the 


electromagnetic force? What do your answers imply about the influence of the 
gravitational force on atomic nuclei? 


Exercise: 
Problem: 
What is the ratio of the strength of the strong nuclear force to that of the electromagnetic 
force? Based on this ratio, you might expect that the strong force dominates the nucleus, 
which is true for small nuclei. Large nuclei, however, have sizes greater than the range of 


the strong nuclear force. At these sizes, the electromagnetic force begins to affect nuclear 
stability. These facts will be used to explain nuclear fusion and fission later in this text. 


Solution: 


10? 


Glossary 


carrier particle 
a fundamental particle of nature that is surrounded by a characteristic force field; photons 
are carrier particles of the electromagnetic force 


force field 
a region in which a test particle will experience a force 


Introduction: Further Applications of Newton’s Laws 
class="introduction" 


Total hip 
replacemen 
t surgery 
has become 
a common 
procedure. 
The head 
(or ball) of 
the 
patient’s 
femur fits 
into a cup 
that has a 
hard 
plastic-like 
inner 
lining. 
(credit: 
National 
Institutes of 
Health, via 
Wikimedia 
Commons) 


Describe the forces on the hip joint. What means are taken to ensure that 
this will be a good movable joint? From the photograph (for an adult) in 
[link], estimate the dimensions of the artificial device. 


It is difficult to categorize forces into various types (aside from the four 
basic forces discussed in previous chapter). We know that a net force affects 
the motion, position, and shape of an object. It is useful at this point to look 
at some particularly interesting and common forces that will provide further 
applications of Newton’s laws of motion. We have in mind the forces of 
friction, air or liquid drag, and deformation. 


Friction 


e Discuss the general characteristics of friction. 
e Describe the various types of friction. 
¢ Calculate the magnitude of static and kinetic friction. 


Friction is a force that is around us all the time that opposes relative motion 
between systems in contact but also allows us to move (which you have 
discovered if you have ever tried to walk on ice). While a common force, 
the behavior of friction is actually very complicated and is still not 
completely understood. We have to rely heavily on observations for 
whatever understandings we can gain. However, we can still deal with its 
more elementary general characteristics and understand the circumstances 
in which it behaves. 


Note: 
Friction 
Friction is a force that opposes relative motion between systems in contact. 


One of the simpler characteristics of friction is that it is parallel to the 
contact surface between systems and always in a direction that opposes 
motion or attempted motion of the systems relative to each other. If two 
systems are in contact and moving relative to one another, then the friction 
between them is called kinetic friction. For example, friction slows a 
hockey puck sliding on ice. But when objects are stationary, static friction 
can act between them; the static friction is usually greater than the kinetic 
friction between the objects. 


Note: 

Kinetic Friction 

If two systems are in contact and moving relative to one another, then the 
friction between them is called kinetic friction. 


Imagine, for example, trying to slide a heavy crate across a concrete floor— 
you may push harder and harder on the crate and not move it at all. This 
means that the static friction responds to what you do—it increases to be 
equal to and in the opposite direction of your push. But if you finally push 
hard enough, the crate seems to slip suddenly and starts to move. Once in 
motion it is easier to keep it in motion than it was to get it started, indicating 
that the kinetic friction force is less than the static friction force. If you add 
mass to the crate, say by placing a box on top of it, you need to push even 
harder to get it started and also to keep it moving. Furthermore, if you oiled 
the concrete you would find it to be easier to get the crate started and keep 
it going (as you might expect). 


[link] is a crude pictorial representation of how friction occurs at the 
interface between two objects. Close-up inspection of these surfaces shows 
them to be rough. So when you push to get an object moving (in this case, a 
crate), you must raise the object until it can skip along with just the tips of 
the surface hitting, break off the points, or do both. A considerable force 
can be resisted by friction with no apparent motion. The harder the surfaces 
are pushed together (such as if another box is placed on the crate), the more 
force is needed to move them. Part of the friction is due to adhesive forces 
between the surface molecules of the two objects, which explain the 
dependence of friction on the nature of the substances. Adhesion varies 
with substances in contact and is a complicated aspect of surface physics. 
Once an object is moving, there are fewer points of contact (fewer 
molecules adhering), so less force is required to keep the object moving. At 


small but nonzero speeds, friction is nearly independent of speed. 


Direction of motion 
or attempted motion 


Frictional forces, such as f, always 

oppose motion or attempted motion 

between objects in contact. Friction 
arises in part because of the roughness of 


the surfaces in contact, as seen in the 
expanded view. In order for the object to 
move, it must rise to where the peaks can 
skip along the bottom surface. Thus a 
force is required just to set the object in 
motion. Some of the peaks will be broken 
off, also requiring a force to maintain 
motion. Much of the friction is actually 
due to attractive forces between 
molecules making up the two objects, so 
that even perfectly smooth surfaces are 
not friction-free. Such adhesive forces 
also depend on the substances the 
surfaces are made of, explaining, for 
example, why rubber-soled shoes slip 
less than those with leather soles. 


The magnitude of the frictional force has two forms: one for static 
situations (static friction), the other for when there is motion (kinetic 
friction). 


When there is no motion between the objects, the magnitude of static 
friction f, is 
Equation: 


f <pN, 


where J, is the coefficient of static friction and N is the magnitude of the 
normal force (the force perpendicular to the surface). 


Note: 
Magnitude of Static Friction 
Magnitude of static friction f, is 


Equation: 
fs S Ms, 


where [Us is the coefficient of static friction and NV is the magnitude of the 
normal force. 


The symbol < means less than or equal to, implying that static friction can 
have a minimum and a maximum value of ps. Static friction is a 
responsive force that increases to be equal and opposite to whatever force is 
exerted, up to its maximum limit. Once the applied force exceeds fs(max), 
the object will move. Thus 

Equation: 


J (Gnas) = pV. 


Once an object is moving, the magnitude of kinetic friction f,, is given by 
Equation: 


fir = KN, 


where Ux is the coefficient of kinetic friction. A system in which 
fx = px is described as a system in which friction behaves simply. 


Note: 

Magnitude of Kinetic Friction 

The magnitude of kinetic friction f, is given by 
Equation: 


fx = uN, 


where jt, is the coefficient of kinetic friction. 


As seen in [link], the coefficients of kinetic friction are less than their static 
counterparts. That values of yz in [link] are stated to only one or, at most, 
two digits is an indication of the approximate description of friction given 
by the above two equations. 


Static Kinetic 

friction friction 
System ve ia 
Rubber on dry concrete 1.0 0.7 
Rubber on wet concrete 0.7 0.5 
Wood on wood 0.5 0.3 
Waxed wood on wet snow 0.14 0.1 
Metal on wood 0.5 0.3 
Steel on steel (dry) 0.6 0.3 
Steel on steel (oiled) 0.05 0.03 
Teflon on steel 0.04 0.04 
Bone lubricated by synovial 0.016 0.015 


fluid 


Shoes on wood 0.9 0.7 


Static Kinetic 


friction friction 
Hs Mk 
System 
Shoes on ice 0.1 0.05 
Ice on ice 0.1 0.03 
Steel on ice 0.04 0.02 


Coefficients of Static and Kinetic Friction 


The equations given earlier include the dependence of friction on materials 
and the normal force. The direction of friction is always opposite that of 
motion, parallel to the surface between objects, and perpendicular to the 
normal force. For example, if the crate you try to push (with a force parallel 
to the floor) has a mass of 100 kg, then the normal force would be equal to 
its weight, W = mg = (100 kg)(9.80 m/s”) = 980 N, perpendicular to 
the floor. If the coefficient of static friction is 0.45, you would have to exert 
a force parallel to the floor greater than 

fs(max) = UsN = (0.45)(980 N) = 440 N to move the crate. Once there is 
motion, friction is less and the coefficient of kinetic friction might be 0.30, 
so that a force of only 290 N (fx = uxN = (0.30)(980 N) = 290 N) 
would keep it moving at a constant speed. If the floor is lubricated, both 
coefficients are considerably less than they would be without lubrication. 
Coefficient of friction is a unit less quantity with a magnitude usually 
between 0 and 1.0. The coefficient of the friction depends on the two 
surfaces that are in contact. 


Note: 

Take-Home Experiment 

Find a small plastic object (such as a food container) and slide it on a 
kitchen table by giving it a gentle tap. Now spray water on the table, 


simulating a light shower of rain. What happens now when you give the 
object the same-sized tap? Now add a few drops of (vegetable or olive) oil 
on the surface of the water and give the same tap. What happens now? This 
latter situation is particularly important for drivers to note, especially after 
a light rain shower. Why? 


Many people have experienced the slipperiness of walking on ice. However, 
many parts of the body, especially the joints, have much smaller 
coefficients of friction—often three or four times less than ice. A joint is 
formed by the ends of two bones, which are connected by thick tissues. The 
knee joint is formed by the lower leg bone (the tibia) and the thighbone (the 
femur). The hip is a ball (at the end of the femur) and socket (part of the 
pelvis) joint. The ends of the bones in the joint are covered by cartilage, 
which provides a smooth, almost glassy surface. The joints also produce a 
fluid (synovial fluid) that reduces friction and wear. A damaged or arthritic 
joint can be replaced by an artificial joint ({link]). These replacements can 
be made of metals (stainless steel or titanium) or plastic (polyethylene), also 
with very small coefficients of friction. 


Artificial knee 
replacement is a 
procedure that has been 
performed for more than 
20 years. In this figure, 
we see the post-op x rays 
of the right knee joint 
replacement. (credit: 
Mike Baird, Flickr) 


Other natural lubricants include saliva produced in our mouths to aid in the 
swallowing process, and the slippery mucus found between organs in the 
body, allowing them to move freely past each other during heartbeats, 
during breathing, and when a person moves. Artificial lubricants are also 
common in hospitals and doctor’s clinics. For example, when ultrasonic 
imaging is carried out, the gel that couples the transducer to the skin also 
serves to to lubricate the surface between the transducer and the skin— 
thereby reducing the coefficient of friction between the two surfaces. This 
allows the transducer to move freely over the skin. 


Example: 

Skiing Exercise 

A skier with a mass of 62 kg is sliding down a snowy slope. Find the 
coefficient of kinetic friction for the skier if friction is known to be 45.0 N. 
Strategy 

The magnitude of kinetic friction was given in to be 45.0 N. Kinetic 
friction is related to the normal force N as f;, = p,N; thus, the coefficient 
of kinetic friction can be found if we can find the normal force of the skier 
on a slope. The normal force is always perpendicular to the surface, and 
since there is no motion perpendicular to the surface, the normal force 
should equal the component of the skier’s weight perpendicular to the 
slope. (See the skier and free-body diagram in [link].) 


Free-body diagram 


The motion of the skier and friction are 
parallel to the slope and so it is most 
convenient to project all forces onto a 

coordinate system where one axis is parallel 
to the slope and the other is perpendicular 

(axes shown to left of skier). N (the normal 
force) is perpendicular to the slope, and f 
(the friction) is parallel to the slope, but w 
(the skier’s weight) has components along 

both axes, namely w, and W //. N is equal 


in magnitude to w__, so there is no motion 
perpendicular to the slope. However, f is 
less than W /; in magnitude, so there is 


acceleration down the slope (along the x- 
axis). 


That is, 
Equation: 


N = w, = w cos 25° = mg cos 25°. 


Substituting this into our expression for kinetic friction, we get 
Equation: 


fi = UxmMg cos 25°, 


which can now be solved for the coefficient of kinetic friction px. 


Solution 
Solving for Ux, gives 
Equation: 


kN Ww COs 25° mg cos 25°. 
Substituting known values on the right-hand side of the equation, 
Equation: 


45.0 N 
0 


(62 kg)(9.80 m/s”) (0.906) 


Discussion 

This result is a little smaller than the coefficient listed in [link] for waxed 
wood on snow, but it is still reasonable since values of the coefficients of 
friction can vary greatly. In situations like this, where an object of mass m 
slides down a slope that makes an angle @ with the horizontal, friction is 
given by f, = u,.mgcos 0. All objects will slide down a slope with 
constant acceleration under these circumstances. Proof of this is left for 
this chapter’s Problems and Exercises. 


Note: 

Take-Home Experiment 

An object will slide down an inclined plane at a constant velocity if the net 
force on the object is zero. We can use this fact to measure the coefficient 
of kinetic friction between two objects. As shown in [link], the kinetic 
friction on a slope fx = Uxmg cos 9. The component of the weight down 
the slope is equal to mg sin @ (see the free-body diagram in [link]). These 
forces act in opposite directions, so when they have equal magnitude, the 
acceleration is zero. Writing these out: 

Equation: 


fx = Fg, 


Equation: 
[u.mg cos 6 = mg sin 0. 


Solving for 4., we find that 
Equation: 


Put a coin on a book and tilt it until the coin slides at a constant velocity 
down the book. You might need to tap the book lightly to get the coin to 
move. Measure the angle of tilt relative to the horizontal and find p,.. Note 
that the coin will not start to slide at all until an angle greater than @ is 
attained, since the coefficient of static friction is larger than the coefficient 
of kinetic friction. Discuss how this may affect the value for ju, and its 
uncertainty. 


We have discussed that when an object rests on a horizontal surface, there is 
a normal force supporting it equal in magnitude to its weight. Furthermore, 
simple friction is always proportional to the normal force. 


Note: 

Making Connections: Submicroscopic Explanations of Friction 

The simpler aspects of friction dealt with so far are its macroscopic (large- 
scale) characteristics. Great strides have been made in the atomic-scale 
explanation of friction during the past several decades. Researchers are 
finding that the atomic nature of friction seems to have several 
fundamental characteristics. These characteristics not only explain some of 
the simpler aspects of friction—they also hold the potential for the 
development of nearly friction-free environments that could save hundreds 
of billions of dollars in energy which is currently being converted 
(unnecessarily) to heat. 


[link] illustrates one macroscopic characteristic of friction that is explained 
by microscopic (small-scale) research. We have noted that friction is 
proportional to the normal force, but not to the area in contact, a somewhat 
counterintuitive notion. When two rough surfaces are in contact, the actual 
contact area is a tiny fraction of the total area since only high spots touch. 
When a greater normal force is exerted, the actual contact area increases, 
and it is found that the friction is proportional to this area. 

N 


Small normal 
a force 


N 
a _= Large normal 
force 
N 


Two rough surfaces in 
contact have a much 
smaller area of actual 

contact than their total 
area. When there is a 

greater normal force as a 
result of a greater applied 
force, the area of actual 
contact increases as does 
friction. 


But the atomic-scale view promises to explain far more than the simpler 
features of friction. The mechanism for how heat is generated is now being 
determined. In other words, why do surfaces get warmer when rubbed? 
Essentially, atoms are linked with one another to form lattices. When 
surfaces rub, the surface atoms adhere and cause atomic lattices to vibrate 
—essentially creating sound waves that penetrate the material. The sound 
waves diminish with distance and their energy is converted into heat. 
Chemical reactions that are related to frictional wear can also occur 


between atoms and molecules on the surfaces. [link] shows how the tip of a 
probe drawn across another material is deformed by atomic-scale friction. 
The force needed to drag the tip can be measured and is found to be related 
to shear stress, which will be discussed later in this chapter. The variation in 
shear stress is remarkable (more than a factor of 10/2 ) and difficult to 
predict theoretically, but shear stress is yielding a fundamental 
understanding of a large-scale phenomenon known since ancient times— 
friction. 
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The tip of a probe is deformed 
sideways by frictional force as 
the probe is dragged across a 
surface. Measurements of how 
the force varies for different 
materials are yielding 
fundamental insights into the 
atomic nature of friction. 


Note: 
PhET Explorations: Forces and Motion 


Explore the forces at work when you try to push a filing cabinet. Create an 
applied force and see the resulting friction force and total force acting on 
the cabinet. Charts show the forces, position, velocity, and acceleration vs. 
time. Draw a free-body diagram of all the forces (including gravitational 
and normal forces). 


Forces 
and 
Motio 
n 


Section Summary 


e Friction is a contact force between systems that opposes the motion or 
attempted motion between them. Simple friction is proportional to the 
normal force N pushing the systems together. (A normal force is 
always perpendicular to the contact surface between systems.) Friction 
depends on both of the materials involved. The magnitude of static 
friction f, between systems stationary relative to one another is given 
by 
Equation: 


iss, 


where [Js is the coefficient of static friction, which depends on both of 
the materials. 

e The kinetic friction force f,, between systems moving relative to one 
another is given by 
Equation: 


ic = 14, 


where Ux is the coefficient of kinetic friction, which also depends on 
both materials. 


Conceptual Questions 


Exercise: 
Problem: 
Define normal force. What is its relationship to friction when friction 
behaves simply? 
Exercise: 
Problem: 
The glue on a piece of tape can exert forces. Can these forces be a type 


of simple friction? Explain, considering especially that tape can stick 
to vertical walls and even to ceilings. 


Exercise: 


Problem: 


When you learn to drive, you discover that you need to let up slightly 
on the brake pedal as you come to a stop or the car will stop with a 
jerk. Explain this in terms of the relationship between static and kinetic 
friction. 


Exercise: 


Problem: 


When you push a piece of chalk across a chalkboard, it sometimes 
screeches because it rapidly alternates between slipping and sticking to 
the board. Describe this process in more detail, in particular explaining 
how it is related to the fact that kinetic friction is less than static 
friction. (The same slip-grab process occurs when tires screech on 
pavement.) 


Problems & Exercises 


Exercise: 


Problem: 


A physics major is cooking breakfast when he notices that the 
frictional force between his steel spatula and his Teflon frying pan is 
only 0.200 N. Knowing the coefficient of kinetic friction between the 
two materials, he quickly calculates the normal force. What is it? 


Solution: 


5.00 N 
Exercise: 


Problem: 


(a) When rebuilding her car’s engine, a physics major must exert 300 
N of force to insert a dry steel piston into a steel cylinder. What is the 
magnitude of the normal force between the piston and cylinder? (b) 
What is the magnitude of the force would she have to exert if the steel 
parts were oiled? 


Exercise: 


Problem: 


(a) What is the maximum frictional force in the knee joint of a person 
who supports 66.0 kg of her mass on that knee? (b) During strenuous 
exercise it is possible to exert forces to the joints that are easily ten 
times greater than the weight being supported. What is the maximum 
force of friction under such conditions? The frictional forces in joints 
are relatively small in all circumstances except when the joints 
deteriorate, such as from injury or arthritis. Increased frictional forces 
can cause further damage and pain. 


Exercise: 


Problem: 


Suppose you have a 120-kg wooden crate resting on a wood floor. (a) 
What maximum force can you exert horizontally on the crate without 
moving it? (b) If you continue to exert this force once the crate starts 
to slip, what will the magnitude of its acceleration then be? 


Solution: 
(a) 588 N 


(b) 1.96 m/s” 
Exercise: 


Problem: 


(a) If half of the weight of asmall 1.00 x 10° kg utility truck is 
supported by its two drive wheels, what is the magnitude of the 
maximum acceleration it can achieve on dry concrete? (b) Will a metal 
cabinet lying on the wooden bed of the truck slip if it accelerates at 
this rate? (c) Solve both problems assuming the truck has four-wheel 
drive. 


Exercise: 


Problem: 


A team of eight dogs pulls a sled with waxed wood runners on wet 
snow (mush!). The dogs have average masses of 19.0 kg, and the 
loaded sled with its rider has a mass of 210 kg. (a) Calculate the 
magnitude of the acceleration starting from rest if each dog exerts an 
average force of 185 N backward on the snow. (b) What is the 
magnitude of the acceleration once the sled starts to move? (c) For 
both situations, calculate the magnitude of the force in the coupling 
between the dogs and the sled. 


Solution: 


(a) 3.29 m/s? 
(b) 3.52 m/s? 


(c) 980 N; 945 N 


Exercise: 


Problem: 


Consider the 65.0-kg ice skater being pushed by two others shown in 
[link]. (a) Find the direction and magnitude of Fto;, the total force 
exerted on her by the others, given that the magnitudes F, and F are 
26.4 N and 18.6 N, respectively. (b) What is her initial acceleration if 
she is initially stationary and wearing steel-bladed skates that point in 
the direction of F'4,,? (c) What is her acceleration assuming she is 
already moving in the direction of F,.4,2 (Remember that friction 
always acts in the direction opposite that of motion or attempted 
motion between surfaces in contact.) 


(a) (b) 


Exercise: 


Problem: 


Show that the acceleration of any object down a frictionless incline 
that makes an angle @ with the horizontal is a = g sin 0. (Note that 
this acceleration is independent of mass.) 


Exercise: 


Problem: 


Show that the acceleration of any object down an incline where 
friction behaves simply (that is, where f;, = p4,N) is 

a = g( sin 0 — pcos @ ). Note that the acceleration is independent of 
mass and reduces to the expression found in the previous problem 
when friction becomes negligibly small (u, = 0 ). 


Exercise: 
Problem: 
Calculate the deceleration of a snow boarder going up a 5.0°, slope 
assuming the coefficient of friction for waxed wood on wet snow. The 
result of [link] may be useful, but be careful to consider the fact that 


the snow boarder is going uphill. Explicitly show how you follow the 
steps in Problem-Solving Strategies. 


Solution: 


1.83 m/s” 
Exercise: 


Problem: 


(a) Calculate the acceleration of a skier heading down a 10.0° slope, 
assuming the coefficient of friction for waxed wood on wet snow. (b) 
Find the angle of the slope down which this skier could coast at a 
constant velocity. You can neglect air resistance in both parts, and you 
will find the result of [link] to be useful. Explicitly show how you 
follow the steps in the Problem-Solving Strategies. 


Exercise: 


Problem: 


If an object is to rest on an incline without slipping, then friction must 
equal the component of the weight of the object parallel to the incline. 
This requires greater and greater friction for steeper slopes. Show that 
the maximum angle of an incline above the horizontal for which an 
object will not slide down is 6 = tan p1,. You may use the result of 
the previous problem. Assume that a = 0 and that static friction has 
reached its maximum value. 


Exercise: 


Problem: 


Calculate the maximum deceleration of a car that is heading down a 6° 
slope (one that makes an angle of 6° with the horizontal) under the 
following road conditions. You may assume that the weight of the car 
is evenly distributed on all four tires and that the coefficient of static 
friction is involved—that is, the tires are not allowed to slip during the 
deceleration. (Ignore rolling.) Calculate for a car: (a) On dry concrete. 
(b) On wet concrete. (c) On ice, assuming that 4, = 0.100, the same 
as for shoes on ice. 


Exercise: 


Problem: 


Calculate the maximum acceleration of a car that is heading up a 4° 
slope (one that makes an angle of 4° with the horizontal) under the 
following road conditions. Assume that only half the weight of the car 
is supported by the two drive wheels and that the coefficient of static 
friction is involved—that is, the tires are not allowed to slip during the 
acceleration. (Ignore rolling.) (a) On dry concrete. (b) On wet 
concrete. (c) On ice, assuming that p, = 0.100, the same as for shoes 
on ice. 


Solution: 


(a) 4.20 m/s? 


(b) 2.74 m/s” 


(c) -0.195 m/s” 


Exercise: 


Problem: Repeat [link] for a car with four-wheel drive. 
Exercise: 


Problem: 


A freight train consists of two 8.00 x 10°-kg engines and 45 cars with 
average masses of 5.50 x 10° kg. (a) What force must each engine 
exert backward on the track to accelerate the train at a rate of 

5.00 x 10-7 m/s? if the force of friction is 7.50 x 10° N, assuming 
the engines exert identical forces? This is not a large frictional force 
for such a massive system. Rolling friction for trains is small, and 
consequently trains are very energy-efficient transportation systems. 
(b) What is the magnitude of the force in the coupling between the 
37th and 38th cars (this is the force each exerts on the other), assuming 
all cars have the same mass and that friction is evenly distributed 
among all of the cars and engines? 


Solution: 
(a) 1.03 x 10° N 


(b) 3.48 x 10° N 
Exercise: 


Problem: 


Consider the 52.0-kg mountain climber in [link]. (a) Find the tension 
in the rope and the force that the mountain climber must exert with her 
feet on the vertical rock face to remain stationary. Assume that the 
force is exerted parallel to her legs. Also, assume negligible force 
exerted by her arms. (b) What is the minimum coefficient of friction 
between her shoes and the cliff? 


Part of the 
climber’s weight is 
supported by her 
rope and part by 
friction between 
her feet and the 
rock face. 


Exercise: 


Problem: 


A contestant in a winter sporting event pushes a 45.0-kg block of ice 
across a frozen lake as shown in [link](a). (a) Calculate the minimum 
force F’ he must exert to get the block moving. (b) What is the 
magnitude of its acceleration once it starts to move, if that force is 
maintained? 


Solution: 


(a) 51.0 N 


(b) 0.720 m/s” 


Exercise: 
Problem: 
Repeat [link] with the contestant pulling the block of ice with a rope 


over his shoulder at the same angle above the horizontal as shown in 
[link](b). 


(b) 


Which method of 
sliding a block of ice 
requires less force—(a) 
pushing or (b) pulling 
at the same angle above 
the horizontal? 


Glossary 


friction 
a force that opposes relative motion or attempts at motion between 
systems in contact 


kinetic friction 


a force that opposes the motion of two systems that are in contact and 
moving relative to one another 


Static friction 
a force that opposes the motion of two systems that are in contact and 
are not moving relative to one another 


magnitude of static friction 
fs < ps N, where pu, is the coefficient of static friction and N is the 
magnitude of the normal force 


magnitude of kinetic friction 
fx = LN, where pu, is the coefficient of kinetic friction 


Elasticity: Stress and Strain 


e State Hooke’s law. 

e Explain Hooke’s law using graphical representation between 
deformation and applied force. 

e Discuss the three types of deformations such as changes in length, 
sideways shear and changes in volume. 

e Describe with examples the young’s modulus, shear modulus and bulk 
modulus. 

e Determine the change in length given mass, length and radius. 


We now move from consideration of forces that affect the motion of an 
object (such as friction and drag) to those that affect an object’s shape. If a 
bulldozer pushes a car into a wall, the car will not move but it will 
noticeably change shape. A change in shape due to the application of a force 
is a deformation. Even very small forces are known to cause some 
deformation. For small deformations, two important characteristics are 
observed. First, the object returns to its original shape when the force is 
removed—that is, the deformation is elastic for small deformations. Second, 
the size of the deformation is proportional to the force—that is, for small 
deformations, Hooke’s law is obeyed. In equation form, Hooke’s law is 
given by 

Equation: 


F=kAL, 


where AL is the amount of deformation (the change in length, for example) 
produced by the force F’, and k is a proportionality constant that depends on 
the shape and composition of the object and the direction of the force. Note 
that this force is a function of the deformation AL —it is not constant as a 
kinetic friction force is. Rearranging this to 

Equation: 


F 
AL = — 
k 


makes it clear that the deformation is proportional to the applied force. [link] 
shows the Hooke’s law relationship between the extension AL of a spring 
or of a human bone. For metals or springs, the straight line region in which 
Hooke’s law pertains is much larger. Bones are brittle and the elastic region 
is small and the fracture abrupt. Eventually a large enough stress to the 
material will cause it to break or fracture. Tensile strength is the breaking 
stress that will cause permanent deformation or fracture of a material. 


Note: 
Hooke’s Law 
Equation: 


F=kAL, 


where AL is the amount of deformation (the change in length, for example) 
produced by the force F’, and k is a proportionality constant that depends on 
the shape and composition of the object and the direction of the force. 
Equation: 


F 
AL = — 
k 


AL 4 


— 


«- Hooke’s law is F 
obeyed 
; . Fracture 
~————_ Elastic region 
Permanent 


deformation 


A graph of 
deformation AL 
versus applied force F’ 


. The straight segment 
is the linear region 
where Hooke’s law is 
obeyed. The slope of 
the straight region is 
-. For larger forces, 


the graph is curved but 
the deformation is still 
elastic— AL will 
return to zero if the 
force is removed. Still 
greater forces 
permanently deform 
the object until it 
finally fractures. The 
shape of the curve 
near fracture depends 
on several factors, 
including how the 
force F is applied. 
Note that in this graph 
the slope increases just 
before fracture, 
indicating that a small 
increase in F' is 
producing a large 
increase in DL near the 
fracture. 


The proportionality constant k depends upon a number of factors for the 
material. For example, a guitar string made of nylon stretches when it is 
tightened, and the elongation AL is proportional to the force applied (at 
least for small deformations). Thicker nylon strings and ones made of steel 
stretch less for the same applied force, implying they have a larger k (see 
[link]). Finally, all three strings return to their normal lengths when the force 


is removed, provided the deformation is small. Most materials will behave 
in this manner if the deformation is less than about 0.1% or about 1 part in 
10°, 


ane 


The same force, 
in this case a 
weight (w), 
applied to three 
different guitar 
strings of 
identical length 
produces the 
three different 
deformations 
shown as shaded 
segments. The 
string on the left 
is thin nylon, the 
one in the 
middle is thicker 
nylon, and the 
one on the right 
is steel. 


Note: 

Stretch Yourself a Little 

How would you go about measuring the proportionality constant k of a 
rubber band? If a rubber band stretched 3 cm when a 100-g mass was 
attached to it, then how much would it stretch if two similar rubber bands 
were attached to the same mass—even if put together in parallel or 
alternatively if tied together in series? 


We now consider three specific types of deformations: changes in length 
(tension and compression), sideways shear (stress), and changes in volume. 
All deformations are assumed to be small unless otherwise stated. 


Changes in Length—Tension and Compression: Elastic 
Modulus 


A change in length AL is produced when a force is applied to a wire or rod 
parallel to its length Lo, either stretching it (a tension) or compressing it. 
(See [link].) 


le 


(a) (b) 


(a) Tension. The 
rod is stretched 


a length AD 
when a force is 
applied parallel 
to its length. (b) 

Compression. 
The same rod is 
compressed by 
forces with the 
Same magnitude 
in the opposite 
direction. For 
very small 
deformations 
and uniform 
materials, AL is 
approximately 
the same for the 
Same magnitude 
of tension or 
compression. 
For larger 
deformations, 
the cross- 
sectional area 
changes as the 
rod is 
compressed or 
stretched. 


Experiments have shown that the change in length (AL) depends on only a 
few variables. As already noted, AL is proportional to the force F’ and 
depends on the substance from which the object is made. Additionally, the 
change in length is proportional to the original length Lo and inversely 
proportional to the cross-sectional area of the wire or rod. For example, a 
long guitar string will stretch more than a short one, and a thick string will 


stretch less than a thin one. We can combine all these factors into one 
equation for AL: 
Equation: 


1 


F 
AL = ——L 
VY A 0; 


where AL is the change in length, F' the applied force, Y is a factor, called 
the elastic modulus or Young’s modulus, that depends on the substance, A is 
the cross-sectional area, and Lg is the original length. [link] lists values of Y 
for several materials—those with a large Y are said to have a large tensile 
stifness because they deform less for a given tension or compression. 


Young’s 
modulus 
(tension— Shear Bulk 
compression)Y modulus S modulus B 
Material (10° N/m”) (10° N/m”) (10° N/m’) 
Aluminum 70 25 75 
ae 16 80 8 
tension 
Bone — 
compression 
Brass 90 35 79 


Brick 15 


Material 
Concrete 
Glass 
Granite 


Hair 
(human) 


Hardwood 
Iron, cast 
Lead 
Marble 
Nylon 
Polystyrene 
Silk 


Spider 
thread 


Steel 


Tendon 


Young’s 
modulus 
(tension— 
compression)Y 
(10° N/m”) 
20 

70 


45 
10 


15 


100 


60 


210 


Shear 
modulus S 


(10° N/m”) 


20 


20 


40 


20 


80 


Bulk 
modulus B 


(10° N/m’) 


30 


45 


90 
50 


70 


130 


Young’s 


modulus 
(tension— Shear Bulk 
compression)Y modulus S modulus B 
Material (10° N/m”) (10° N/m”) (10° N/m’) 
Acetone 0.7 
Ethanol 0.9 
Glycerin 4.5 
Mercury 25 
Water 22 


Elastic Modulil footnote | 

Approximate and average values. Young’s moduli Y for tension and 
compression sometimes differ but are averaged here. Bone has significantly 
different Young’s moduli for tension and compression. 


Young’s moduli are not listed for liquids and gases in [link] because they 
cannot be stretched or compressed in only one direction. Note that there is 
an assumption that the object does not accelerate, so that there are actually 
two applied forces of magnitude F' acting in opposite directions. For 
example, the strings in [link] are being pulled down by a force of magnitude 
w and held up by the ceiling, which also exerts a force of magnitude w. 


Example: 

The Stretch of a Long Cable 

Suspension cables are used to carry gondolas at ski resorts. (See [link]) 
Consider a suspension cable that includes an unsupported span of 3020 m. 
Calculate the amount of stretch in the steel cable. Assume that the cable has 
a diameter of 5.6 cm and the maximum tension it can withstand is 

3.0 x 10°N. 


Gondolas travel 
along suspension 
cables at the Gala 
Yuzawa ski resort 
in Japan. (credit: 

Rudy Herman, 

Flickr) 


Strategy 

The force is equal to the maximum tension, or F’ = 3.0 x 10° N. The 
cross-sectional area is mr? = 2.46 x 10° m”. The equation 

AL v +-Lo can be used to find the change in length. 

Solution 

All quantities are known. Thus, 

Equation: 


Ni = (a) (22st) (3020 m) 


210x 10° N/m? 2.46x 10% m? 
= 18m. 


Discussion 

This is quite a stretch, but only about 0.6% of the unsupported length. 
Effects of temperature upon length might be important in these 
environments. 


Bones, on the whole, do not fracture due to tension or compression. Rather 
they generally fracture due to sideways impact or bending, resulting in the 


bone shearing or snapping. The behavior of bones under tension and 
compression is important because it determines the load the bones can carry. 
Bones are classified as weight-bearing structures such as columns in 
buildings and trees. Weight-bearing structures have special features; 
columns in building have steel-reinforcing rods while trees and bones are 
fibrous. The bones in different parts of the body serve different structural 
functions and are prone to different stresses. Thus the bone in the top of the 
femur is arranged in thin sheets separated by marrow while in other places 
the bones can be cylindrical and filled with marrow or just solid. 
Overweight people have a tendency toward bone damage due to sustained 
compressions in bone joints and tendons. 


Another biological example of Hooke’s law occurs in tendons. Functionally, 
the tendon (the tissue connecting muscle to bone) must stretch easily at first 
when a force is applied, but offer a much greater restoring force for a greater 
strain. [link] shows a stress-strain relationship for a human tendon. Some 
tendons have a high collagen content so there is relatively little strain, or 
length change; others, like support tendons (as in the leg) can change length 
up to 10%. Note that this stress-strain curve is nonlinear, since the slope of 
the line changes in different regions. In the first part of the stretch called the 
toe region, the fibers in the tendon begin to align in the direction of the 
stress—this is called uncrimping. In the linear region, the fibrils will be 
stretched, and in the failure region individual fibers begin to break. A simple 
model of this relationship can be illustrated by springs in parallel: different 
springs are activated at different lengths of stretch. Examples of this are 
given in the problems at end of this chapter. Ligaments (tissue connecting 
bone to bone) behave in a similar way. 


Toe Linear Failure oF 
region region region 


Typical stress-strain 
curve for mammalian 
tendon. Three regions 

are shown: (1) toe 
region (2) linear 
region, and (3) failure 
region. 


Unlike bones and tendons, which need to be strong as well as elastic, the 
arteries and lungs need to be very stretchable. The elastic properties of the 
arteries are essential for blood flow. The pressure in the arteries increases 
and arterial walls stretch when the blood is pumped out of the heart. When 
the aortic valve shuts, the pressure in the arteries drops and the arterial walls 
relax to maintain the blood flow. When you feel your pulse, you are feeling 
exactly this—the elastic behavior of the arteries as the blood gushes through 
with each pump of the heart. If the arteries were rigid, you would not feel a 
pulse. The heart is also an organ with special elastic properties. The lungs 
expand with muscular effort when we breathe in but relax freely and 
elastically when we breathe out. Our skins are particularly elastic, especially 
for the young. A young person can go from 100 kg to 60 kg with no visible 
sag in their skins. The elasticity of all organs reduces with age. Gradual 
physiological aging through reduction in elasticity starts in the early 20s. 


Example: 

Calculating Deformation: How Much Does Your Leg Shorten When 
You Stand on It? 

Calculate the change in length of the upper leg bone (the femur) when a 
70.0 kg man supports 62.0 kg of his mass on it, assuming the bone to be 
equivalent to a uniform rod that is 40.0 cm long and 2.00 cm in radius. 
Strategy 

The force is equal to the weight supported, or 

Equation: 


F = mg = (62.0 ke) (9.80 m/s”) = 607.6N, 


and the cross-sectional area is mr? = 1.257 x 10~° m?”. The equation 
i ¥ +L can be used to find the change in length. 

Solution 

All quantities except AD are known. Note that the compression value for 
Young’s modulus for bone must be used here. Thus, 

Equation: 


= 1 607.6 N 
A = ( 9x 109 N/m? ) ( 1.257x 1073 m? ) (0.400 m) 
2x 10° m. 


Discussion 

This small change in length seems reasonable, consistent with our 
experience that bones are rigid. In fact, even the rather large forces 
encountered during strenuous physical activity do not compress or bend 
bones by large amounts. Although bone is rigid compared with fat or 
muscle, several of the substances listed in [link] have larger values of 
Young’s modulus Y. In other words, they are more rigid. 


The equation for change in length is traditionally rearranged and written in 
the following form: 
Equation: 


F AL 
= 


A ~ Le 


The ratio of force to area, a is defined as stress (measured in N/ m’), and 
the ratio of the change in length to length, ae is defined as strain (a 
unitless quantity). In other words, 

Equation: 


stress = Y x strain. 


In this form, the equation is analogous to Hooke’s law, with stress analogous 
to force and strain analogous to deformation. If we again rearrange this 
equation to the form 

Equation: 


we see that it is the same as Hooke’s law with a proportionality constant 
Equation: 


YA 
k= —. 
Lo 
This general idea—that force and the deformation it causes are proportional 


for small deformations—applies to changes in length, sideways bending, 
and changes in volume. 


Note: 
Stress 
The ratio of force to area, +, is defined as stress measured in N/m2. 


Note: 
Strain 


The ratio of the change in length to length, . is defined as strain (a 
unitless quantity). In other words, 
Equation: 


stress = Y x strain. 


Sideways Stress: Shear Modulus 


[link] illustrates what is meant by a sideways stress or a shearing force. Here 
the deformation is called Az and it is perpendicular to Lo, rather than 
parallel as with tension and compression. Shear deformation behaves 
similarly to tension and compression and can be described with similar 
equations. The expression for shear deformation is 

Equation: 


1F 


A eal 


where S is the shear modulus (see [link]) and F' is the force applied 
perpendicular to Lo and parallel to the cross-sectional area A. Again, to 
keep the object from accelerating, there are actually two equal and opposite 
forces F' applied across opposite faces, as illustrated in [link]. The equation 
is logical—for example, it is easier to bend a long thin pencil (small A) than 
a short thick one, and both are more easily bent than similar steel rods (large 


S). 


Note: 
Shear Deformation 
Equation: 
Lae 
Az = ——L 
eee eet 


where § is the shear modulus and F is the force applied perpendicular to 
Lo and parallel to the cross-sectional area A. 


Shearing forces are 
applied 
perpendicular to the 
length Lo and 
parallel to the area 
A, producing a 
deformation Ax. 
Vertical forces are 
not shown, but it 
should be kept in 
mind that in 
addition to the two 
shearing forces, F, 
there must be 
supporting forces to 
keep the object 
from rotating. The 
distorting effects of 
these supporting 
forces are ignored 
in this treatment. 
The weight of the 
object also is not 
shown, since it is 
usually negligible 
compared with 
forces large enough 
to cause significant 
deformations. 


Examination of the shear moduli in [link] reveals some telling patterns. For 
example, shear moduli are less than Young’s moduli for most materials. 
Bone is a remarkable exception. Its shear modulus is not only greater than 
its Young’s modulus, but it is as large as that of steel. This is why bones are 
so rigid. 


The spinal column (consisting of 26 vertebral segments separated by discs) 
provides the main support for the head and upper part of the body. The 
spinal column has normal curvature for stability, but this curvature can be 
increased, leading to increased shearing forces on the lower vertebrae. Discs 
are better at withstanding compressional forces than shear forces. Because 
the spine is not vertical, the weight of the upper body exerts some of both. 
Pregnant women and people that are overweight (with large abdomens) need 
to move their shoulders back to maintain balance, thereby increasing the 
curvature in their spine and so increasing the shear component of the stress. 
An increased angle due to more curvature increases the shear forces along 
the plane. These higher shear forces increase the risk of back injury through 
ruptured discs. The lumbosacral disc (the wedge shaped disc below the last 
vertebrae) is particularly at risk because of its location. 


The shear moduli for concrete and brick are very small; they are too highly 
variable to be listed. Concrete used in buildings can withstand compression, 
as in pillars and arches, but is very poor against shear, as might be 
encountered in heavily loaded floors or during earthquakes. Modern 
structures were made possible by the use of steel and steel-reinforced 
concrete. Almost by definition, liquids and gases have shear moduli near 
zero, because they flow in response to shearing forces. 


Example: 

Calculating Force Required to Deform: That Nail Does Not Bend Much 
Under a Load 

Find the mass of the picture hanging from a steel nail as shown in [link], 
given that the nail bends only 1.80 um. (Assume the shear modulus is 
known to two significant figures.) 


Ax = 1.80 um 


L, = 5.00 mm 


Side view of a nail 
with a picture hung 
from it. The nail 
flexes very slightly 
(shown much larger 
than actual) 
because of the 
shearing effect of 
the supported 
weight. Also shown 
is the upward force 
of the wall on the 
nail, illustrating 
that there are equal 
and opposite forces 
applied across 
Opposite cross 
sections of the nail. 
See [link] for a 
calculation of the 
mass of the picture. 


Strategy 
The force F’ on the nail (neglecting the nail’s own weight) is the weight of 
the picture w. If we can find w, then the mass of the picture is just a . The 


equation Ax = = = Lo can be solved for F’. 


Solution 
Solving the equation Ax = = + Lo for F’, we see that all other quantities 


can be found: 
Equation: 


S is found in [link] and is S = 80 x 10° N/m’. The radius r is 0.750 mm 
(as seen in the figure), so the cross-sectional area is 
Equation: 


A =r? =1.77 x 10° m?. 


The value for Lg is also shown in the figure. Thus, 
Equation: 


(80 x 10° N/m?)(1.77 x 10~® m?) 


Ginx ym) (1.80 x 10° m) = 5IN. 
: x m 


i 


This 51 N force is the weight w of the picture, so the picture’s mass is 
Equation: 


F 
m= — = — =5.2kg. 
9 9 


Discussion 
This is a fairly massive picture, and it is impressive that the nail flexes only 
1.80 m—an amount undetectable to the unaided eye. 


Changes in Volume: Bulk Modulus 


An object will be compressed in all directions if inward forces are applied 
evenly on all its surfaces as in [link]. It is relatively easy to compress gases 
and extremely difficult to compress liquids and solids. For example, air in a 


wine bottle is compressed when it is corked. But if you try corking a brim- 
full bottle, you cannot compress the wine—some must be removed if the 
cork is to be inserted. The reason for these different compressibilities is that 
atoms and molecules are separated by large empty spaces in gases but 
packed close together in liquids and solids. To compress a gas, you must 
force its atoms and molecules closer together. To compress liquids and 
solids, you must actually compress their atoms and molecules, and very 
strong electromagnetic forces in them oppose this compression. 


A Volume Vo 


Volume 
Vo—-AV 


An inward force on 
all surfaces 
compresses this 
cube. Its change in 
volume is 
proportional to the 
force per unit area 
and its original 
volume, and is 
related to the 
compressibility of 
the substance. 


We can describe the compression or volume deformation of an object with 
an equation. First, we note that a force “applied evenly” is defined to have 


the same stress, or ratio of force to area = on all surfaces. The deformation 


produced is a change in volume AV, which is found to behave very 
similarly to the shear, tension, and compression previously discussed. (This 
is not surprising, since a compression of the entire object is equivalent to 
compressing each of its three dimensions.) The relationship of the change in 
volume to other physical quantities is given by 

Equation: 


1F 
DVS 7, 
BA’ 


where B is the bulk modulus (see [link]), Vp is the original volume, and = 


is the force per unit area applied uniformly inward on all surfaces. Note that 
no bulk moduli are given for gases. 


What are some examples of bulk compression of solids and liquids? One 
practical example is the manufacture of industrial-grade diamonds by 
compressing carbon with an extremely large force per unit area. The carbon 
atoms rearrange their crystalline structure into the more tightly packed 
pattern of diamonds. In nature, a similar process occurs deep underground, 
where extremely large forces result from the weight of overlying material. 
Another natural source of large compressive forces is the pressure created by 
the weight of water, especially in deep parts of the oceans. Water exerts an 
inward force on all surfaces of a submerged object, and even on the water 
itself. At great depths, water is measurably compressed, as the following 
example illustrates. 


Example: 

Calculating Change in Volume with Deformation: How Much Is Water 
Compressed at Great Ocean Depths? 

Calculate the fractional decrease in volume ico) for seawater at 5.00 km 


depth, where the force per unit area is 5.00 x 10’ N i mes 
Strategy 


Equation AV = = Vo is the correct physical relationship. All quantities 


in the equation except =e are known. 


Solution 
Solving for the unknown or gives 
Equation: 
AV 1 F 
Vo BA 


Substituting known values with the value for the bulk modulus B from 
[link], 


Equation: 
AV _ 5.00107 N/m? 
Yo 9.2109 N/m? 
= 0/023" — 27370. 
Discussion 


Although measurable, this is not a significant decrease in volume 
considering that the force per unit area is about 500 atmospheres (1 million 
pounds per square foot). Liquids and solids are extraordinarily difficult to 
compress. 


Conversely, very large forces are created by liquids and solids when they try 
to expand but are constrained from doing so—which is equivalent to 
compressing them to less than their normal volume. This often occurs when 
a contained material warms up, since most materials expand when their 
temperature increases. If the materials are tightly constrained, they deform 
or break their container. Another very common example occurs when water 
freezes. Water, unlike most materials, expands when it freezes, and it can 
easily fracture a boulder, rupture a biological cell, or crack an engine block 
that gets in its way. 


Other types of deformations, such as torsion or twisting, behave analogously 
to the tension, shear, and bulk deformations considered here. 


Note: 
PhET Explorations: Masses & Springs 


https://phet.colorado.edu/sims/mass-spring-lab/mass-spring-lab_en.html 


Section Summary 


¢ Hooke’s law is given by 
Equation: 


F=kAL, 


where AL is the amount of deformation (the change in length), F’ is 
the applied force, and k is a proportionality constant that depends on 
the shape and composition of the object and the direction of the force. 
The relationship between the deformation and the applied force can 
also be written as 

Equation: 


where Y is Young’s modulus, which depends on the substance, A is the 
cross-sectional area, and Lg is the original length. 
e The ratio of force to area, oe is defined as stress, measured in N/m2. 
AL 
Lo 


e The ratio of the change in length to length, , is defined as strain (a 


unitless quantity). In other words, 
Equation: 


stress = Y x strain. 


e The expression for shear deformation is 
Equation: 


), ae 
ea 
L SA 05 


where § is the shear modulus and F is the force applied perpendicular 
to Lo and parallel to the cross-sectional area A. 

e The relationship of the change in volume to other physical quantities is 
given by 
Equation: 


where B is the bulk modulus, Vo is the original volume, and f is the 
force per unit area applied uniformly inward on all surfaces. 


Conceptual Questions 


Exercise: 
Problem: 
The elastic properties of the arteries are essential for blood flow. 


Explain the importance of this in terms of the characteristics of the flow 
of blood (pulsating or continuous). 


Exercise: 
Problem: 
What are you feeling when you feel your pulse? Measure your pulse 
rate for 10 s and for 1 min. Is there a factor of 6 difference? 
Exercise: 
Problem: 
Examine different types of shoes, including sports shoes and thongs. In 


terms of physics, why are the bottom surfaces designed as they are? 
What differences will dry and wet conditions make for these surfaces? 


Exercise: 
Problem: 
Would you expect your height to be different depending upon the time 
of day? Why or why not? 
Exercise: 
Problem: 
Why can a squirrel jump from a tree branch to the ground and run away 
undamaged, while a human could break a bone in such a fall? 
Exercise: 
Problem: 
Explain why pregnant women often suffer from back strain late in their 
pregnancy. 
Exercise: 
Problem: 
An old carpenter’s trick to keep nails from bending when they are 


pounded into hard materials is to grip the center of the nail firmly with 
pliers. Why does this help? 


Exercise: 


Problem: 


When a glass bottle full of vinegar warms up, both the vinegar and the 
glass expand, but vinegar expands significantly more with temperature 
than glass. The bottle will break if it was filled to its tightly capped lid. 
Explain why, and also explain how a pocket of air above the vinegar 
would prevent the break. (This is the function of the air above liquids in 
glass containers.) 


Problems & Exercises 


Exercise: 


Problem: 


During a circus act, one performer swings upside down hanging from a 
trapeze holding another, also upside-down, performer by the legs. If the 
upward force on the lower performer is three times her weight, how 
much do the bones (the femurs) in her upper legs stretch? You may 
assume each is equivalent to a uniform rod 35.0 cm long and 1.80 cm 
in radius. Her mass is 60.0 kg. 


Solution: 
Equation: 


1.90 x 1072 cm 


Exercise: 


Problem: 


During a wrestling match, a 150 kg wrestler briefly stands on one hand 
during a maneuver designed to perplex his already moribund adversary. 
By how much does the upper arm bone shorten in length? The bone can 
be represented by a uniform rod 38.0 cm in length and 2.10 cm in 
radius. 


Exercise: 


Problem: 


(a) The “lead” in pencils is a graphite composition with a Young’s 
modulus of about 1 x 10? N/m? . Calculate the change in length of 
the lead in an automatic pencil if you tap it straight into the pencil with 
a force of 4.0 N. The lead is 0.50 mm in diameter and 60 mm long. (b) 
Is the answer reasonable? That is, does it seem to be consistent with 
what you have observed when using pencils? 


Solution: 


(a)1 mm 
(b) This does seem reasonable, since the lead does seem to shrink a 
little when you push on it. 


Exercise: 


Problem: 


TV broadcast antennas are the tallest artificial structures on Earth. In 
1987, a 72.0-kg physicist placed himself and 400 kg of equipment at 
the top of one 610-m high antenna to perform gravity experiments. By 
how much was the antenna compressed, if we consider it to be 
equivalent to a steel cylinder 0.150 m in radius? 


Exercise: 
Problem: 
(a) By how much does a 65.0-kg mountain climber stretch her 0.800- 
cm diameter nylon rope when she hangs 35.0 m below a rock 
outcropping? (b) Does the answer seem to be consistent with what you 


have observed for nylon ropes? Would it make sense if the rope were 
actually a bungee cord? 


Solution: 


(a)9 cm 
(b)This seems reasonable for nylon climbing rope, since it is not 
supposed to stretch that much. 


Exercise: 
Problem: 
A 20.0-m tall hollow aluminum flagpole is equivalent in stiffness to a 
solid cylinder 4.00 cm in diameter. A strong wind bends the pole much 


as a horizontal force of 900 N exerted at the top would. How far to the 
side does the top of the pole flex? 


Exercise: 


Problem: 


As an oil well is drilled, each new section of drill pipe supports its own 
weight and that of the pipe and drill bit beneath it. Calculate the stretch 
in anew 6.00 m length of steel pipe that supports 3.00 km of pipe 
having a mass of 20.0 kg/m and a 100-kg drill bit. The pipe is 
equivalent in stiffness to a solid cylinder 5.00 cm in diameter. 


Solution: 


8.59 mm 
Exercise: 
Problem: 
Calculate the force a piano tuner applies to stretch a steel piano wire 


8.00 mm, if the wire is originally 0.850 mm in diameter and 1.35 m 
long. 


Exercise: 


Problem: 


A vertebra is subjected to a shearing force of 500 N. Find the shear 
deformation, taking the vertebra to be a cylinder 3.00 cm high and 4.00 
cm in diameter. 


Solution: 
Equation: 


1.49 x 10°’ m 


Exercise: 


Problem: 


A disk between vertebrae in the spine is subjected to a shearing force of 
600 N. Find its shear deformation, taking it to have the shear modulus 
of 1 x 10° N/m? . The disk is equivalent to a solid cylinder 0.700 cm 
high and 4.00 cm in diameter. 


Exercise: 
Problem: 
When using a pencil eraser, you exert a vertical force of 6.00 N at a 
distance of 2.00 cm from the hardwood-eraser joint. The pencil is 6.00 
mm in diameter and is held at an angle of 20.0° to the horizontal. (a) 


By how much does the wood flex perpendicular to its length? (b) How 
much is it compressed lengthwise? 


Solution: 
(a) 3.99 x 10-7 m 


(b) 9.67 x 10°-°m 
Exercise: 


Problem: 


To consider the effect of wires hung on poles, we take data from [link], 
in which tensions in wires supporting a traffic light were calculated. 
The left wire made an angle 30.0° below the horizontal with the top of 
its pole and carried a tension of 108 N. The 12.0 m tall hollow 
aluminum pole is equivalent in stiffness to a 4.50 cm diameter solid 
cylinder. (a) How far is it bent to the side? (b) By how much is it 
compressed? 


Exercise: 


Problem: 


A farmer making grape juice fills a glass bottle to the brim and caps it 
tightly. The juice expands more than the glass when it warms up, in 
such a way that the volume increases by 0.2% (that is, 

AV /Vo = 2 x 107%) relative to the space available. Calculate the 
magnitude of the normal force exerted by the juice per square 
centimeter if its bulk modulus is 1.8 x 10° N / m’, assuming the bottle 
does not break. In view of your answer, do you think the bottle will 
survive? 


Solution: 


4x10°N / m”. This is about 36 atm, greater than a typical jar can 
withstand. 


Exercise: 
Problem: 
(a) When water freezes, its volume increases by 9.05% (that is, 
AV/Vo = 9.05 x 10~2 ). What force per unit area is water capable of 
exerting on a container when it freezes? (It is acceptable to use the bulk 


modulus of water in this problem.) (b) Is it surprising that such forces 
can fracture engine blocks, boulders, and the like? 


Exercise: 
Problem: 
This problem returns to the tightrope walker studied in [link], who 
created a tension of 3.94 x 10° N ina wire making an angle 5.0° 
below the horizontal with each supporting pole. Calculate how much 


this tension stretches the steel wire if it was originally 15 m long and 
0.50 cm in diameter. 


Solution: 


1.4 cm 


Exercise: 


Problem: 


The pole in [link] is at a 90.0° bend in a power line and is therefore 
subjected to more shear force than poles in straight parts of the line. 
The tension in each line is 4.00 x 10* N, at the angles shown. The pole 
is 15.0 m tall, has an 18.0 cm diameter, and can be considered to have 
half the stiffness of hardwood. (a) Calculate the compression of the 
pole. (b) Find how much it bends and in what direction. (c) Find the 
tension in a guy wire used to keep the pole straight if it is attached to 
the top of the pole at an angle of 30.0° with the vertical. (Clearly, the 
guy wire must be in the opposite direction of the bend.) 


This telephone 

pole is at a 90° 
bend in a power 
line. A guy wire 
is attached to the 
top of the pole at 
an angle of 30° 
with the vertical. 


Glossary 


deformation 


change in shape due to the application of force 


Hooke’s law 
proportional relationship between the force F' on a material and the 
deformation AL it causes, F = kAL 


tensile strength 
the breaking stress that will cause permanent deformation or fraction of 
a material 


stress 
ratio of force to area 


strain 
ratio of change in length to original length 


shear deformation 
deformation perpendicular to the original length of an object 


Introduction to Work, Energy, and Energy Resources 
class="introduction" 


How many 
forms of 
energy can 
you identify 
in this 
photograph 
of a wind 
farm in 
Iowa? 
(credit: 
Jiirgen from 
Sandesneben 
, Germany, 
Wikimedia 
Commons) 


Energy plays an essential role both in everyday events and in scientific 
phenomena. You can no doubt name many forms of energy, from that 
provided by our foods, to the energy we use to run our cars, to the sunlight 
that warms us on the beach. You can also cite examples of what people call 
energy that may not be scientific, such as someone having an energetic 
personality. Not only does energy have many interesting forms, it is 


involved in almost all phenomena, and is one of the most important 
concepts of physics. What makes it even more important is that the total 
amount of energy in the universe is constant. Energy can change forms, but 
it cannot appear from nothing or disappear without a trace. Energy is thus 
one of a handful of physical quantities that we say is conserved. 


Conservation of energy (as physicists like to call the principle that energy 
can neither be created nor destroyed) is based on experiment. Even as 
scientists discovered new forms of energy, conservation of energy has 
always been found to apply. Perhaps the most dramatic example of this was 
supplied by Einstein when he suggested that mass is equivalent to energy 
(his famous equation E = mc’). 


From a societal viewpoint, energy is one of the major building blocks of 
modern civilization. Energy resources are key limiting factors to economic 
growth. The world use of energy resources, especially oil, continues to 
grow, with ominous consequences economically, socially, politically, and 
environmentally. We will briefly examine the world’s energy use patterns at 
the end of this chapter. 


There is no simple, yet accurate, scientific definition for energy. Energy is 
characterized by its many forms and the fact that it is conserved. We can 
loosely define energy as the ability to do work, admitting that in some 
circumstances not all energy is available to do work. Because of the 
association of energy with work, we begin the chapter with a discussion of 
work. Work is intimately related to energy and how energy moves from one 
system to another or changes form. 


Work: The Scientific Definition 


e Explain how an object must be displaced for a force on it to do work. 
e Explain how relative directions of force and displacement determine 
whether the work done is positive, negative, or zero. 


What It Means to Do Work 


The scientific definition of work differs in some ways from its everyday 
meaning. Certain things we think of as hard work, such as writing an exam 
or carrying a heavy load on level ground, are not work as defined by a 
scientist. The scientific definition of work reveals its relationship to energy 
—whenever work is done, energy is transferred. 


For work, in the scientific sense, to be done, a force must be exerted and 
there must be displacement in the direction of the force. 


Formally, the work done on a system by a constant force is defined to be 
the product of the component of the force in the direction of motion times 
the distance through which the force acts. For one-way motion in one 
dimension, this is expressed in equation form as 

Equation: 


W=|F | (cos@) | d |, 


where W is work, d is the displacement of the system, and @ is the angle 
between the force vector F and the displacement vector d, as in [link]. We 
can also write this as 

Equation: 


W = Fdcos 8. 


To find the work done on a system that undergoes motion that is not one- 
way or that is in two or three dimensions, we divide the motion into one- 
way one-dimensional segments and add up the work done over each 
segment. 


Note: 

What is Work? 

The work done on a system by a constant force is the product of the 
component of the force in the direction of motion times the distance 
through which the force acts. For one-way motion in one dimension, this is 
expressed in equation form as 

Equation: 


W = Fd cos 0, 


where W is work, F' is the magnitude of the force on the system, d is the 
magnitude of the displacement of the system, and @ is the angle between 
the force vector F and the displacement vector d. 


W= Fdcos @ 


(b) (c) 


Electric 
generator 


6 
(d) 


Examples of work. (a) The work done by the force 
F on this lawn mower is Fd cos 9. Note that 
F cos @ is the component of the force in the 
direction of motion. (b) A person holding a 
briefcase does no work on it, because there is no 


(e) 


displacement. No energy is transferred to or from 

the briefcase. (c) The person moving the briefcase 
horizontally at a constant speed does no work on 
it, and transfers no energy to it. (d) Work is done 

on the briefcase by carrying it up stairs at constant 

speed, because there is necessarily a component of 
force F in the direction of the motion. Energy is 
transferred to the briefcase and could in turn be 

used to do work. (e) When the briefcase is 
lowered, energy is transferred out of the briefcase 
and into an electric generator. Here the work done 
on the briefcase by the generator is negative, 
removing energy from the briefcase, because F 
and d are in opposite directions. 


To examine what the definition of work means, let us consider the other 
situations shown in [link]. The person holding the briefcase in [link ](b) 
does no work, for example. Here d = 0,so W = 0. Why is it you get tired 
just holding a load? The answer is that your muscles are doing work against 
one another, but they are doing no work on the system of interest (the 
“briefcase-Earth system”—see Gravitational Potential Energy for more 
details). There must be displacement for work to be done, and there must be 
a component of the force in the direction of the motion. For example, the 
person carrying the briefcase on level ground in [link ](c) does no work on 
it, because the force is perpendicular to the motion. That is, cos 90° = 0, 
and so W = 0. 


In contrast, when a force exerted on the system has a component in the 
direction of motion, such as in [link](d), work is done—energy is 
transferred to the briefcase. Finally, in [link](e), energy is transferred from 
the briefcase to a generator. There are two good ways to interpret this 
energy transfer. One interpretation is that the briefcase’s weight does work 
on the generator, giving it energy. The other interpretation is that the 
generator does negative work on the briefcase, thus removing energy from 
it. The drawing shows the latter, with the force from the generator upward 


on the briefcase, and the displacement downward. This makes 8 = 180°, 
and cos 180° = —1; therefore, W is negative. 


Calculating Work 


Work and energy have the same units. From the definition of work, we see 
that those units are force times distance. Thus, in SI units, work and energy 
are measured in newton-meters. A newton-meter is given the special name 
joule (J), and1 J=1N-m=1kg- m?/s”. One joule is not a large 
amount of energy; it would lift a small 100-gram apple a distance of about 1 
meter. 


Example: 

Calculating the Work You Do to Push a Lawn Mower Across a Large 
Lawn 

How much work is done on the lawn mower by the person in [link](a) if he 
exerts a constant force of 75.0 N at an angle 35° below the horizontal and 
pushes the mower 25.0 m on level ground? Convert the amount of work 
from joules to kilocalories and compare it with this person’s average daily 
intake of 10,000 kJ (about 2400 kcal) of food energy. One calorie (1 cal) 
of heat is the amount required to warm 1 g of water by 1°C, and is 
equivalent to 4.184 J, while one food calorie (1 kcal) is equivalent to 
4184 J. 

Strategy 

We can solve this problem by substituting the given values into the 
definition of work done on a system, stated in the equation W = Fd cos 8. 
The force, angle, and displacement are given, so that only the work W is 
unknown. 

Solution 

The equation for the work is 

Equation: 


W = Fdcos 8. 


Substituting the known values gives 


Equation: 


= 
| 


(75.0 N)(25.0 m) cos (35.0°) 
WER ce hel SCM a 


Converting the work in joules to kilocalories yields 

W = (1536 J)(1 kcal/4184 J) = 0.367 kcal. The ratio of the work done 
to the daily consumption is 

Equation: 


WwW 


ae a Sa et at 
3400 keal L538 e LO. 


Discussion 

This ratio is a tiny fraction of what the person consumes, but it is typical. 
Very little of the energy released in the consumption of food is used to do 
work. Even when we “work” all day long, less than 10% of our food 
energy intake is used to do work and more than 90% is converted to 
thermal energy or stored as chemical energy in fat. 


Section Summary 


e Work is the transfer of energy by a force acting on an object as it is 
displaced. 

e The work W that a force F does on an object is the product of the 
magnitude F' of the force, times the magnitude d of the displacement, 
times the cosine of the angle 0 between them. In symbols, 

Equation: 


W = Fd cos 8. 


e The SI unit for work and energy is the joule (J), where 
1J=1N-m=1kg- m?/s’. 

e The work done by a force is zero if the displacement is either zero or 
perpendicular to the force. 


e The work done is positive if the force and displacement have the same 
direction, and negative if they have opposite direction. 


Conceptual Questions 


Exercise: 
Problem: 
Give an example of something we think of as work in everyday 
circumstances that is not work in the scientific sense. Is energy 


transferred or changed in form in your example? If so, explain how 
this is accomplished without doing work. 


Exercise: 
Problem: 
Give an example of a situation in which there is a force anda 


displacement, but the force does no work. Explain why it does no 
work. 


Exercise: 


Problem: 


Describe a situation in which a force is exerted for a long time but 
does no work. Explain. 


Problems & Exercises 


Exercise: 


Problem: 


How much work does a supermarket checkout attendant do on a can of 
soup he pushes 0.600 m horizontally with a force of 5.00 N? Express 
your answer in joules and kilocalories. 


Solution: 


Equation: 


3.00 J = 7.17 x 1074 kcal 


Exercise: 
Problem: 
A 75.0-kg person climbs stairs, gaining 2.50 meters in height. Find the 
work done to accomplish this task. 

Exercise: 
Problem: 
(a) Calculate the work done on a 1500-kg elevator car by its cable to 
lift it 40.0 m at constant speed, assuming friction averages 100 N. (b) 


What is the work done on the lift by the gravitational force in this 
process? (c) What is the total work done on the lift? 


Solution: 
(a) 5.92 x 10° J 
(b) —5.88 x 10° J 


(c) The net force is zero. 
Exercise: 


Problem: 


Suppose a car travels 108 km at a speed of 30.0 m/s, and uses 2.0 gal 
of gasoline. Only 30% of the gasoline goes into useful work by the 
force that keeps the car moving at constant speed despite friction. (See 
[link] for the energy content of gasoline.) (a) What is the magnitude of 
the force exerted to keep the car moving at constant speed? (b) If the 
required force is directly proportional to speed, how many gallons will 
be used to drive 108 km at a speed of 28.0 m/s? 


Exercise: 


Problem: 


Calculate the work done by an 85.0-kg man who pushes a crate 4.00 m 
up along a ramp that makes an angle of 20.0° with the horizontal. (See 
[link].) He exerts a force of 500 N on the crate parallel to the ramp and 
moves at a constant speed. Be certain to include the work he does on 
the crate and on his body to get up the ramp. 


A man pushes a crate up a 
ramp. 


Solution: 
Equation: 


3.14 x 10° J 


Exercise: 


Problem: 


How much work is done by the boy pulling his sister 30.0 m ina 
wagon as shown in [link]? Assume no friction acts on the wagon. 


The boy does work on the 
system of the wagon and the 
child when he pulls them as 

shown. 


Exercise: 


Problem: 


A shopper pushes a grocery cart 20.0 m at constant speed on level 
ground, against a 35.0 N frictional force. He pushes in a direction 
25.0° below the horizontal. (a) What is the work done on the cart by 
friction? (b) What is the work done on the cart by the gravitational 
force? (c) What is the work done on the cart by the shopper? (d) Find 
the force the shopper exerts, using energy considerations. (e) What is 
the total work done on the cart? 


Solution: 
(a) —700 J 
(b) 0 

(c) 700 J 


(d) 38.6 N 


(e) 0 
Exercise: 


Problem: 


Suppose the ski patrol lowers a rescue sled and victim, having a total 
mass of 90.0 kg, down a 60.0° slope at constant speed, as shown in 
[link]. The coefficient of friction between the sled and the snow is 
0.100. (a) How much work is done by friction as the sled moves 30.0 
m along the hill? (6b) How much work is done by the rope on the sled 
in this distance? (c) What is the work done by the gravitational force 
on the sled? (d) What is the total work done? 


A rescue 
sled and 
victim are 
lowered 
down a 


steep 
slope. 


Glossary 


energy 


the ability to do work 


work 
the transfer of energy by a force that causes an object to be displaced; 
the product of the component of the force in the direction of the 
displacement and the magnitude of the displacement 


joule 
SI unit of work and energy, equal to one newton-meter 


Kinetic Energy and the Work-Energy Theorem 


e Explain work as a transfer of energy and net work as the work done by 
the net force. 
e Explain and apply the work-energy theorem. 


Work Transfers Energy 


What happens to the work done on a system? Energy is transferred into the 
system, but in what form? Does it remain in the system or move on? The 
answers depend on the situation. For example, if the lawn mower in [link] 
(a) is pushed just hard enough to keep it going at a constant speed, then 
energy put into the mower by the person is removed continuously by 
friction, and eventually leaves the system in the form of heat transfer. In 
contrast, work done on the briefcase by the person carrying it up stairs in 
[link](d) is stored in the briefcase-Earth system and can be recovered at any 
time, as shown in [link](e). In fact, the building of the pyramids in ancient 
Egypt is an example of storing energy in a system by doing work on the 
system. Some of the energy imparted to the stone blocks in lifting them 
during construction of the pyramids remains in the stone-Earth system and 
has the potential to do work. 


In this section we begin the study of various types of work and forms of 
energy. We will find that some types of work leave the energy of a system 
constant, for example, whereas others change the system in some way, such 
as making it move. We will also develop definitions of important forms of 
energy, such as the energy of motion. 


Net Work and the Work-Energy Theorem 


We know from the study of Newton’s laws in Dynamics: Force and 
Newton's Laws of Motion that net force causes acceleration. We will see in 
this section that work done by the net force gives a system energy of 
motion, and in the process we will also find an expression for the energy of 
motion. 


Let us start by considering the total, or net, work done on a system. Net 
work is defined to be the sum of work done by all external forces—that is, 
net work is the work done by the net external force F’,,.;. In equation form, 
this is Whet = Frnetd cos 0 where @ is the angle between the force vector 
and the displacement vector. 


[link](a) shows a graph of force versus displacement for the component of 
the force in the direction of the displacement—that is, an F' cos 0 vs. d 
graph. In this case, F' cos @ is constant. You can see that the area under the 
graph is Fd cos 8, or the work done. [link](b) shows a more general 
process where the force varies. The area under the curve is divided into 
strips, each having an average force (F' cos O )itureys The work done is 

(F' cos 9) starve) d; for each strip, and the total work done is the sum of the 


W;,. Thus the total work done is the total area under the curve, a useful 
property to which we shall refer later. 


Fcos 64 


PGS es ’ * Area Fcos@xd 


= Fdcos@ 
= work= W 


W = xW, = total area 
under curve 


Focos @ 


W; = (F cos 9)j ave) 4 


(a) A graph of F' cos @ vs. 
d, when F' cos @ is 


constant. The area under 
the curve represents the 
work done by the force. 
(b) A graph of F' cos 0 
vs. d in which the force 
varies. The work done for 
each interval is the area 
of each strip; thus, the 
total area under the curve 
equals the total work 
done. 


Net work will be simpler to examine if we consider a one-dimensional 
situation where a force is used to accelerate an object in a direction parallel 
to its initial velocity. Such a situation occurs for the package on the roller 
belt conveyor system shown in [link]. 


A package on a roller belt is pushed 
horizontally through a distance d. 


The force of gravity and the normal force acting on the package are 
perpendicular to the displacement and do no work. Moreover, they are also 
equal in magnitude and opposite in direction so they cancel in calculating 
the net force. The net force arises solely from the horizontal applied force 
F app and the horizontal friction force f. Thus, as expected, the net force is 


parallel to the displacement, so that @ = 0° and cos @ = 1, and the net 
work is given by 
Equation: 


Wret = Ff, net @. 


The effect of the net force Fy. is to accelerate the package from v9 to v. 
The kinetic energy of the package increases, indicating that the net work 
done on the system is positive. (See [link].) By using Newton’s second law, 
and doing some algebra, we can reach an interesting conclusion. 
Substituting Fy. = ma from Newton’s second law gives 

Equation: 


Wrhet = mad. 


To get a relationship between net work and the speed given to a system by 
the net force acting on it, we take d = x — 2g and use the equation studied 
in Motion Equations for Constant Acceleration in One Dimension for the 
change in speed over a distance d if the acceleration has the constant value 
a; namely, v2 = v9? + 2ad (note that a appears in the expression for the 


2 2 
net work). Solving for acceleration gives a = ~ i . When a is substituted 


into the preceding expression for Wyet, we obtain 
Equation: 


The d cancels, and we rearrange this to obtain 
Equation: 


This expression is called the work-energy theorem, and it actually applies 
in general (even for forces that vary in direction and magnitude), although 
we have derived it for the special case of a constant force parallel to the 
displacement. The theorem implies that the net work on a system equals the 
change in the quantity $mv?. This quantity is our first example of a form 


of energy. 


Note: 

The Work-Energy Theorem 

The net work on a system equals the change in the quantity +mv’. 
Equation: 


The quantity +mv" in the work-energy theorem is defined to be the 
translational kinetic energy (KE) of a mass m moving at a speed v. 
(Translational kinetic energy is distinct from rotational kinetic energy, 
which is considered later.) In equation form, the translational kinetic energy, 
Equation: 


KE = —mv’, 


is the energy associated with translational motion. Kinetic energy is a form 
of energy associated with the motion of a particle, single body, or system of 
objects moving together. 


We are aware that it takes energy to get an object, like a car or the package 
in [link], up to speed, but it may be a bit surprising that kinetic energy is 
proportional to speed squared. This proportionality means, for example, that 
a car traveling at 100 km/h has four times the kinetic energy it has at 50 


km/h, helping to explain why high-speed collisions are so devastating. We 
will now consider a series of examples to illustrate various aspects of work 
and energy. 


Example: 

Calculating the Kinetic Energy of a Package 

Suppose a 30.0-kg package on the roller belt conveyor system in [link] is 
moving at 0.500 m/s. What is its kinetic energy? 

Strategy 

Because the mass m and speed v are given, the kinetic energy can be 
calculated from its definition as given in the equation KE = +mv*. 
Solution 

The kinetic energy is given by 


Equation: 
KE = > mv? 
= me 6 
Entering known values gives 
Equation: 
KE = 0.5(30.0 kg) (0.500 m/s)’, 
which yields 
Equation: 
KE = 3.75 kg: m?/s” = 3.75 J. 
Discussion 


Note that the unit of kinetic energy is the joule, the same as the unit of 
work, as mentioned when work was first defined. It is also interesting that, 
although this is a fairly massive package, its kinetic energy is not large at 
this relatively low speed. This fact is consistent with the observation that 
people can move packages like this without exhausting themselves. 


Example: 

Determining the Work to Accelerate a Package 

Suppose that you push on the 30.0-kg package in [link] with a constant 
force of 120 N through a distance of 0.800 m, and that the opposing 
friction force averages 5.00 N. 

(a) Calculate the net work done on the package. (b) Solve the same 
problem as in part (a), this time by finding the work done by each force 
that contributes to the net force. 

Strategy and Concept for (a) 

This is a motion in one dimension problem, because the downward force 
(from the weight of the package) and the normal force have equal 
magnitude and opposite direction, so that they cancel in calculating the net 
force, while the applied force, friction, and the displacement are all 
horizontal. (See [link].) As expected, the net work is the net force times 
distance. 

Solution for (a) 

The net force is the push force minus friction, or 

Fret= 120 N — 5.00 N = 115 N. Thus the net work is 

Equation: 


Woe = Fred = (115 N) (0.800 m) 
92.0N-m = 92.0 J. 


Discussion for (a) 

This value is the net work done on the package. The person actually does 
more work than this, because friction opposes the motion. Friction does 
negative work and removes some of the energy the person expends and 
converts it to thermal energy. The net work equals the sum of the work 
done by each individual force. 

Strategy and Concept for (b) 

The forces acting on the package are gravity, the normal force, the force of 
friction, and the applied force. The normal force and force of gravity are 
each perpendicular to the displacement, and therefore do no work. 
Solution for (b) 

The applied force does work. 

Equation: 


We =F Bd cos( 0") Fd, 
(120 N)(0.800 m) 
96.0 J 


| 


The friction force and displacement are in opposite directions, so that 
= 180°, and the work done by friction is 
Equation: 


We = Fy,.d cos(180°) = —Fy,d 


—(5.00 N)(0.800 m) 
—4.00 J. 


| 


So the amounts of work done by gravity, by the normal force, by the 
applied force, and by friction are, respectively, 


Equation: 
Vie ae AU, 
Wn = QO, 
Wapp = 96.0 J, 
Wr = —4.00 J. 


The total work done as the sum of the work done by each force is then seen 
to be 
Equation: 


Wotal = Wer sil Wn ais Wapp =i Wr = 92.0 Ae 


Discussion for (b) 

The calculated total work Wo¢a) as the sum of the work by each force 
agrees, as expected, with the work W,y.¢ done by the net force. The work 
done by a collection of forces acting on an object can be calculated by 
either approach. 


Example: 


Determining Speed from Work and Energy 

Find the speed of the package in [link] at the end of the push, using work 
and energy concepts. 

Strategy 

Here the work-energy theorem can be used, because we have just 
calculated the net work, Wnet, and the initial kinetic energy, SmMuvo?. 
These calculations allow us to find the final kinetic energy, +mv’, and 
thus the final speed v. 


Solution 
The work-energy theorem in equation form is 
Equation: 
Woe = ee = Sao 
ne ) y) 


Solving for +mv" gives 


Equation: 
— mv? = Whee + ae 
y te, 

Thus, 

Equation: 


1 
zm — 92.03 + 3.75 J = 95.75 J. 


Solving for the final speed as requested and entering known values gives 


Equation: 
_ / 2(95.75 3) _ _ / 191.5 kg-m2/s? 
ee _S 30.0 kg 


2.53 m/s. 


| 


Discussion 
Using work and energy, we not only arrive at an answer, we see that the 
final kinetic energy is the sum of the initial kinetic energy and the net work 


done on the package. This means that the work indeed adds to the energy 
of the package. 


Example: 

Work and Energy Can Reveal Distance, Too 

How far does the package in [link] coast after the push, assuming friction 
remains constant? Use work and energy considerations. 

Strategy 

We know that once the person stops pushing, friction will bring the 
package to rest. In terms of energy, friction does negative work until it has 
removed all of the package’s kinetic energy. The work done by friction is 
the force of friction times the distance traveled times the cosine of the 
angle between the friction force and displacement; hence, this gives us a 
way of finding the distance traveled after the person stops pushing. 
Solution 

The normal force and force of gravity cancel in calculating the net force. 
The horizontal friction force is then the net force, and it acts opposite to the 
displacement, so 8 = 180°. To reduce the kinetic energy of the package to 
zero, the work W¢, by friction must be minus the kinetic energy that the 
package started with plus what the package accumulated due to the 
pushing. Thus Wz, = —95.75 J. Furthermore, W;, = fd/ cos 0 = —fdi, 
where d/ is the distance it takes to stop. Thus, 


Equation: 
ae We, __ —95.75 a 
f 5.00 N 
and so 
Equation: 
di= 19.2 m. 
Discussion 


This is a reasonable distance for a package to coast on a relatively friction- 
free conveyor system. Note that the work done by friction is negative (the 


force is in the opposite direction of motion), so it removes the kinetic 
energy. 


Some of the examples in this section can be solved without considering 
energy, but at the expense of missing out on gaining insights about what 
work and energy are doing in this situation. On the whole, solutions 
involving energy are generally shorter and easier than those using 
kinematics and dynamics alone. 


Section Summary 


e The net work Wye is the work done by the net force acting on an 
object. 

e Work done on an object transfers energy to the object. 

e The translational kinetic energy of an object of mass m moving at 
speed v is KE = +mv?. 

e The work-energy theorem states that the net work Wy.¢4 on a system 


changes its kinetic energy, Wnet = +mv" — +mMvo". 


Conceptual Questions 


Exercise: 


Problem: 


The person in [link] does work on the lawn mower. Under what 
conditions would the mower gain energy? Under what conditions 


would it lose energy? 
W= Fdcos 6 


Exercise: 
Problem: 
Work done on a system puts energy into it. Work done by a system 
removes energy from it. Give an example for each statement. 
Exercise: 


Problem: 


When solving for speed in [link], we kept only the positive root. Why? 


Problems & Exercises 


Exercise: 
Problem: 


Compare the kinetic energy of a 20,000-kg truck moving at 110 km/h 
with that of an 80.0-kg astronaut in orbit moving at 27,500 km/h. 


Solution: 


1/250 
Exercise: 
Problem: 
(a) How fast must a 3000-kg elephant move to have the same kinetic 
energy as a 65.0-kg sprinter running at 10.0 m/s? (b) Discuss how the 


larger energies needed for the movement of larger animals would 
relate to metabolic rates. 


Exercise: 
Problem: 
Confirm the value given for the kinetic energy of an aircraft carrier in 


[link]. You will need to look up the definition of a nautical mile (1 knot 
= 1 nautical mile/h). 


Solution: 


|e ca ae 

Exercise: 
Problem: 
(a) Calculate the force needed to bring a 950-kg car to rest from a 
speed of 90.0 km/h in a distance of 120 m (a fairly typical distance for 
a non-panic stop). (b) Suppose instead the car hits a concrete abutment 


at full speed and is brought to a stop in 2.00 m. Calculate the force 
exerted on the car and compare it with the force found in part (a). 


Exercise: 
Problem: 
A car’s bumper is designed to withstand a 4.0-km/h (1.1-m/s) collision 
with an immovable object without damage to the body of the car. The 
bumper cushions the shock by absorbing the force over a distance. 
Calculate the magnitude of the average force on a bumper that 


collapses 0.200 m while bringing a 900-kg car to rest from an initial 
speed of 1.1 m/s. 


Solution: 


2.8 x 10° N 


Exercise: 


Problem: 


Boxing gloves are padded to lessen the force of a blow. (a) Calculate 
the force exerted by a boxing glove on an opponent’s face, if the glove 
and face compress 7.50 cm during a blow in which the 7.00-kg arm 
and glove are brought to rest from an initial speed of 10.0 m/s. (b) 
Calculate the force exerted by an identical blow in the gory old days 
when no gloves were used and the knuckles and face would compress 
only 2.00 cm. (c) Discuss the magnitude of the force with glove on. 
Does it seem high enough to cause damage even though it is lower 
than the force with no glove? 


Exercise: 
Problem: 
Using energy considerations, calculate the average force a 60.0-kg 
sprinter exerts backward on the track to accelerate from 2.00 to 8.00 


m/s in a distance of 25.0 m, if he encounters a headwind that exerts an 
average force of 30.0 N against him. 


Solution: 


102 N 


Glossary 


net work 
work done by the net force, or vector sum of all the forces, acting on 
an object 


work-energy theorem 
the result, based on Newton’s laws, that the net work done on an object 
is equal to its change in kinetic energy 


kinetic energy 


the energy an object has by reason of its motion, equal to smu" for 


the translational (i.e., non-rotational) motion of an object of mass m 
moving at speed v 


Gravitational Potential Energy 


e Explain gravitational potential energy in terms of work done against gravity. 

e Show that the gravitational potential energy of an object of mass m at height h 
on Earth is given by PE, = mgh. 

e Show how knowledge of the potential energy as a function of position can be 
used to simplify calculations and explain physical phenomena. 


Work Done Against Gravity 


Climbing stairs and lifting objects is work in both the scientific and everyday sense 
— it is work done against the gravitational force. When there is work, there is a 
transformation of energy. The work done against the gravitational force goes into an 
important form of stored energy that we will explore in this section. 


Let us calculate the work done in lifting an object of mass m through a height h, 
such as in [link]. If the object is lifted straight up at constant speed, then the force 
needed to lift it is equal to its weight mg. The work done on the mass is then 

W = Fd = mgh. We define this to be the gravitational potential energy (PE, ) 
put into (or gained by) the object-Earth system. This energy is associated with the 
state of separation between two objects that attract each other by the gravitational 
force. For convenience, we refer to this as the PE, gained by the object, 
recognizing that this is energy stored in the gravitational field of Earth. Why do we 
use the word “system”? Potential energy is a property of a system rather than of a 
single object—due to its physical position. An object’s gravitational potential is due 
to its position relative to the surroundings within the Earth-object system. The force 
applied to the object is an external force, from outside the system. When it does 
positive work it increases the gravitational potential energy of the system. Because 
gravitational potential energy depends on relative position, we need a reference 
level at which to set the potential energy equal to 0. We usually choose this point to 
be Earth’s surface, but this point is arbitrary; what is important is the difference in 
gravitational potential energy, because this difference is what relates to the work 
done. The difference in gravitational potential energy of an object (in the Earth- 
object system) between two rungs of a ladder will be the same for the first two rungs 
as for the last two rungs. 


Converting Between Potential Energy and Kinetic Energy 


Gravitational potential energy may be converted to other forms of energy, such as 
kinetic energy. If we release the mass, gravitational force will do an amount of work 


equal to mgh on it, thereby increasing its kinetic energy by that same amount (by 
the work-energy theorem). We will find it more useful to consider just the 
conversion of PE, to KE without explicitly considering the intermediate step of 
work. (See [link].) This shortcut makes it is easier to solve problems using energy 
(if possible) rather than explicitly using forces. 


E = mgh 


(b) 


(a) The work done to lift the weight is 
stored in the mass-Earth system as 
gravitational potential energy. (b) As 
the weight moves downward, this 
gravitational potential energy is 
transferred to the cuckoo clock. 


More precisely, we define the change in gravitational potential energy APE, to be 
Equation: 


APE, = mgh, 


where, for simplicity, we denote the change in height by A rather than the usual Ah. 
Note that h is positive when the final height is greater than the initial height, and 
vice versa. For example, if a 0.500-kg mass hung from a cuckoo clock is raised 1.00 
m, then its change in gravitational potential energy is 

Equation: 


mgh = (0.500kg) (9.80 m/s”) (1.00 m) 


4.90 kg - m?/s’= 4.90 J. 


Note that the units of gravitational potential energy turn out to be joules, the same as 
for work and other forms of energy. As the clock runs, the mass is lowered. We can 
think of the mass as gradually giving up its 4.90 J of gravitational potential energy, 
without directly considering the force of gravity that does the work. 


Using Potential Energy to Simplify Calculations 


The equation APE, = mgh applies for any path that has a change in height of h, 
not just when the mass is lifted straight up. (See [link].) It is much easier to calculate 
mgh (a simple multiplication) than it is to calculate the work done along a 
complicated path. The idea of gravitational potential energy has the double 
advantage that it is very broadly applicable and it makes calculations easier. From 
now on, we will consider that any change in vertical position h of a mass m is 
accompanied by a change in gravitational potential energy mgh, and we will avoid 
the equivalent but more difficult task of calculating work done by or against the 
gravitational force. 


t 


A 


The change in 
gravitational 
potential energy 
(APE,) 
between points 
A and B is 
independent of 
the path. 
APE, = mgh 
for any path 
between the two 
points. Gravity 
is one of a small 
class of forces 
where the work 
done by or 
against the force 
depends only on 
the starting and 
ending points, 
not on the path 
between them. 


Example: 

The Force to Stop Falling 

A 60.0-kg person jumps onto the floor from a height of 3.00 m. If he lands stiffly 
(with his knee joints compressing by 0.500 cm), calculate the force on the knee 
joints. 

Strategy 

This person’s energy is brought to zero in this situation by the work done on him by 
the floor as he stops. The initial PE, is transformed into KE as he falls. The work 
done by the floor reduces this kinetic energy to zero. 

Solution 

The work done on the person by the floor as he stops is given by 

Equation: 


W = Fd cos 6 = —Fd, 


with a minus sign because the displacement while stopping and the force from floor 
are in opposite directions (cos 8 = cos 180° = —1). The floor removes energy 
from the system, so it does negative work. 

The kinetic energy the person has upon reaching the floor is the amount of potential 
energy lost by falling through height h: 

Equation: 


KE = —APE, = —mgh, 


The distance d that the person’s knees bend is much smaller than the height h of the 
fall, so the additional change in gravitational potential energy during the knee bend 
is ignored. 

The work W done by the floor on the person stops the person and brings the 
person’s kinetic energy to zero: 

Equation: 


W = —KE = mgh. 


Combining this equation with the expression for W gives 
Equation: 


—Fd = mgh. 


Recalling that h is negative because the person fell down, the force on the knee 
joints is given by 
Equation: 


(60.0 kg) (9.80 m/s”) (—3.00 m) 


ES SS ee 5 0 N 
d 5.00 x 10-2 m 


Discussion 

Such a large force (500 times more than the person’s weight) over the short impact 
time is enough to break bones. A much better way to cushion the shock is by 
bending the legs or rolling on the ground, increasing the time over which the force 
acts. A bending motion of 0.5 m this way yields a force 100 times smaller than in 
the example. A kangaroo's hopping shows this method in action. The kangaroo is 
the only large animal to use hopping for locomotion, but the shock in hopping is 
cushioned by the bending of its hind legs in each jump.(See [link].) 


The work done by the 
ground upon the 
kangaroo reduces its 
kinetic energy to zero 
as it lands. However, by 
applying the force of 
the ground on the hind 
legs over a longer 
distance, the impact on 
the bones is reduced. 
(credit: Chris Samuel, 
Flickr) 


Example: 

Finding the Speed of a Roller Coaster from its Height 

(a) What is the final speed of the roller coaster shown in [link] if it starts from rest 
at the top of the 20.0 m hill and work done by frictional forces is negligible? (b) 
What is its final speed (again assuming negligible friction) if its initial speed is 5.00 
m/s? 


The speed of a roller coaster increases as gravity pulls it 
downhill and is greatest at its lowest point. Viewed in terms 
of energy, the roller-coaster-Earth system’s gravitational 
potential energy is converted to kinetic energy. If work done 
by friction is negligible, all APE, is converted to KE. 


Strategy 

The roller coaster loses potential energy as it goes downhill. We neglect friction, so 
that the remaining force exerted by the track is the normal force, which is 
perpendicular to the direction of motion and does no work. The net work on the 
roller coaster is then done by gravity alone. The loss of gravitational potential 
energy from moving downward through a distance h equals the gain in kinetic 
energy. This can be written in equation form as -APE, = AKE. Using the 
equations for PE, and KE, we can solve for the final speed v, which is the desired 
quantity. 

Solution for (a) 

Here the initial kinetic energy is zero, so that AKE = Smv’. The equation for 
change in potential energy states that APE, = mgh. Since h is negative in this 
case, we will rewrite this as APE, = —mg | h | to show the minus sign clearly. 
Thus, 


Equation: 
—APE, = AKE 


becomes 
Equation: 


1 
mg | h |= zm 


Solving for v, we find that mass cancels and that 


Equation: 
v= 1/2g|h|. 


Substituting known values, 
Equation: 


ie \? (9.80 m/s”) (20.0 m) 
= 19.8m/s. 


Solution for (b) 
Again -APE, = AKE. In this case there is initial kinetic energy, so 


TS +mv" = $mMvo". Thus, 
Equation: 


1 i) 
mg | h |= zm - amvo 


Rearranging gives 

Equation: 
1 2 1 2 
—mv* =mg|h| +—mr~. 
9 g | | 5 0 


This means that the final kinetic energy is the sum of the initial kinetic energy and 
the gravitational potential energy. Mass again cancels, and 


Equation: 
v= 1/2g|h| +v0?. 


This equation is very similar to the kinematics equation v = 7, ug? + 2ad, but it is 
more general—the kinematics equation is valid only for constant acceleration, 
whereas our equation above is valid for any path regardless of whether the object 
moves with a constant acceleration. Now, substituting known values gives 
Equation: 


v= 1/2(9.80 m/s”)(20.0 m) + (5.00 m/s)? 
= 20.4m/s. 


Discussion and Implications 

First, note that mass cancels. This is quite consistent with observations made in 
Falling Objects that all objects fall at the same rate if friction is negligible. Second, 
only the speed of the roller coaster is considered; there is no information about its 
direction at any point. This reveals another general truth. When friction is 
negligible, the speed of a falling body depends only on its initial speed and height, 
and not on its mass or the path taken. For example, the roller coaster will have the 
same final speed whether it falls 20.0 m straight down or takes a more complicated 
path like the one in the figure. Third, and perhaps unexpectedly, the final speed in 
part (b) is greater than in part (a), but by far less than 5.00 m/s. Finally, note that 
speed can be found at any height along the way by simply using the appropriate 
value of h at the point of interest. 


We have seen that work done by or against the gravitational force depends only on 
the starting and ending points, and not on the path between, allowing us to define 
the simplifying concept of gravitational potential energy. We can do the same thing 
for a few other forces, and we will see that this leads to a formal definition of the 
law of conservation of energy. 


Note: 

Making Connections: Take-Home Investigation—Converting Potential to Kinetic 
Energy 

One can study the conversion of gravitational potential energy into kinetic energy 
in this experiment. On a smooth, level surface, use a ruler of the kind that has a 
groove running along its length and a book to make an incline (see [link]). Place a 
marble at the 10-cm position on the ruler and let it roll down the ruler. When it hits 
the level surface, measure the time it takes to roll one meter. Now place the marble 


at the 20-cm and the 30-cm positions and again measure the times it takes to roll 1 
m on the level surface. Find the velocity of the marble on the level surface for all 
three positions. Plot velocity squared versus the distance traveled by the marble. 
What is the shape of each plot? If the shape is a straight line, the plot shows that the 
marble’s kinetic energy at the bottom is proportional to its potential energy at the 
release point. 


Q 


A marble rolls down a ruler, and its speed on the 


level surface is measured. 


Section Summary 


Work done against gravity in lifting an object becomes potential energy of the 
object-Earth system. 

The change in gravitational potential energy, APE,, is APE, = mgh, with h 
being the increase in height and g the acceleration due to gravity. 

The gravitational potential energy of an object near Earth’s surface is due to its 
position in the mass-Earth system. Only differences in gravitational potential 
energy, APE,, have physical significance. 

As an object descends without friction, its gravitational potential energy 
changes into kinetic energy corresponding to increasing speed, so that 

AKE= —APE,. 


Conceptual Questions 


Exercise: 


Problem: 


In [link], we calculated the final speed of a roller coaster that descended 20 m 
in height and had an initial speed of 5 m/s downhill. Suppose the roller coaster 
had had an initial speed of 5 m/s uphill instead, and it coasted uphill, stopped, 
and then rolled back down to a final point 20 m below the start. We would find 
in that case that its final speed is the same as its initial speed. Explain in terms 
of conservation of energy. 


Exercise: 
Problem: 
Does the work you do on a book when you lift it onto a shelf depend on the 


path taken? On the time taken? On the height of the shelf? On the mass of the 
book? 


Problems & Exercises 


Exercise: 


Problem: 


A hydroelectric power facility (see [link]) converts the gravitational potential 
energy of water behind a dam to electric energy. (a) What is the gravitational 
potential energy relative to the generators of a lake of volume 50.0 km? ( 
mass = 5.00 x 10'° kg), given that the lake has an average height of 40.0 m 
above the generators? (b) Compare this with the energy stored in a 9-megaton 
fusion bomb. 


Hydroelectric facility (credit: Denis 


Belevich, Wikimedia Commons) 


Solution: 
(a) 1.961016 J 


(b) The ratio of gravitational potential energy in the lake to the energy stored in 
the bomb is 0.52. That is, the energy stored in the lake is approximately half 
that in a 9-megaton fusion bomb. 


Exercise: 
Problem: 
(a) How much gravitational potential energy (relative to the ground on which it 
is built) is stored in the Great Pyramid of Cheops, given that its mass is about 


7 x 10° kg and its center of mass is 36.5 m above the surrounding ground? (b) 
How does this energy compare with the daily food intake of a person? 


Exercise: 
Problem: 
Suppose a 350-g kookaburra (a large kingfisher bird) picks up a 75-g snake and 
raises it 2.5 m from the ground to a branch. (a) How much work did the bird do 


on the snake? (b) How much work did it do to raise its own center of mass to 
the branch? 


Solution: 
(a) 1.8 J 


(b) 8.6 J 
Exercise: 
Problem: 
In [link], we found that the speed of a roller coaster that had descended 20.0 m 
was only slightly greater when it had an initial speed of 5.00 m/s than when it 


started from rest. This implies that APE >> KE;. Confirm this statement by 
taking the ratio of APE to KE;. (Note that mass cancels.) 


Exercise: 


Problem: 


A 100-g toy car is propelled by a compressed spring that starts it moving. The 
car follows the curved track in [link]. Show that the final speed of the toy car is 
0.687 m/s if its initial speed is 2.00 m/s and it coasts up the frictionless slope, 
gaining 0.180 m in altitude. 


A toy car moves up a sloped track. 
(credit: Leszek Leszczynski, Flickr) 


Solution: 
Equation: 


Uf= /2gh + v9? = \/2(9.80 m/s”)(—0.180 m) + (2.00 m/s)? = 0.687 m/s 


Exercise: 


Problem: 


In a downhill ski race, surprisingly, little advantage is gained by getting a 
running start. (This is because the initial kinetic energy is small compared with 
the gain in gravitational potential energy on even small hills.) To demonstrate 
this, find the final speed and the time taken for a skier who skies 70.0 m along 
a 30° slope neglecting friction: (a) Starting from rest. (b) Starting with an initial 
speed of 2.50 m/s. (c) Does the answer surprise you? Discuss why it is still 
advantageous to get a running start in very competitive events. 


Glossary 


gravitational potential energy 
the energy an object has due to its position in a gravitational field 


Conservative Forces and Potential Energy 


e Define conservative force, potential energy, and mechanical energy. 

e Explain the potential energy of a spring in terms of its compression 
when Hooke’s law applies. 

e Use the work-energy theorem to show how having only conservative 
forces implies conservation of mechanical energy. 


Potential Energy and Conservative Forces 


Work is done by a force, and some forces, such as weight, have special 
characteristics. A conservative force is one, like the gravitational force, for 
which work done by or against it depends only on the starting and ending 
points of a motion and not on the path taken. We can define a potential 
energy (PE) for any conservative force, just as we did for the gravitational 
force. For example, when you wind up a toy, an egg timer, or an old- 
fashioned watch, you do work against its spring and store energy in it. (We 
treat these springs as ideal, in that we assume there is no friction and no 
production of thermal energy.) This stored energy is recoverable as work, 
and it is useful to think of it as potential energy contained in the spring. 
Indeed, the reason that the spring has this characteristic is that its force is 
conservative. That is, a conservative force results in stored or potential 
energy. Gravitational potential energy is one example, as is the energy 
stored in a spring. We will also see how conservative forces are related to 
the conservation of energy. 


Note: 

Potential Energy and Conservative Forces 

Potential energy is the energy a system has due to position, shape, or 
configuration. It is stored energy that is completely recoverable. 

A conservative force is one for which work done by or against it depends 
only on the starting and ending points of a motion and not on the path 
taken. 

We can define a potential energy (PE) for any conservative force. The 
work done against a conservative force to reach a final configuration 


depends on the configuration, not the path followed, and is the potential 
energy added. 


Potential Energy of a Spring 


First, let us obtain an expression for the potential energy stored in a spring ( 
PE,). We calculate the work done to stretch or compress a spring that obeys 
Hooke’s law. (Hooke’s law was examined in Elasticity: Stress and Strain, 
and states that the magnitude of force F’ on the spring and the resulting 
deformation AL are proportional, F = KAZ.) (See [link].) For our spring, 
we will replace AL (the amount of deformation produced by a force F’) by 
the distance x that the spring is stretched or compressed along its length. So 
the force needed to stretch the spring has magnitude F = kx, where k is the 
spring’s force constant. The force increases linearly from 0 at the start to kx 
in the fully stretched position. The average force is kx /2. Thus the work 
done in stretching or compressing the spring is 

W,=Fd= ()a = + kx’. Alternatively, we noted in Kinetic Energy 
and the Work-Energy Theorem that the area under a graph of F' vs. z is the 
work done by the force. In [link](c) we see that this area is also + kx*, We 


therefore define the potential energy of a spring, PE,, to be 
Equation: 


where k is the spring’s force constant and z is the displacement from its 
undeformed position. The potential energy represents the work done on the 
spring and the energy stored in it as a result of stretching or compressing it 
a distance x. The potential energy of the spring PE, does not depend on the 
path taken; it depends only on the stretch or squeeze z in the final 
configuration. 


(a) 


(a) An undeformed spring has no PE, stored in it. (b) The force 
needed to stretch (or compress) the spring a distance zx has a 
magnitude F' = ka , and the work done to stretch (or compress) it is 
Ska’. Because the force is conservative, this work is stored as 
potential energy (PE;) in the spring, and it can be fully recovered. 
(c) A graph of F' vs. x has a slope of k, and the area under the graph 


is + ke? Thus the work done or potential energy stored is tke? 


The equation PE, = Ska’? has general validity beyond the special case for 
which it was derived. Potential energy can be stored in any elastic medium 
by deforming it. Indeed, the general definition of potential energy is 
energy due to position, shape, or configuration. For shape or position 
deformations, stored energy is PE, = Ska’, where k is the force constant 
of the particular system and z is its deformation. Another example is seen 
in [link] for a guitar string. 


Work is done 
to deform the 


guitar string, 
giving it 
potential 
energy. 
When 
released, the 
potential 
energy is 
converted to 
kinetic 
energy and 
back to 
potential as 
the string 
oscillates 
back and 
forth. A very 
small 
fraction is 
dissipated as 


sound 
energy, 
slowly 
removing 
energy from 
the string. 


Conservation of Mechanical Energy 


Let us now consider what form the work-energy theorem takes when only 
conservative forces are involved. This will lead us to the conservation of 
energy principle. The work-energy theorem states that the net work done by 
all forces acting on a system equals its change in kinetic energy. In equation 
form, this is 

Equation: 


1 1 
Wie = zm — mv — AKE. 


If only conservative forces act, then 
Equation: 


Wet = W. ) 


where W, is the total work done by all conservative forces. Thus, 
Equation: 


W, = AKE. 


Now, if the conservative force, such as the gravitational force or a spring 
force, does work, the system loses potential energy. That is, W, = —APE. 
Therefore, 

Equation: 


—APE = AKE 


or 
Equation: 


AKE + APE = 0. 


This equation means that the total kinetic and potential energy is constant 
for any process involving only conservative forces. That is, 
Equation: 


KE + PE = constant 


or (conservative forces only), 
KE; + PE; = KE; + PEs 


where i and f denote initial and final values. This equation is a form of the 
work-energy theorem for conservative forces; it is known as the 
conservation of mechanical energy principle. Remember that this applies 
to the extent that all the forces are conservative, so that friction is 
negligible. The total kinetic plus potential energy of a system is defined to 
be its mechanical energy, (KE + PE). In a system that experiences only 
conservative forces, there is a potential energy associated with each force, 
and the energy only changes form between KE and the various types of PE 
, with the total energy remaining constant. 


Example: 

Using Conservation of Mechanical Energy to Calculate the Speed of a 
Toy Car 

A 0.100-kg toy car is propelled by a compressed spring, as shown in [link]. 
The car follows a track that rises 0.180 m above the starting point. The 
spring is compressed 4.00 cm and has a force constant of 250.0 N/m. 
Assuming work done by friction to be negligible, find (a) how fast the car 


is going before it starts up the slope and (b) how fast it is going at the top 
of the slope. 


Path of the Car 


Alternate path 


A toy car is pushed by a compressed spring and coasts 
up a slope. Assuming negligible friction, the potential 
energy in the spring is first completely converted to 
kinetic energy, and then to a combination of kinetic and 
gravitational potential energy as the car rises. The details 
of the path are unimportant because all forces are 
conservative—the car would have the same final speed if 
it took the alternate path shown. 


Strategy 

The spring force and the gravitational force are conservative forces, so 
conservation of mechanical energy can be used. Thus, 

Equation: 


KE; + PE; = KE; + PEs 


or 
Equation: 


re + mgh, + hae = eo + mgh, + ae 
2 2 2 2 

where h is the height (vertical position) and x is the compression of the 
spring. This general statement looks complex but becomes much simpler 
when we start considering specific situations. First, we must identify the 
initial and final conditions in a problem; then, we enter them into the last 
equation to solve for an unknown. 

Solution for (a) 


This part of the problem is limited to conditions just before the car is 
released and just after it leaves the spring. Take the initial height to be zero, 
so that both h; and hg are zero. Furthermore, the initial speed v; is zero and 
the final compression of the spring x¢ is zero, and so several terms in the 
conservation of mechanical energy equation are zero and it simplifies to 
Equation: 

= he? = smu. 
In other words, the initial potential energy in the spring is converted 
completely to kinetic energy in the absence of friction. Solving for the final 
speed and entering known values yields 
Equation: 


k 
Uf = m “i 


250.0 N/m 


= 2.00 m/s. 


Solution for (b) 

One method of finding the speed at the top of the slope is to consider 
conditions just before the car is released and just after it reaches the top of 
the slope, completely ignoring everything in between. Doing the same type 
of analysis to find which terms are zero, the conservation of mechanical 
energy becomes 

Equation: 


1 
Se amr + mghr. 


This form of the equation means that the spring’s initial potential energy is 
converted partly to gravitational potential energy and partly to kinetic 
energy. The final speed at the top of the slope will be less than at the 
bottom. Solving for v¢ and substituting known values gives 

Equation: 


Ce y BE — 2gh; 


= i/ (Sane ) (0.0400 m)? — 2(9.80 m/s”) (0.180 m) 


0.687 m/s. 


Discussion 

Another way to solve this problem is to realize that the car’s kinetic energy 
before it goes up the slope is converted partly to potential energy—that is, 
to take the final conditions in part (a) to be the initial conditions in part (b). 


Note that, for conservative forces, we do not directly calculate the work 
they do; rather, we consider their effects through their corresponding 
potential energies, just as we did in [link]. Note also that we do not consider 
details of the path taken—only the starting and ending points are important 
(as long as the path is not impossible). This assumption is usually a 
tremendous simplification, because the path may be complicated and forces 
may vary along the way. 


Note: 

PhET Explorations: Energy Skate Park 

Learn about conservation of energy with a skater dude! Build tracks, ramps 
and jumps for the skater and view the kinetic energy, potential energy and 
friction as he moves. You can also take the skater to different planets or 
even space! 
https://phet.colorado.edu/sims/html/energy-skate-park-basics/latest/energy- 
skate-park-basics_en.html 


Section Summary 


e A conservative force is one for which work depends only on the 
Starting and ending points of a motion, not on the path taken. 

e We can define potential energy (PE) for any conservative force, just 
as we defined PE, for the gravitational force. 


¢ The potential energy of a spring is PE, = skx’, where k is the 


spring’s force constant and z is the displacement from its undeformed 
position. 
¢ Mechanical energy is defined to be KE + PE for a conservative force. 
¢ When only conservative forces act on and within a system, the total 
mechanical energy is constant. In equation form, 


Equation: 


KE + PE = constant 
or 
KE; + PE; = KE; + PEs 


where i and f denote initial and final values. This is known as the 
conservation of mechanical energy. 


Conceptual Questions 


Exercise: 


Problem: What is a conservative force? 

Exercise: 
Problem: 
The force exerted by a diving board is conservative, provided the 
internal friction is negligible. Assuming friction is negligible, describe 
changes in the potential energy of a diving board as a swimmer dives 


from it, starting just before the swimmer steps on the board until just 
after his feet leave it. 


Exercise: 


Problem: 


Define mechanical energy. What is the relationship of mechanical 
energy to nonconservative forces? What happens to mechanical energy 
if only conservative forces act? 


Exercise: 


Problem: 


What is the relationship of potential energy to conservative force? 


Problems & Exercises 


Exercise: 
Problem: 
A 5.00 x 10°-kg subway train is brought to a stop from a speed of 


0.500 m/s in 0.400 m by a large spring bumper at the end of its track. 
What is the force constant k of the spring? 


Solution: 
Equation: 


7.81x10° N/m 


Exercise: 


Problem: 


A pogo stick has a spring with a force constant of 2.50 10+ N /m, 
which can be compressed 12.0 cm. To what maximum height can a 
child jump on the stick using only the energy in the spring, if the child 
and stick have a total mass of 40.0 kg? Explicitly show how you 
follow the steps in the Problem-Solving Strategies for Energy. 


Glossary 


conservative force 
a force that does the same work for any given initial and final 
configuration, regardless of the path followed 


potential energy 
energy due to position, shape, or configuration 


potential energy of a spring 
the stored energy of a spring as a function of its displacement; when 
Hooke’s law applies, it is given by the expression Ska? where z is the 
distance the spring is compressed or extended and k is the spring 
constant 


conservation of mechanical energy 
the rule that the sum of the kinetic energies and potential energies 
remains constant if only conservative forces act on and within a system 


mechanical energy 
the sum of kinetic energy and potential energy 


Nonconservative Forces 


e Define nonconservative forces and explain how they affect mechanical 
energy. 

¢ Show how the principle of conservation of energy can be applied by 
treating the conservative forces in terms of their potential energies and 
any nonconservative forces in terms of the work they do. 


Nonconservative Forces and Friction 


Forces are either conservative or nonconservative. Conservative forces were 
discussed in Conservative Forces and Potential Energy. A nonconservative 
force is one for which work depends on the path taken. Friction is a good 
example of a nonconservative force. As illustrated in [link], work done 
against friction depends on the length of the path between the starting and 
ending points. Because of this dependence on path, there is no potential 
energy associated with nonconservative forces. An important characteristic 
is that the work done by a nonconservative force adds or removes 
mechanical energy from a system. Friction, for example, creates thermal 
energy that dissipates, removing energy from the system. Furthermore, 
even if the thermal energy is retained or captured, it cannot be fully 
converted back to work, so it is lost or not recoverable in that sense as well. 


(a) 


The amount of the happy face erased 
depends on the path taken by the 
eraser between points A and B, as 
does the work done against friction. 
Less work is done and less of the face 


is erased for the path in (a) than for 
the path in (b). The force here is 
friction, and most of the work goes 
into thermal energy that subsequently 
leaves the system (the happy face plus 
the eraser). The energy expended 
cannot be fully recovered. 


How Nonconservative Forces Affect Mechanical Energy 


Mechanical energy may not be conserved when nonconservative forces act. 
For example, when a car is brought to a stop by friction on level ground, it 
loses kinetic energy, which is dissipated as thermal energy, reducing its 
mechanical energy. [link] compares the effects of conservative and 
nonconservative forces. We often choose to understand simpler systems 
such as that described in [link](a) first before studying more complicated 
systems as in [link](b). 


System 


Heat, 
sound, and 
deformation 
of ground 


Wi 


PE, = PE, = KE + PE, 
(a) (b) 


Comparison of the effects of conservative and 
nonconservative forces on the mechanical energy 
of a system. (a) A system with only conservative 


forces. When a rock is dropped onto a spring, its 
mechanical energy remains constant (neglecting 
air resistance) because the force in the spring is 
conservative. The spring can propel the rock back 
to its original height, where it once again has only 
potential energy due to gravity. (b) A system with 
nonconservative forces. When the same rock is 
dropped onto the ground, it is stopped by 
nonconservative forces that dissipate its 
mechanical energy as thermal energy, sound, and 
surface distortion. The rock has lost mechanical 
energy. 


How the Work-Energy Theorem Applies 


Now let us consider what form the work-energy theorem takes when both 
conservative and nonconservative forces act. We will see that the work done 
by nonconservative forces equals the change in the mechanical energy of a 
system. As noted in Kinetic Energy and the Work-Energy Theorem, the 
work-energy theorem states that the net work on a system equals the change 
in its kinetic energy, or Wy, = AKE. The net work is the sum of the work 
by nonconservative forces plus the work by conservative forces. That is, 
Equation: 


Wret = Wae + W.; 


so that 
Equation: 


Woe + We = AKE, 


where W,, is the total work done by all nonconservative forces and W, is 
the total work done by all conservative forces. 


A person pushes a crate up a 
ramp, doing work on the 
crate. Friction and 
gravitational force (not 
shown) also do work on the 
crate; both forces oppose the 
person’s push. As the crate is 
pushed up the ramp, it gains 
mechanical energy, implying 
that the work done by the 
person is greater than the 
work done by friction. 


Consider [link], in which a person pushes a crate up a ramp and is opposed 
by friction. As in the previous section, we note that work done by a 
conservative force comes from a loss of gravitational potential energy, so 


that W. = —APE. Substituting this equation into the previous one and 
solving for Wye gives 
Equation: 


Wace = AKE + APE. 


This equation means that the total mechanical energy (KE + PE) changes 
by exactly the amount of work done by nonconservative forces. In [link], 
this is the work done by the person minus the work done by friction. So 
even if energy is not conserved for the system of interest (such as the crate), 
we know that an equal amount of work was done to cause the change in 
total mechanical energy. 


We rearrange W,, = AKE + APE to obtain 
Equation: 


KE; + PE; + Wa. = KEs + PEs. 


This means that the amount of work done by nonconservative forces adds to 
the mechanical energy of a system. If W,,. is positive, then mechanical 
energy is increased, such as when the person pushes the crate up the ramp 
in [link]. If Wn. is negative, then mechanical energy is decreased, such as 
when the rock hits the ground in [link](b). If W,. is zero, then mechanical 
energy is conserved, and nonconservative forces are balanced. For example, 
when you push a lawn mower at constant speed on level ground, your work 
done is removed by the work of friction, and the mower has a constant 
energy. 


Applying Energy Conservation with Nonconservative Forces 


When no change in potential energy occurs, applying 

KE; + PE; + Wae = KE¢ + PEs amounts to applying the work-energy 
theorem by setting the change in kinetic energy to be equal to the net work 
done on the system, which in the most general case includes both 
conservative and nonconservative forces. But when seeking instead to find 
a change in total mechanical energy in situations that involve changes in 
both potential and kinetic energy, the previous equation 

KE; + PE; + Wace = KE¢ + PEs says that you can start by finding the 
change in mechanical energy that would have resulted from just the 
conservative forces, including the potential energy changes, and add to it 
the work done, with the proper sign, by any nonconservative forces 
involved. 


Example: 

Calculating Distance Traveled: How Far a Baseball Player Slides 
Consider the situation shown in [link], where a baseball player slides to a 
stop on level ground. Using energy considerations, calculate the distance 


the 65.0-kg baseball player slides, given that his initial speed is 6.00 m/s 
and the force of friction against him is a constant 450 N. 


The baseball player slides to a stop in a distance 
d. In the process, friction removes the player’s 
kinetic energy by doing an amount of work fd 

equal to the initial kinetic energy. 


Strategy 

Friction stops the player by converting his kinetic energy into other forms, 
including thermal energy. In terms of the work-energy theorem, the work 
done by friction, which is negative, is added to the initial kinetic energy to 
reduce it to zero. The work done by friction is negative, because f is in the 


opposite direction of the motion (that is, 9 = 180°, and so cos 8 = —1). 
Thus W,,. = —fd. The equation simplifies to 
Equation: 

1 2 

—mv; —fd=0 

5 mw 
or 
Equation: 

1 
id. = amu. 


This equation can now be solved for the distance d. 
Solution 


Solving the previous equation for d and substituting known values yields 


Equation: 
oa m iC 
d= oF 
(65.0 kg)(6.00 m/s)? 
(2)(450 N) 
= 2.60 m. 
Discussion 


The most important point of this example is that the amount of 
nonconservative work equals the change in mechanical energy. For 
example, you must work harder to stop a truck, with its large mechanical 
energy, than to stop a mosquito. 


Example: 

Calculating Distance Traveled: Sliding Up an Incline 

Suppose that the player from [link] is running up a hill having a 5.00° 
incline upward with a surface similar to that in the baseball stadium. The 
player slides with the same initial speed, and the frictional force is still 450 
N. Determine how far he slides. 


The same baseball player slides to a stop on a 5.00° slope. 


Strategy 

In this case, the work done by the nonconservative friction force on the 
player reduces the mechanical energy he has from his kinetic energy at 
zero height, to the final mechanical energy he has by moving through 


distance d to reach height h along the hill, with h = d sin 5.00°. This is 
expressed by the equation 
Equation: 


KE; + PE; + Woe — KE; + PE:. 


Solution 

The work done by friction is again W,. = —fd; initially the potential 
energy is PE; = mg- 0 = 0 and the kinetic energy is KE; = i the 
final energy contributions are KE = 0 for the kinetic energy and 

PEs = mgh = mgd sin @ for the potential energy. 

Substituting these values gives 

Equation: 


1 
zm ete (- #4) = 0+ med sin 0. 


Solve this for d to obtain 


Equation: 
= ($)mv;? 
oT Faun sin 0 
(0.5)(65.0 kg)(6.00 m/s)? 
450 N+(65.0 kg)(9.80 m/s”) sin (5.00°) 
=) eo. alia 
Discussion 


As might have been expected, the player slides a shorter distance by 
sliding uphill. Note that the problem could also have been solved in terms 
of the forces directly and the work energy theorem, instead of using the 
potential energy. This method would have required combining the normal 
force and force of gravity vectors, which no longer cancel each other 
because they point in different directions, and friction, to find the net force. 
You could then use the net force and the net work to find the distance d 
that reduces the kinetic energy to zero. By applying conservation of energy 
and using the potential energy instead, we need only consider the 
gravitational potential energy mgh, without combining and resolving force 
vectors. This simplifies the solution considerably. 


Note: 

Making Connections: Take-Home Investigation—Determining Friction 
from the Stopping Distance 

This experiment involves the conversion of gravitational potential energy 
into thermal energy. Use the ruler, book, and marble from Take-Home 
Investigation—Converting Potential to Kinetic Energy. In addition, you 
will need a foam cup with a small hole in the side, as shown in [link]. From 
the 10-cm position on the ruler, let the marble roll into the cup positioned 
at the bottom of the ruler. Measure the distance d the cup moves before 
stopping. What forces caused it to stop? What happened to the kinetic 
energy of the marble at the bottom of the ruler? Next, place the marble at 
the 20-cm and the 30-cm positions and again measure the distance the cup 
moves after the marble enters it. Plot the distance the cup moves versus the 
initial marble position on the ruler. Is this relationship linear? 

With some simple assumptions, you can use these data to find the 
coefficient of kinetic friction 4, of the cup on the table. The force of 
friction f on the cup is j;,.N, where the normal force JN is just the weight 
of the cup plus the marble. The normal force and force of gravity do no 
work because they are perpendicular to the displacement of the cup, which 
moves horizontally. The work done by friction is fd. You will need the 
mass of the marble as well to calculate its initial kinetic energy. 

It is interesting to do the above experiment also with a steel marble (or ball 
bearing). Releasing it from the same positions on the ruler as you did with 
the glass marble, is the velocity of this steel marble the same as the 
velocity of the marble at the bottom of the ruler? Is the distance the cup 
moves proportional to the mass of the steel and glass marbles? 


Rolling a marble down a ruler into a 
foam cup. 


Note: 

PhET Explorations: The Ramp 

Explore forces, energy and work as you push household objects up and 
down a ramp. Lower and raise the ramp to see how the angle of inclination 
affects the parallel forces acting on the file cabinet. Graphs show forces, 
energy and work. 


Section Summary 


e A nonconservative force is one for which work depends on the path. 

e Friction is an example of a nonconservative force that changes 
mechanical energy into thermal energy. 

e Work W,, done by a nonconservative force changes the mechanical 
energy of a system. In equation form, W,, = AKE + APE or, 
equivalently, KE; + PE; + Wane = KE¢ + PEg¢. 

¢ When both conservative and nonconservative forces act, energy 
conservation can be applied and used to calculate motion in terms of 
the known potential energies of the conservative forces and the work 
done by nonconservative forces, instead of finding the net work from 
the net force, or having to directly apply Newton’s laws. 


Problems & Exercises 


Exercise: 


Problem: 


A 60.0-kg skier with an initial speed of 12.0 m/s coasts up a 2.50-m- 
high rise as shown in [link]. Find her final speed at the top, given that 
the coefficient of friction between her skis and the snow is 0.0800. 
(Hint: Find the distance traveled up the incline assuming a straight-line 
path as shown in the figure.) 


The skier’s initial kinetic energy 
is partially used in coasting to 
the top of a rise. 


Solution: 


9.46 m/s 

Exercise: 
Problem: 
(a) How high a hill can a car coast up (engine disengaged) if work 
done by friction is negligible and its initial speed is 110 km/h? (b) If, 
in actuality, a 750-kg car with an initial speed of 110 km/h is observed 
to coast up a hill to a height 22.0 m above its starting point, how much 


thermal energy was generated by friction? (c) What is the average 
force of friction if the hill has a slope 2.5° above the horizontal? 


Glossary 


nonconservative force 


a force whose work depends on the path followed between the given 
initial and final configurations 


friction 
the force between surfaces that opposes one sliding on the other; 
friction changes mechanical energy into thermal energy 


Conservation of Energy 


e Explain the law of the conservation of energy. 

e Describe some of the many forms of energy. 

e Define efficiency of an energy conversion process as the fraction left as 
useful energy or work, rather than being transformed, for example, into 
thermal energy. 


Law of Conservation of Energy 


Energy, as we have noted, is conserved, making it one of the most important 
physical quantities in nature. The law of conservation of energy can be stated 
as follows: 


Total energy is constant in any process. It may change in form or be 
transferred from one system to another, but the total remains the same. 


We have explored some forms of energy and some ways it can be transferred 
from one system to another. This exploration led to the definition of two major 
types of energy—mechanical energy (KE + PE) and energy transferred via 
work done by nonconservative forces (W,,.). But energy takes many other 
forms, manifesting itself in many different ways, and we need to be able to 
deal with all of these before we can write an equation for the above general 
statement of the conservation of energy. 


Other Forms of Energy than Mechanical Energy 


At this point, we deal with all other forms of energy by lumping them into a 
single group called other energy (OE). Then we can state the conservation of 
energy in equation form as 

Equation: 


KE; + PE; + Woe + OB; = KEs + PEs + OF. 
All types of energy and work can be included in this very general statement of 


conservation of energy. Kinetic energy is KE, work done by a conservative 
force is represented by PE, work done by nonconservative forces is W,-, and 


all other energies are included as OE. This equation applies to all previous 
examples; in those situations OE was constant, and so it subtracted out and 
was not directly considered. 


Note: 

Making Connections: Usefulness of the Energy Conservation Principle 

The fact that energy is conserved and has many forms makes it very 
important. You will find that energy is discussed in many contexts, because it 
is involved in all processes. It will also become apparent that many situations 
are best understood in terms of energy and that problems are often most 
easily conceptualized and solved by considering energy. 


When does OF play a role? One example occurs when a person eats. Food is 
oxidized with the release of carbon dioxide, water, and energy. Some of this 
chemical energy is converted to kinetic energy when the person moves, to 
potential energy when the person changes altitude, and to thermal energy 
(another form of OF). 


Some of the Many Forms of Energy 


What are some other forms of energy? You can probably name a number of 
forms of energy not yet discussed. Many of these will be covered in later 
chapters, but let us detail a few here. Electrical energy is a common form that 
is converted to many other forms and does work in a wide range of practical 
situations. Fuels, such as gasoline and food, carry chemical energy that can 
be transferred to a system through oxidation. Chemical fuel can also produce 
electrical energy, such as in batteries. Batteries can in turn produce light, 
which is a very pure form of energy. Most energy sources on Earth are in fact 
stored energy from the energy we receive from the Sun. We sometimes refer 
to this as radiant energy, or electromagnetic radiation, which includes visible 
light, infrared, and ultraviolet radiation. Nuclear energy comes from 
processes that convert measurable amounts of mass into energy. Nuclear 
energy is transformed into the energy of sunlight, into electrical energy in 
power plants, and into the energy of the heat transfer and blast in weapons. 


Atoms and molecules inside all objects are in random motion. This internal 
mechanical energy from the random motions is called thermal energy, 
because it is related to the temperature of the object. These and all other forms 
of energy can be converted into one another and can do work. 


[link] gives the amount of energy stored, used, or released from various 
objects and in various phenomena. The range of energies and the variety of 
types and situations is impressive. 


Note: 

Problem-Solving Strategies for Energy 

You will find the following problem-solving strategies useful whenever you 
deal with energy. The strategies help in organizing and reinforcing energy 
concepts. In fact, they are used in the examples presented in this chapter. The 
familiar general problem-solving strategies presented earlier—involving 
identifying physical principles, knowns, and unknowns, checking units, and 
so on—continue to be relevant here. 

Step 1. Determine the system of interest and identify what information is 
given and what quantity is to be calculated. A sketch will help. 

Step 2. Examine all the forces involved and determine whether you know or 
are given the potential energy from the work done by the forces. Then use 
step 3 or step 4. 

Step 3. If you know the potential energies for the forces that enter into the 
problem, then forces are all conservative, and you can apply conservation of 
mechanical energy simply in terms of potential and kinetic energy. The 
equation expressing conservation of energy is 

Equation: 


KE; + PE; = KE¢ + PEs. 


Step 4. If you know the potential energy for only some of the forces, possibly 
because some of them are nonconservative and do not have a potential 
energy, or if there are other energies that are not easily treated in terms of 
force and work, then the conservation of energy law in its most general form 
must be used. 

Equation: 


KE; + PB; + Woe + OB; = KE¢ + PE¢ + OEFs. 


In most problems, one or more of the terms is zero, simplifying its solution. 
Do not calculate W,, the work done by conservative forces; it is already 
incorporated in the PE terms. 

Step 5. You have already identified the types of work and energy involved (in 
step 2). Before solving for the unknown, eliminate terms wherever possible to 
simplify the algebra. For example, choose h = 0 at either the initial or final 
point, so that PE, is zero there. Then solve for the unknown in the customary 
manner. 

Step 6. Check the answer to see if it is reasonable. Once you have solved a 
problem, reexamine the forms of work and energy to see if you have set up 
the conservation of energy equation correctly. For example, work done 
against friction should be negative, potential energy at the bottom of a hill 
should be less than that at the top, and so on. Also check to see that the 
numerical value obtained is reasonable. For example, the final speed of a 
skateboarder who coasts down a 3-m-high ramp could reasonably be 20 
km/h, but not 80 km/h. 


Transformation of Energy 


The transformation of energy from one form into others is happening all the 
time. The chemical energy in food is converted into thermal energy through 
metabolism; light energy is converted into chemical energy through 
photosynthesis. In a larger example, the chemical energy contained in coal is 
converted into thermal energy as it burns to turn water into steam in a boiler. 
This thermal energy in the steam in turn is converted to mechanical energy as 
it spins a turbine, which is connected to a generator to produce electrical 
energy. (In all of these examples, not all of the initial energy is converted into 
the forms mentioned. This important point is discussed later in this section.) 


Another example of energy conversion occurs in a solar cell. Sunlight 
impinging on a solar cell (see [link]) produces electricity, which in turn can be 
used to run an electric motor. Energy is converted from the primary source of 
solar energy into electrical energy and then into mechanical energy. 


Solar energy is converted into 
electrical energy by solar cells, 
which is used to run a motor in 

this solar-power aircraft. (credit: 
NASA) 


Object/phenomenon 


Big Bang 


Energy released in a supernova 


Fusion of all the hydrogen in Earth’s oceans 


Annual world energy use 


Energy in joules 


10° 


10= 


1034 


4x 102° 


Object/phenomenon 


Large fusion bomb (9 megaton) 


1 kg hydrogen (fusion to helium) 


1 kg uranium (nuclear fission) 


Hiroshima-size fission bomb (10 kiloton) 


90,000-ton aircraft carrier at 30 knots 


1 barrel crude oil 


1 ton TNT 


1 gallon of gasoline 


Daily home electricity use (developed countries) 


Daily adult food intake (recommended) 


Energy in joules 


3.8x1016 


6.4x10!4 


8.0x101° 


4.2x10!8 


1.1x10!° 


5.9x10° 


4.2x 109 


1.2x108 


7x10° 


1.210" 


Object/phenomenon Energy in joules 


1000-kg car at 90 km/h 


3.1x10° 
1 g fat (9.3 kcal) 3.9x104 
ATP hydrolysis reaction 3.2104 
1 g carbohydrate (4.1 kcal) 1.7104 
1 g protein (4.1 kcal) 1.7x104 
Tennis ball at 100 km/h 22 
Mosquito (10 g at 0.5 m/s) 1.3x10~° 
Single electron in a TV tube beam 4.0x10-% 
Energy to break one DNA strand 10-19 


Energy of Various Objects and Phenomena 


Efficiency 


Even though energy is conserved in an energy conversion process, the output 
of useful energy or work will be less than the energy input. The efficiency Eff 
of an energy conversion process is defined as 

Equation: 


Effici (Eff) useful energy or work output Woy 
icienc ee 
total energy input Ein 


[link] lists some efficiencies of mechanical devices and human activities. In a 
coal-fired power plant, for example, about 40% of the chemical energy in the 
coal becomes useful electrical energy. The other 60% transforms into other 
(perhaps less useful) energy forms, such as thermal energy, which is then 
released to the environment through combustion gases and cooling towers. 


Efficiency (%)[footnote} 


Activity/device Representative values 
Cycling and climbing 20 

Swimming, surface 2 

Swimming, submerged 4 

Shoveling 3 

Weightlifting 9 

Steam engine 17 


Gasoline engine 30 


Efficiency (%)[footnote} 


Activity/device Representative values 
Diesel engine 35 
Nuclear power plant 35 
Coal power plant 42 
Electric motor 98 
Compact fluorescent light 20 
Gas heater (residential) 90 
Solar cell 10 


Efficiency of the Human Body and Mechanical Devices 


Note: 

PhET Explorations: Masses and Springs 

A realistic mass and spring laboratory. Hang masses from springs and adjust 
the spring stiffness and damping. You can even slow time. Transport the lab 
to different planets. A chart shows the kinetic, potential, and thermal energies 
for each spring. 
https://phet.colorado.edu/sims/mass-spring-lab/mass-spring-lab_en. html 


Section Summary 


e The law of conservation of energy states that the total energy is constant 
in any process. Energy may change in form or be transferred from one 
system to another, but the total remains the same. 

e When all forms of energy are considered, conservation of energy is 
written in equation form as 


KE; + PE; + Wae + OB; = KE¢ + PEs + OE, where OB is all 
other forms of energy besides mechanical energy. 

e Commonly encountered forms of energy include electric energy, 
chemical energy, radiant energy, nuclear energy, and thermal energy. 

e Energy is often utilized to do work, but it is not possible to convert all the 
energy of a system to work. 

e The efficiency Eff of a machine or human is defined to be Eff = a, 


where Wout is useful work output and £;, is the energy consumed. 


Conceptual Questions 


Exercise: 


Problem: 


Consider the following scenario. A car for which friction is not negligible 
accelerates from rest down a hill, running out of gasoline after a short 
distance. The driver lets the car coast farther down the hill, then up and 
over a small crest. He then coasts down that hill into a gas station, where 
he brakes to a stop and fills the tank with gasoline. Identify the forms of 
energy the car has, and how they are changed and transferred in this 


series of events. (See [link].) 
Coasts Down 
Hill 
Coasts Up 
Over Crest 
Coasts Down 
Hill 
Stops for 
Gasoline 


A 


A car experiencing non-negligible friction coasts down a 
hill, over a small crest, then downhill again, and comes 
to a stop at a gas station. 


Exercise: 
Problem: 
Describe the energy transfers and transformations for a javelin, starting 


from the point at which an athlete picks up the javelin and ending when 
the javelin is stuck into the ground after being thrown. 


Exercise: 
Problem: 
Do devices with efficiencies of less than one violate the law of 
conservation of energy? Explain. 

Exercise: 
Problem: 
List four different forms or types of energy. Give one example of a 
conversion from each of these forms to another form. 


Exercise: 


Problem: List the energy conversions that occur when riding a bicycle. 


Problems & Exercises 


Exercise: 
Problem: 
Using values from [link], how many DNA molecules could be broken by 
the energy carried by a single electron in the beam of an old-fashioned 
TV tube? (These electrons were not dangerous in themselves, but they 


did create dangerous x rays. Later model tube TVs had shielding that 
absorbed x rays before they escaped and exposed viewers.) 


Solution: 


4x10* molecules 


Exercise: 


Problem: 


Using energy considerations and assuming negligible air resistance, show 
that a rock thrown from a bridge 20.0 m above water with an initial speed 
of 15.0 m/s strikes the water with a speed of 24.8 m/s independent of the 
direction thrown. 


Solution: 


Equating APE, and AKE, we obtain 
v = \/2gh + vo? = 1/ 2(9.80 m/s”) (20.0 m) + (15.0 m/s)? = 24.8 m/s 
Exercise: 


Problem: 


If the energy in fusion bombs were used to supply the energy needs of the 
world, how many of the 9-megaton variety would be needed for a year’s 
supply of energy (using data from [link])? This is not as far-fetched as it 
may sound—there are thousands of nuclear bombs, and their energy can 
be trapped in underground explosions and converted to electricity, as 
natural geothermal energy is. 


Exercise: 


Problem: 


(a) Use of hydrogen fusion to supply energy is a dream that may be 
realized in the next century. Fusion would be a relatively clean and 
almost limitless supply of energy, as can be seen from [link]. To illustrate 
this, calculate how many years the present energy needs of the world 
could be supplied by one millionth of the oceans’ hydrogen fusion 
energy. (b) How does this time compare with historically significant 
events, such as the duration of stable economic systems? 


Solution: 


(a) 25 x 10° years 


(b) This is much, much longer than human time scales. 


Glossary 


law of conservation of energy 
the general law that total energy is constant in any process; energy may 
change in form or be transferred from one system to another, but the total 
remains the same 


electrical energy 
the energy carried by a flow of charge 


chemical energy 
the energy in a substance stored in the bonds between atoms and 
molecules that can be released in a chemical reaction 


radiant energy 
the energy carried by electromagnetic waves 


nuclear energy 
energy released by changes within atomic nuclei, such as the fusion of 
two light nuclei or the fission of a heavy nucleus 


thermal energy 
the energy within an object due to the random motion of its atoms and 
molecules that accounts for the object's temperature 


efficiency 
a measure of the effectiveness of the input of energy to do work; useful 
energy or work divided by the total input of energy 


Power 


e Calculate power by calculating changes in energy over time. 
e Examine power consumption and calculations of the cost of energy 
consumed. 


What is Power? 


Power—the word conjures up many images: a professional football player 
muscling aside his opponent, a dragster roaring away from the starting line, 
a volcano blowing its lava into the atmosphere, or a rocket blasting off, as 
in [link]. 


This powerful rocket on the 
Space Shuttle Endeavor did 
work and consumed energy at a 
very high rate. (credit: NASA) 


These images of power have in common the rapid performance of work, 
consistent with the scientific definition of power (P) as the rate at which 
work is done. 


Note: 

Power 

Power is the rate at which work is done. 
Equation: 


W 
P= — 

t 
The SI unit for power is the watt (W), where 1 watt equals 1 joule/second 
(iW = 1 J/s). 


Because work is energy transfer, power is also the rate at which energy is 
expended. A 60-W light bulb, for example, expends 60 J of energy per 
second. Great power means a large amount of work or energy developed in 
a short time. For example, when a powerful car accelerates rapidly, it does a 
large amount of work and consumes a large amount of fuel in a short time. 


Calculating Power from Energy 


Example: 

Calculating the Power to Climb Stairs 

What is the power output for a 60.0-kg woman who runs up a 3.00 m high 
flight of stairs in 3.50 s, starting from rest but having a final speed of 2.00 
m/s? (See [link].) 


When this woman runs upstairs 
starting from rest, she converts the 
chemical energy originally from 
food into kinetic energy and 
gravitational potential energy. Her 
power output depends on how fast 
she does this. 


Strategy and Concept 
The work going into mechanical energy is W= KE + PE. At the bottom 
of the stairs, we take both KE and PE, as initially zero; thus, 


W = KE; + PE, = smu? + mgh, where h is the vertical height of the 
Stairs. Because all terms are given, we can calculate W and then divide it 
by time to get power. 

Solution 

Substituting the expression for W into the definition of power given in the 
previous equation, P = W//t yields 

Equation: 


WwW +muv_” + mgh 


[= 
t t 


Entering known values yields 


Equation: 


0.5(60.0 kg)(2.00 m/s)?+(60.0 kg) (9.80 m/s”) (3.00 m) 


ee 3.50 s 
120 J-+1764 J 
3.50 s 


= 538 W. 


Discussion 

The woman does 1764 J of work to move up the stairs compared with only 
120 J to increase her kinetic energy; thus, most of her power output is 
required for climbing rather than accelerating. 


It is impressive that this woman’s useful power output is slightly less than 1 
horsepower (1 hp = 746 W)! People can generate more than a 
horsepower with their leg muscles for short periods of time by rapidly 
converting available blood sugar and oxygen into work output. (A horse can 
put out 1 hp for hours on end.) Once oxygen is depleted, power output 
decreases and the person begins to breathe rapidly to obtain oxygen to 
metabolize more food—this is known as the aerobic stage of exercise. If the 
woman Climbed the stairs slowly, then her power output would be much 
less, although the amount of work done would be the same. 


Note: 

Making Connections: Take-Home Investigation—Measure Your Power 
Rating 

Determine your own power rating by measuring the time it takes you to 
climb a flight of stairs. We will ignore the gain in kinetic energy, as the 
above example showed that it was a small portion of the energy gain. Don’t 
expect that your output will be more than about 0.5 hp. 


Examples of Power 


Examples of power are limited only by the imagination, because there are 
as many types as there are forms of work and energy. (See [link] for some 
examples.) Sunlight reaching Earth’s surface carries a maximum power of 
about 1.3 kilowatts per square meter (kW /m/’). A tiny fraction of this is 
retained by Earth over the long term. Our consumption rate of fossil fuels is 
far greater than the rate at which they are stored, so it is inevitable that they 
will be depleted. Power implies that energy is transferred, perhaps changing 
form. It is never possible to change one form completely into another 
without losing some of it as thermal energy. For example, a 60-W 
incandescent bulb converts only 5 W of electrical power to light, with 55 W 
dissipating into thermal energy. Furthermore, the typical electric power 
plant converts only 35 to 40% of its fuel into electricity. The remainder 
becomes a huge amount of thermal energy that must be dispersed as heat 
transfer, as rapidly as it is created. A coal-fired power plant may produce 
1000 megawatts; 1 megawatt (MW) is 10° W of electric power. But the 
power plant consumes chemical energy at a rate of about 2500 MW, 
creating heat transfer to the surroundings at a rate of 1500 MW. (See [link].) 


Tremendous amounts of electric 


power are generated by coal- 
fired power plants such as this 
one in China, but an even larger 
amount of power goes into heat 
transfer to the surroundings. 


The large cooling towers here 
are needed to transfer heat as 
rapidly as it is produced. The 
transfer of heat is not unique to 
coal plants but is an 
unavoidable consequence of 
generating electric power from 
any fuel—nuclear, coal, oil, 
natural gas, or the like. (credit: 
Kleinolive, Wikimedia 
Commons) 


Object or Phenomenon 


Supernova (at peak) 


Milky Way galaxy 


Crab Nebula pulsar 


The Sun 


Power in 
Watts 


5x 103” 


103” 


1028 


4x 1076 


Object or Phenomenon 


Volcanic eruption (maximum) 


Lightning bolt 


Nuclear power plant (total electric and heat 
transfer) 


Aircraft carrier (total useful and heat transfer) 


Dragster (total useful and heat transfer) 


Car (total useful and heat transfer) 


Football player (total useful and heat transfer) 


Clothes dryer 


Person at rest (all heat transfer) 


Power in 
Watts 


4x10! 


2x10 


3x10? 


108 


2x10 


8x104 


510° 


4x 10° 


100 


Power in 
Object or Phenomenon Watts 


Typical incandescent light bulb (total useful and 


heat transfer) ee 
Heart, person at rest (total useful and heat transfer) 8 
Electric clock 3 
Pocket calculator 10° 


Power Output or Consumption 


Power and Energy Consumption 


We usually have to pay for the energy we use. It is interesting and easy to 
estimate the cost of energy for an electrical appliance if its power 
consumption rate and time used are known. The higher the power 
consumption rate and the longer the appliance is used, the greater the cost 
of that appliance. The power consumption rate is P = W/t = E/t, where 
F is the energy supplied by the electricity company. So the energy 
consumed over a time ¢ is 

Equation: 


B= Pt. 


Electricity bills state the energy used in units of kilowatt-hours (kW - h), 
which is the product of power in kilowatts and time in hours. This unit is 
convenient because electrical power consumption at the kilowatt level for 
hours at a time is typical. 


Example: 

Calculating Energy Costs 

What is the cost of running a 0.200-kW computer 6.00 h per day for 30.0 d 
if the cost of electricity is $0.120 per kW - h? 

Strategy 

Cost is based on energy consumed; thus, we must find & from & = Pt and 
then calculate the cost. Because electrical energy is expressed in kW - h, at 
the start of a problem such as this it is convenient to convert the units into 
kW and hours. 

Solution 

The energy consumed in kW - h is 

Equation: 


E = Pt =(0.200kW)(6.00 h/d)(30.0d) 


36.0 kW -h, 


and the cost is simply given by 
Equation: 


cost = (36.0 kW - h)($0.120 per kW - h) = $4.32 per month. 


Discussion 

The cost of using the computer in this example is neither exorbitant nor 
negligible. It is clear that the cost is a combination of power and time. 
When both are high, such as for an air conditioner in the summer, the cost 
is high. 


The motivation to save energy has become more compelling with its ever- 
increasing price. Armed with the knowledge that energy consumed is the 
product of power and time, you can estimate costs for yourself and make 
the necessary value judgments about where to save energy. Either power or 
time must be reduced. It is most cost-effective to limit the use of high- 
power devices that normally operate for long periods of time, such as water 
heaters and air conditioners. This would not include relatively high power 
devices like toasters, because they are on only a few minutes per day. It 
would also not include electric clocks, in spite of their 24-hour-per-day 


usage, because they are very low power devices. It is sometimes possible to 
use devices that have greater efficiencies—that is, devices that consume 
less power to accomplish the same task. One example is the compact 
fluorescent light bulb, which produces over four times more light per watt 
of power consumed than its incandescent cousin. 


Modern civilization depends on energy, but current levels of energy 
consumption and production are not sustainable. The likelihood of a link 
between global warming and fossil fuel use (with its concomitant 
production of carbon dioxide), has made reduction in energy use as well as 
a shift to non-fossil fuels of the utmost importance. Even though energy in 
an isolated system is a conserved quantity, the final result of most energy 
transformations is waste heat transfer to the environment, which is no 
longer useful for doing work. As we will discuss in more detail in 
Thermodynamics, the potential for energy to produce useful work has been 
“degraded” in the energy transformation. 


Section Summary 


e Power is the rate at which work is done, or in equation form, for the 
average power P for work W done over a time t, P = W//t. 

e The SI unit for power is the watt (W), where 1 W = 1 J/s. 

e The power of many devices such as electric motors is also often 
expressed in horsepower (hp), where 1 hp = 746 W. 


Conceptual Questions 


Exercise: 
Problem: 
Most electrical appliances are rated in watts. Does this rating depend 


on how long the appliance is on? (When off, it is a zero-watt device.) 
Explain in terms of the definition of power. 


Exercise: 


Problem: 


Explain, in terms of the definition of power, why energy consumption 
is sometimes listed in kilowatt-hours rather than joules. What is the 
relationship between these two energy units? 


Exercise: 
Problem: 
A spark of static electricity, such as that you might receive from a 


doorknob on a cold dry day, may carry a few hundred watts of power. 
Explain why you are not injured by such a spark. 


Problems & Exercises 


Exercise: 


Problem: 


The Crab Nebula (see [link]) pulsar is the remnant of a supernova that 
occurred in A.D. 1054. Using data from [link], calculate the 
approximate factor by which the power output of this astronomical 
object has declined since its explosion. 


Crab Nebula (credit: ESO, via 
Wikimedia Commons) 


Solution: 
Equation: 


2x10~1° 


Exercise: 


Problem: 


Suppose a star 1000 times brighter than our Sun (that is, emitting 1000 
times the power) suddenly goes supernova. Using data from [link]: (a) 
By what factor does its power output increase? (b) How many times 
brighter than our entire Milky Way galaxy is the supernova? (c) Based 
on your answers, discuss whether it should be possible to observe 
supernovas in distant galaxies. Note that there are on the order of 101! 
observable galaxies, the average brightness of which is somewhat less 
than our own galaxy. 


Exercise: 


Problem: 


A person in good physical condition can put out 100 W of useful 
power for several hours at a stretch, perhaps by pedaling a mechanism 
that drives an electric generator. Neglecting any problems of generator 
efficiency and practical considerations such as resting time: (a) How 
many people would it take to run a 4.00-kW electric clothes dryer? (b) 
How many people would it take to replace a large electric power plant 
that generates 800 MW? 


Solution: 


(a) 40 


(b) 8 million 
Exercise: 
Problem: 
What is the cost of operating a 3.00-W electric clock for a year if the 
cost of electricity is $0.0900 per kW - h? 
Exercise: 
Problem: 
A large household air conditioner may consume 15.0 kW of power. 


What is the cost of operating this air conditioner 3.00 h per day for 
30.0 d if the cost of electricity is $0.110 per kW - h? 


Solution: 


$149 
Exercise: 
Problem: 
(a) What is the average power consumption in watts of an appliance 


that uses 5.00 kW - h of energy per day? (b) How many joules of 
energy does this appliance consume in a year? 


Exercise: 
Problem: 
(a) What is the average useful power output of a person who does 
6.00 10° J of useful work in 8.00 h? (b) Working at this rate, how 
long will it take this person to lift 2000 kg of bricks 1.50 m to a 


platform? (Work done to lift his body can be omitted because it is not 
considered useful output here.) 


Solution: 


(a) 208 W 


(b) 141s 
Exercise: 
Problem: 
A 500-kg dragster accelerates from rest to a final speed of 110 m/s in 
400 m (about a quarter of a mile) and encounters an average frictional 


force of 1200 N. What is its average power output in watts and 
horsepower if this takes 7.30 s? 


Exercise: 
Problem: 
(a) How long will it take an 850-kg car with a useful power output of 
40.0 hp (1 hp = 746 W) to reach a speed of 15.0 m/s, neglecting 


friction? (b) How long will this acceleration take if the car also climbs 
a 3.00-m-high hill in the process? 


Solution: 
(a) 3.20 s 


(b) 4.04 s 
Exercise: 


Problem: 


(a) Find the useful power output of an elevator motor that lifts a 2500- 
kg load a height of 35.0 m in 12.0, if it also increases the speed from 
rest to 4.00 m/s. Note that the total mass of the counterbalanced system 
is 10,000 kg—so that only 2500 kg is raised in height, but the full 
10,000 kg is accelerated. (b) What does it cost, if electricity is $0.0900 
per kW - h? 


Exercise: 


Problem: 


(a) What is the available energy content, in joules, of a battery that 
operates a 2.00-W electric clock for 18 months? (b) How long can a 
battery that can supply 8.00104 J run a pocket calculator that 
consumes energy at the rate of 1.00x10-° W? 


Solution: 
(a) 9.4610" J 


(b) 2.54 y 
Exercise: 


Problem: 


(a) How long would it take a 1.50 10°-kg airplane with engines that 
produce 100 MW of power to reach a speed of 250 m/s and an altitude 
of 12.0 km if air resistance were negligible? (b) If it actually takes 900 
s, what is the power? (c) Given this power, what is the average force of 
air resistance if the airplane takes 1200 s? (Hint: You must find the 
distance the plane travels in 1200 s assuming constant acceleration.) 


Exercise: 
Problem: 
Calculate the power output needed for a 950-kg car to climb a 2.00° 
slope at a constant 30.0 m/s while encountering wind resistance and 


friction totaling 600 N. Explicitly show how you follow the steps in 
the Problem-Solving Strategies for Energy. 


Solution: 


Identify knowns: m = 950 kg, slope angle 0 = 2.00°, v = 3.00 m/s, 
f =600N 


Identify unknowns: power FP of the car, force F’ that car applies to road 


Solve for unknown: 


al ee Ma A ela 


where F' is parallel to the incline and must oppose the resistive forces 
and the force of gravity: 


F = f+w=600N-+ mg sin 0 

Insert this into the expression for power and solve: 

P = (f+mg sin 6)v 

[600 N + (950 kg) (9.80 m/s”) sin 2°| (30.0 m/s) 


= 2.77x104°W 


About 28 kW (or about 37 hp) is reasonable for a car to climb a gentle 
incline. 


Exercise: 


Problem: 


(a) Calculate the power per square meter reaching Earth’s upper 
atmosphere from the Sun. (Take the power output of the Sun to be 
4.00 x 107° W.) (b) Part of this is absorbed and reflected by the 
atmosphere, so that a maximum of 1.30 kW/ m? reaches Earth’s 
surface. Calculate the area in km? of solar energy collectors needed to 
replace an electric power plant that generates 750 MW if the collectors 
convert an average of 2.00% of the maximum power into electricity. 
(This small conversion efficiency is due to the devices themselves, and 
the fact that the sun is directly overhead only briefly.) With the same 
assumptions, what area would be needed to meet the United States’ 
energy needs (1.05 x 107° J)? Australia’s energy needs 

(5.4 x 1018 J)? China’s energy needs (6.3 x 10'° J)? (These energy 
consumption values are from 2006.) 


Glossary 


power 
the rate at which work is done 


watt 
(W) SI unit of power, with 1 W = 1 J/s 


horsepower 
an older non-SI unit of power, with 1 hp = 746 W 


kilowatt-hour 
(kW - h) unit used primarily for electrical energy provided by electric 


utility companies 


Work, Energy, and Power in Humans 


e Explain the human body’s consumption of energy when at rest vs. 
when engaged in activities that do useful work. 
¢ Calculate the conversion of chemical energy in food into useful work. 


Energy Conversion in Humans 


Our own bodies, like all living organisms, are energy conversion machines. 
Conservation of energy implies that the chemical energy stored in food is 
converted into work, thermal energy, and/or stored as chemical energy in 
fatty tissue. (See [link].) The fraction going into each form depends both on 
how much we eat and on our level of physical activity. If we eat more than 
is needed to do work and stay warm, the remainder goes into body fat. 


W,. (negative) 


eo OE, 


Stored 
fat 


OE, 
Food 
energy 


OE; + Wac = OE, 


Energy consumed by 
humans is converted to 
work, thermal energy, and 
stored fat. By far the largest 
fraction goes to thermal 
energy, although the fraction 
varies depending on the type 
of physical activity. 


Power Consumed at Rest 


The rate at which the body uses food energy to sustain life and to do 
different activities is called the metabolic rate. The total energy conversion 
rate of a person at rest is called the basal metabolic rate (BMR) and is 
divided among various systems in the body, as shown in [link]. The largest 
fraction goes to the liver and spleen, with the brain coming next. Of course, 
during vigorous exercise, the energy consumption of the skeletal muscles 
and heart increase markedly. About 75% of the calories burned in a day go 
into these basic functions. The BMR is a function of age, gender, total body 
weight, and amount of muscle mass (which burns more calories than body 
fat). Athletes have a greater BMR due to this last factor. 


Power Oxygen 

consumed at consumption Percent 
Organ rest (W) (mL/min) of BMR 
Liver & 23 67 97 
spleen 
Brain 16 47 19 
Skeletal 15 45 18 
muscle 
Kidney 9 26 10 
Heart 6 17 7 
Other 16 48 19 


Totals 85 W 250 mL/min 100% 


Basal Metabolic Rates (BMR) 


Energy consumption is directly proportional to oxygen consumption 
because the digestive process is basically one of oxidizing food. We can 
measure the energy people use during various activities by measuring their 
oxygen use. (See [link].) Approximately 20 kJ of energy are produced for 
each liter of oxygen consumed, independent of the type of food. [link] 
shows energy and oxygen consumption rates (power expended) for a variety 
of activities. 


Power of Doing Useful Work 


Work done by a person is sometimes called useful work, which is work 
done on the outside world, such as lifting weights. Useful work requires a 
force exerted through a distance on the outside world, and so it excludes 
internal work, such as that done by the heart when pumping blood. Useful 
work does include that done in climbing stairs or accelerating to a full run, 
because these are accomplished by exerting forces on the outside world. 
Forces exerted by the body are nonconservative, so that they can change the 
mechanical energy (KE + PE) of the system worked upon, and this is 
often the goal. A baseball player throwing a ball, for example, increases 
both the ball’s kinetic and potential energy. 


If a person needs more energy than they consume, such as when doing 
vigorous work, the body must draw upon the chemical energy stored in fat. 
So exercise can be helpful in losing fat. However, the amount of exercise 
needed to produce a loss in fat, or to burn off extra calories consumed that 
day, can be large, as [link] illustrates. 


Example: 

Calculating Weight Loss from Exercising 

If a person who normally requires an average of 12,000 kJ (3000 kcal) of 
food energy per day consumes 13,000 kJ per day, he will steadily gain 
weight. How much bicycling per day is required to work off this extra 
1000 kJ? 


Solution 
[link] states that 400 W are used when cycling at a moderate speed. The 
time required to work off 1000 kJ at this rate is then 


Equation: 
; energy 1000 kJ 
Time — (25) = “400 W = 2500 s = 42 min. 
time 
Discussion 


If this person uses more energy than he or she consumes, the person’s body 
will obtain the needed energy by metabolizing body fat. If the person uses 
13,000 kJ but consumes only 12,000 kJ, then the amount of fat loss will be 
Equation: 


1.0 g fat 
Fat loss = (1000 kJ) see) = 26g, 


assuming the energy content of fat to be 39 kJ/g. 


A pulse oxymeter is an 
apparatus that measures the 
amount of oxygen in blood. 

Oxymeters can be used to 
determine a person’s metabolic 
rate, which is the rate at which 

food energy is converted to 
another form. Such 


measurements can indicate the 
level of athletic conditioning as 
well as certain medical 
problems. (credit: UusiAjaja, 
Wikimedia Commons) 


Activity 
Sleeping 
Sitting at rest 


Standing 
relaxed 


Sitting in class 


Walking (5 
km/h) 


Cycling (13-18 
km/h) 


Shivering 


Playing tennis 


Energy 
consumption in 
watts 


83 


120 


125 


210 


280 


400 


425 


440 


Oxygen consumption 


in liters O2/min 
0.24 


0.34 


0.36 


0.60 


0.80 


1.14 


1.21 


1.26 


Energy 


consumption in Oxygen consumption 
Activity watts in liters O,/min 
“wie 475 1.36 
breaststroke 
Ice skating 
(14.5 km/h) 545 1.56 
Climbing stairs 
(116/min) eee ae 
Cycling (21 
km/h) 700 2.00 
Running cross- 7A0 2.10 
country 
Playing 
basketball sic ca 
Cycling, 
professional 1855 5.30 
racer 
Sprinting 2415 6.90 


Energy and Oxygen Consumption Rates| footnote | (Power) 
for an average 76-kg male 


All bodily functions, from thinking to lifting weights, require energy. (See 
[link].) The many small muscle actions accompanying all quiet activity, 
from sleeping to head scratching, ultimately become thermal energy, as do 
less visible muscle actions by the heart, lungs, and digestive tract. 
Shivering, in fact, is an involuntary response to low body temperature that 
pits muscles against one another to produce thermal energy in the body (and 


do no work). The kidneys and liver consume a surprising amount of energy, 
but the biggest surprise of all is that a full 25% of all energy consumed by 
the body is used to maintain electrical potentials in all living cells. (Nerve 
cells use this electrical potential in nerve impulses.) This bioelectrical 
energy ultimately becomes mostly thermal energy, but some is utilized to 
power chemical processes such as in the kidneys and liver, and in fat 
production. 


This fMRI scan shows an 
increased level of energy 
consumption in the vision 
center of the brain. Here, 
the patient was being 
asked to recognize faces. 
(credit: NIH via 
Wikimedia Commons) 


Section Summary 


e The human body converts energy stored in food into work, thermal 
energy, and/or chemical energy that is stored in fatty tissue. 

e The rate at which the body uses food energy to sustain life and to do 
different activities is called the metabolic rate, and the corresponding 
rate when at rest is called the basal metabolic rate (BMR) 

e The energy included in the basal metabolic rate is divided among 
various systems in the body, with the largest fraction going to the liver 
and spleen, and the brain coming next. 

e About 75% of food calories are used to sustain basic body functions 
included in the basal metabolic rate. 

e The energy consumption of people during various activities can be 
determined by measuring their oxygen use, because the digestive 
process is basically one of oxidizing food. 


Conceptual Questions 


Exercise: 
Problem: 
Explain why it is easier to climb a mountain on a zigzag path rather 
than one straight up the side. Is your increase in gravitational potential 


energy the same in both cases? Is your energy consumption the same 
in both? 


Exercise: 
Problem: 
Do you do work on the outside world when you rub your hands 
together to warm them? What is the efficiency of this activity? 
Exercise: 
Problem: 
Shivering is an involuntary response to lowered body temperature. 


What is the efficiency of the body when shivering, and is this a 
desirable value? 


Exercise: 


Problem: 


Discuss the relative effectiveness of dieting and exercise in losing 
weight, noting that most athletic activities consume food energy at a 
rate of 400 to 500 W, while a single cup of yogurt can contain 1360 kJ 
(325 kcal). Specifically, is it likely that exercise alone will be sufficient 
to lose weight? You may wish to consider that regular exercise may 
increase the metabolic rate, whereas protracted dieting may reduce it. 


Problems & Exercises 


Exercise: 
Problem: 
(a) How long can you rapidly climb stairs (116/min) on the 93.0 kcal 


of energy in a 10.0-g pat of butter? (b) How many flights is this if each 
flight has 16 stairs? 


Solution: 
(a) 9.5 min 


(b) 69 flights of stairs 
Exercise: 
Problem: 
(a) What is the power output in watts and horsepower of a 70.0-kg 
sprinter who accelerates from rest to 10.0 m/s in 3.00 s? (b) 


Considering the amount of power generated, do you think a well- 
trained athlete could do this repetitively for long periods of time? 


Exercise: 


Problem: 


Calculate the power output in watts and horsepower of a shot-putter 
who takes 1.20 s to accelerate the 7.27-kg shot from rest to 14.0 m/s, 
while raising it 0.800 m. (Do not include the power produced to 
accelerate his body.) 

—_ 


Shot putter at the 
Dornoch Highland 
Gathering in 2007. 

(credit: John Haslam, 
Flickr) 


Solution: 


641 W, 0.860 hp 
Exercise: 
Problem: 
(a) What is the efficiency of an out-of-condition professor who does 
2.10 10° J of useful work while metabolizing 500 kcal of food 


energy? (b) How many food calories would a well-conditioned athlete 
metabolize in doing the same work with an efficiency of 20%? 


Exercise: 


Problem: 


Energy that is not utilized for work or heat transfer is converted to the 
chemical energy of body fat containing about 39 kJ/g. How many 
grams of fat will you gain if you eat 10,000 kJ (about 2500 kcal) one 
day and do nothing but sit relaxed for 16.0 h and sleep for the other 
8.00 h? Use data from [link] for the energy consumption rates of these 
activities. 


Solution: 


31¢g 
Exercise: 
Problem: 
Using data from [link], calculate the daily energy needs of a person 
who sleeps for 7.00 h, walks for 2.00 h, attends classes for 4.00 h, 


cycles for 2.00 h, sits relaxed for 3.00 h, and studies for 6.00 h. 
(Studying consumes energy at the same rate as sitting in class.) 


Exercise: 
Problem: 
What is the efficiency of a subject on a treadmill who puts out work at 


the rate of 100 W while consuming oxygen at the rate of 2.00 L/min? 
(Hint: See [link].) 


Solution: 


14.3% 


Exercise: 


Problem: 


Shoveling snow can be extremely taxing because the arms have such a 
low efficiency in this activity. Suppose a person shoveling a footpath 
metabolizes food at the rate of 800 W. (a) What is her useful power 
output? (b) How long will it take her to lift 3000 kg of snow 1.20 m? 
(This could be the amount of heavy snow on 20 m of footpath.) (c) 
How much waste heat transfer in kilojoules will she generate in the 
process? 


Exercise: 
Problem: 
Very large forces are produced in joints when a person jumps from 
some height to the ground. (a) Calculate the magnitude of the force 
produced if an 80.0-kg person jumps from a 0.600—m-high ledge and 
lands stiffly, compressing joint material 1.50 cm as a result. (Be certain 
to include the weight of the person.) (b) In practice the knees bend 
almost involuntarily to help extend the distance over which you stop. 


Calculate the magnitude of the force produced if the stopping distance 
is 0.300 m. (c) Compare both forces with the weight of the person. 


Solution: 
(a) 3.21x10*N 
(b) 2.35x10° N 


(c) Ratio of net force to weight of person is 41.0 in part (a); 3.00 in 
part (b) 


Exercise: 


Problem: 


Jogging on hard surfaces with insufficiently padded shoes produces 
large forces in the feet and legs. (a) Calculate the magnitude of the 
force needed to stop the downward motion of a jogger’s leg, if his leg 
has a mass of 13.0 kg, a speed of 6.00 m/s, and stops in a distance of 
1.50 cm. (Be certain to include the weight of the 75.0-kg jogger’s 
body.) (b) Compare this force with the weight of the jogger. 


Exercise: 
Problem: 
(a) Calculate the energy in kJ used by a 55.0-kg woman who does 50 
deep knee bends in which her center of mass is lowered and raised 
0.400 m. (She does work in both directions.) You may assume her 


efficiency is 20%. (b) What is the average power consumption rate in 
watts if she does this in 3.00 min? 


Solution: 
(a) 108 kJ 


(b) 599 W 
Exercise: 


Problem: 


Kanellos Kanellopoulos flew 119 km from Crete to Santorini, Greece, 
on April 23, 1988, in the Daedalus 88, an aircraft powered by a 
bicycle-type drive mechanism (see [link]). His useful power output for 
the 234-min trip was about 350 W. Using the efficiency for cycling 
from [link], calculate the food energy in kilojoules he metabolized 
during the flight. 


The Daedalus 88 in flight. 
(credit: NASA photo by 
Beasley) 


Exercise: 


Problem: 


The swimmer shown in [link] exerts an average horizontal backward 
force of 80.0 N with his arm during each 1.80 m long stroke. (a) What 
is his work output in each stroke? (b) Calculate the power output of his 
arms if he does 120 strokes per minute. 


Solution: 


(a) 144 J 


(b) 288 W 


Exercise: 


Problem: 


Mountain climbers carry bottled oxygen when at very high altitudes. 
(a) Assuming that a mountain climber uses oxygen at twice the rate for 
climbing 116 stairs per minute (because of low air temperature and 
winds), calculate how many liters of oxygen a climber would need for 
10.0 h of climbing. (These are liters at sea level.) Note that only 40% 
of the inhaled oxygen is utilized; the rest is exhaled. (b) How much 
useful work does the climber do if he and his equipment have a mass 
of 90.0 kg and he gains 1000 m of altitude? (c) What is his efficiency 
for the 10.0-h climb? 


Exercise: 


Problem: 


The awe-inspiring Great Pyramid of Cheops was built more than 4500 
years ago. Its square base, originally 230 m on a side, covered 13.1 
acres, and it was 146 m high, with a mass of about 7 x 10° kg. (The 
pyramid’s dimensions are slightly different today due to quarrying and 
some sagging.) Historians estimate that 20,000 workers spent 20 years 
to construct it, working 12-hour days, 330 days per year. (a) Calculate 
the gravitational potential energy stored in the pyramid, given its 
center of mass is at one-fourth its height. (b) Only a fraction of the 
workers lifted blocks; most were involved in support services such as 
building ramps (see [link]), bringing food and water, and hauling 
blocks to the site. Calculate the efficiency of the workers who did the 
lifting, assuming there were 1000 of them and they consumed food 
energy at the rate of 300 kcal/h. What does your answer imply about 
how much of their work went into block-lifting, versus how much 
work went into friction and lifting and lowering their own bodies? (c) 
Calculate the mass of food that had to be supplied each day, assuming 
that the average worker required 3600 kcal per day and that their diet 
was 5% protein, 60% carbohydrate, and 35% fat. (These proportions 
neglect the mass of bulk and nondigestible materials consumed.) 


Ancient pyramids were 
probably constructed using 
ramps as simple machines. 

(credit: Franck Monnier, 

Wikimedia Commons) 


Solution: 

(a) 2.50x10" J 

(b) 2.52% 

(c) 1.4x10*kg (14 metric tons) 


Exercise: 


Problem: 
(a) How long can you play tennis on the 800 kJ (about 200 kcal) of 
energy in a candy bar? (b) Does this seem like a long time? Discuss 


why exercise is necessary but may not be sufficient to cause a person 
to lose weight. 


Glossary 


metabolic rate 
the rate at which the body uses food energy to sustain life and to do 
different activities 


basal metabolic rate 
the total energy conversion rate of a person at rest 


useful work 
work done on an external system 


World Energy Use 


e Describe the distinction between renewable and nonrenewable energy sources. 
e Explain why the inevitable conversion of energy to less useful forms makes it necessary to conserve energy 
resources. 


Energy is an important ingredient in all phases of society. We live in a very interdependent world, and access to 
adequate and reliable energy resources is crucial for economic growth and for maintaining the quality of our lives. 
But current levels of energy consumption and production are not sustainable. About 40% of the world’s energy 
comes from oil, and much of that goes to transportation uses. Oil prices are dependent as much upon new (or 
foreseen) discoveries as they are upon political events and situations around the world. The U.S., with 4.5% of the 
world’s population, consumes 24% of the world’s oil production per year; 66% of that oil is imported! 


Renewable and Nonrenewable Energy Sources 


The principal energy resources used in the world are shown in [link]. The fuel mix has changed over the years but 
now is dominated by oil, although natural gas and solar contributions are increasing. Renewable forms of energy 
are those sources that cannot be used up, such as water, wind, solar, and biomass. About 85% of our energy comes 
from nonrenewable fossil fuels—oil, natural gas, coal. The likelihood of a link between global warming and fossil 
fuel use, with its production of carbon dioxide through combustion, has made, in the eyes of many scientists, a 
shift to non-fossil fuels of utmost importance—but it will not be easy. 


Petroleum: 3527 ~ 35.43% 
Coal: 2802 ~ 28.15% 
Dry natural gas: 2335 ~ 23.46% 

{) Hydro-electricity: 624~ 6.27% 


Nuclear-electricity: 576~ 5.79% 


1) Geothermal, wind, 


} ‘ 86~ 0.86% 
solar, biomass: 


1 Geothermal, biomass, 
solar not used 
for electricity: 


5~ 0.05% 


Total: 9955 


World energy consumption by source, in billions 
of kilowatt-hours: 2006. (credit: KVDP) 


The World’s Growing Energy Needs 


World energy consumption continues to rise, especially in the developing countries. (See [Link].) Global demand 
for energy has tripled in the past 50 years and might triple again in the next 30 years. While much of this growth 
will come from the rapidly booming economies of China and India, many of the developed countries, especially 
those in Europe, are hoping to meet their energy needs by expanding the use of renewable sources. Although 
presently only a small percentage, renewable energy is growing very fast, especially wind energy. For example, 
Germany plans to meet 20% of its electricity and 10% of its overall energy needs with renewable resources by the 
year 2020. (See [link].) Energy is a key constraint in the rapid economic growth of China and India. In 2003, 
China surpassed Japan as the world’s second largest consumer of oil. However, over 1/3 of this is imported. Unlike 
most Western countries, coal dominates the commercial energy resources of China, accounting for 2/3 of its energy 
consumption. In 2009 China surpassed the United States as the largest generator of CO2. In India, the main energy 
resources are biomass (wood and dung) and coal. Half of India’s oil is imported. About 70% of India’s electricity 
is generated by highly polluting coal. Yet there are sizeable strides being made in renewable energy. India has a 
rapidly growing wind energy base, and it has the largest solar cooking program in the world. 


World Energy Consumption 
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Past and projected world energy use 
(source: Based on data from U.S. 
Energy Information Administration, 
2011) 


Solar cell arrays at a power plant in 
Steindorf, Germany (credit: Michael 
Betke, Flickr) 


[link] displays the 2006 commercial energy mix by country for some of the prime energy users in the world. While 
non-renewable sources dominate, some countries get a sizeable percentage of their electricity from renewable 
resources. For example, about 67% of New Zealand’s electricity demand is met by hydroelectric. Only 10% of the 
U.S. electricity is generated by renewable resources, primarily hydroelectric. It is difficult to determine total 
contributions of renewable energy in some countries with a large rural population, so these percentages in this 
table are left blank. 


Consumption, Natural Other 
Country in EJ (1018 J) Oil Gas Coal Nuclear Hydro Renewables 


Australia 5.4 34% 17% 44% 0% 3% 1% 


Country 
Brazil 
China 
Egypt 
Germany 
India 
Indonesia 
Japan 


New 
Zealand 


Russia 
U.S. 


World 


Consumption, 
in EJ (101° J) 


9.6 
63 
2.4 
16 
15 
4.9 


24 
0.44 


31 
105 


432 


Oil 


48% 


22% 


50% 


37% 


34% 


51% 


48% 


32% 


19% 


40% 


39% 


Natural 
Gas 


7% 


3% 


41% 


24% 


7% 


26% 


14% 


26% 


53% 


23% 


23% 


Energy Consumption—Selected Countries (2006) 


Energy and Economic Well-being 


Coal 


5% 


69% 


1% 


24% 


52% 


16% 


21% 


6% 


16% 


22% 


24% 


Nuclear 


1% 


1% 


0% 


11% 


1% 


0% 


12% 


0% 


5% 


8% 


6% 


Hydro 
35% 
6% 
6% 
1% 
5% 
2% 


4% 


11% 


6% 
3% 


6% 


Other 
Renewables 


2% 


3% 


3% 


1% 


19% 


1% 


2% 


The last two columns in this table examine the energy and electricity use per capita. Economic well-being is 
dependent upon energy use, and in most countries higher standards of living, as measured by GDP (gross domestic 
product) per capita, are matched by higher levels of energy consumption per capita. This is borne out in [link]. 
Increased efficiency of energy use will change this dependency. A global problem is balancing energy resource 
development against the harmful effects upon the environment in its extraction and use. 
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Power consumption per capita versus GDP per capita for 
various countries. Note the increase in energy usage with 
increasing GDP. (2007, credit: Frank van Mierlo, 
Wikimedia Commons) 


Conserving Energy 


As we finish this chapter on energy and work, it is relevant to draw some distinctions between two sometimes 
misunderstood terms in the area of energy use. As has been mentioned elsewhere, the “law of the conservation of 
energy” is a very useful principle in analyzing physical processes. It is a statement that cannot be proven from 
basic principles, but is a very good bookkeeping device, and no exceptions have ever been found. It states that the 
total amount of energy in an isolated system will always remain constant. Related to this principle, but remarkably 
different from it, is the important philosophy of energy conservation. This concept has to do with seeking to 
decrease the amount of energy used by an individual or group through (1) reduced activities (e.g., turning down 
thermostats, driving fewer kilometers) and/or (2) increasing conversion efficiencies in the performance of a 
particular task—such as developing and using more efficient room heaters, cars that have greater miles-per-gallon 
ratings, energy-efficient compact fluorescent lights, etc. 


Since energy in an isolated system is not destroyed or created or generated, one might wonder why we need to be 
concerned about our energy resources, since energy is a conserved quantity. The problem is that the final result of 
most energy transformations is waste heat transfer to the environment and conversion to energy forms no longer 
useful for doing work. To state it in another way, the potential for energy to produce useful work has been 
“degraded” in the energy transformation. (This will be discussed in more detail in Thermodynamics.) 


Section Summary 


e The relative use of different fuels to provide energy has changed over the years, but fuel use is currently 
dominated by oil, although natural gas and solar contributions are increasing. 

e Although non-renewable sources dominate, some countries meet a sizeable percentage of their electricity 
needs from renewable resources. 

e The United States obtains only about 10% of its energy from renewable sources, mostly hydroelectric power. 

e Economic well-being is dependent upon energy use, and in most countries higher standards of living, as 
measured by GDP (Gross Domestic Product) per capita, are matched by higher levels of energy consumption 
per capita. 

e Even though, in accordance with the law of conservation of energy, energy can never be created or destroyed, 
energy that can be used to do work is always partly converted to less useful forms, such as waste heat to the 
environment, in all of our uses of energy for practical purposes. 


Conceptual Questions 


Exercise: 


Problem: 


What is the difference between energy conservation and the law of conservation of energy? Give some 
examples of each. 


Exercise: 


Problem: 


If the efficiency of a coal-fired electrical generating plant is 35%, then what do we mean when we say that 
energy is a conserved quantity? 


Problems & Exercises 


Exercise: 


Problem: Integrated Concepts 


(a) Calculate the force the woman in [link] exerts to do a push-up at constant speed, taking all data to be 
known to three digits. (b) How much work does she do if her center of mass rises 0.240 m? (c) What is her 
useful power output if she does 25 push-ups in 1 min? (Should work done lowering her body be included? 


See the discussion of useful work in Work, Energy, and Power in Humans, 
m= 50 kg 


Freaction 


Forces involved in doing 
push-ups. The woman’s 
weight acts as a force 
exerted downward on her 
center of gravity (CG). 


Solution: 
(a) 294 N 
(b) 118 J 
(c) 49.0 W 


Exercise: 


Problem: Integrated Concepts 


A 75.0-kg cross-country skier is climbing a 3.0° slope at a constant speed of 2.00 m/s and encounters air 
resistance of 25.0 N. Find his power output for work done against the gravitational force and air resistance. 
(b) What average force does he exert backward on the snow to accomplish this? (c) If he continues to exert 


this force and to experience the same air resistance when he reaches a level area, how long will it take him to 
reach a velocity of 10.0 m/s? 


Exercise: 


Problem: Integrated Concepts 


The 70.0-kg swimmer in [link] starts a race with an initial velocity of 1.25 m/s and exerts an average force of 
80.0 N backward with his arms during each 1.80 m long stroke. (a) What is his initial acceleration if water 
resistance is 45.0 N? (b) What is the subsequent average resistance force from the water during the 5.00 s it 
takes him to reach his top velocity of 2.50 m/s? (c) Discuss whether water resistance seems to increase 
linearly with velocity. 


Solution: 
(a) 0.500 m/s” 
(b) 62.5 N 


(c) Assuming the acceleration of the swimmer decreases linearly with time over the 5.00 s interval, the 
frictional force must therefore be increasing linearly with time, since f = F' — maa. If the acceleration 
decreases linearly with time, the velocity will contain a term dependent on time squared (t”). Therefore, the 
water resistance will not depend linearly on the velocity. 


Exercise: 


Problem: Integrated Concepts 


A toy gun uses a spring with a force constant of 300 N/m to propel a 10.0-g steel ball. If the spring is 
compressed 7.00 cm and friction is negligible: (a) How much force is needed to compress the spring? (b) To 
what maximum height can the ball be shot? (c) At what angles above the horizontal may a child aim to hit a 
target 3.00 m away at the same height as the gun? (d) What is the gun’s maximum range on level ground? 


Exercise: 
Problem: Integrated Concepts 


(a) What force must be supplied by an elevator cable to produce an acceleration of 0.800 m/ s° against a 200- 
N frictional force, if the mass of the loaded elevator is 1500 kg? (b) How much work is done by the cable in 
lifting the elevator 20.0 m? (c) What is the final speed of the elevator if it starts from rest? (d) How much 
work went into thermal energy? 


Solution: 
(a) 16.1 x 10° N 
(b) 3.22 x 10° J 
(c) 5.66 m/s 
(d) 4.00 kJ 
Exercise: 
Problem: Unreasonable Results 
A car advertisement claims that its 900-kg car accelerated from rest to 30.0 m/s and drove 100 km, gaining 
3.00 km in altitude, on 1.0 gal of gasoline. The average force of friction including air resistance was 700 N. 


Assume all values are known to three significant figures. (a) Calculate the car’s efficiency. (b) What is 
unreasonable about the result? (c) Which premise is unreasonable, or which premises are inconsistent? 


Exercise: 


Problem: Unreasonable Results 


Body fat is metabolized, supplying 9.30 kcal/g, when dietary intake is less than needed to fuel metabolism. 
The manufacturers of an exercise bicycle claim that you can lose 0.500 kg of fat per day by vigorously 
exercising for 2.00 h per day on their machine. (a) How many kcal are supplied by the metabolization of 
0.500 kg of fat? (b) Calculate the kcal/min that you would have to utilize to metabolize fat at the rate of 0.500 
kg in 2.00 h. (c) What is unreasonable about the results? (d) Which premise is unreasonable, or which 
premises are inconsistent? 


Solution: 
(a) 4.65 x 10° kcal 
(b) 38.8 kcal/min 


(c) This power output is higher than the highest value on [link], which is about 35 kcal/min (corresponding to 
2415 watts) for sprinting. 
(d) It would be impossible to maintain this power output for 2 hours (imagine sprinting for 2 hours!). 


Exercise: 


Problem: Construct Your Own Problem 


Consider a person climbing and descending stairs. Construct a problem in which you calculate the long-term 
rate at which stairs can be climbed considering the mass of the person, his ability to generate power with his 
legs, and the height of a single stair step. Also consider why the same person can descend stairs at a faster rate 
for a nearly unlimited time in spite of the fact that very similar forces are exerted going down as going up. 
(This points to a fundamentally different process for descending versus climbing stairs.) 


Exercise: 


Problem: Construct Your Own Problem 


Consider humans generating electricity by pedaling a device similar to a stationary bicycle. Construct a 
problem in which you determine the number of people it would take to replace a large electrical generation 
facility. Among the things to consider are the power output that is reasonable using the legs, rest time, and the 
need for electricity 24 hours per day. Discuss the practical implications of your results. 


Exercise: 
Problem: Integrated Concepts 


A 105-kg basketball player crouches down 0.400 m while waiting to jump. After exerting a force on the floor 
through this 0.400 m, his feet leave the floor and his center of gravity rises 0.950 m above its normal standing 
erect position. (a) Using energy considerations, calculate his velocity when he leaves the floor. (b) What 
average force did he exert on the floor? (Do not neglect the force to support his weight as well as that to 
accelerate him.) (c) What was his power output during the acceleration phase? 


Solution: 
(a) 4.32 m/s 
(b) 3.47 x 10° N 


(c) 8.93 kw 


Glossary 


renewable forms of energy 
those sources that cannot be used up, such as water, wind, solar, and biomass 


fossil fuels 
oil, natural gas, and coal 


Introduction to Linear Momentum and Collisions 
class="introduction" 


"Each 
rugby 
player has 
great 
momentum 
, which will 
affect the 
outcome of 
their 
collisions 
with each 
other and 
the ground. 
(credit: 
vjpaul, 
Flickr)" 


We use the term momentum in various ways in everyday language, and 
most of these ways are consistent with its precise scientific definition. We 
speak of sports teams or politicians gaining and maintaining the momentum 
to win. We also recognize that momentum has something to do with 
collisions. For example, looking at the rugby players in the photograph 
colliding and falling to the ground, we expect their momenta to have great 
effects in the resulting collisions. Generally, momentum implies a tendency 
to continue on course—to move in the same direction—and is associated 
with great mass and speed. 


Momentum, like energy, is important because it is conserved. Only a few 
physical quantities are conserved in nature, and studying them yields 
fundamental insight into how nature works, as we shall see in our study of 
momentum. 


Linear Momentum and Force 


e Define linear momentum. 

e Explain the relationship between momentum and force. 

e State Newton’s second law of motion in terms of momentum. 
e Calculate momentum given mass and velocity. 


Linear Momentum 


The scientific definition of linear momentum is consistent with most 
people’s intuitive understanding of momentum: a large, fast-moving object 
has greater momentum than a smaller, slower object. Linear momentum is 
defined as the product of a system’s mass multiplied by its velocity. In 
symbols, linear momentum is expressed as 

Equation: 


p = Mv. 


Momentum is directly proportional to the object’s mass and also its 
velocity. Thus the greater an object’s mass or the greater its velocity, the 
greater its momentum. Momentum p is a vector having the same direction 
as the velocity v. The SI unit for momentum is kg - m/s. 


Note: 

Linear Momentum 

Linear momentum is defined as the product of a system’s mass multiplied 
by its velocity: 

Equation: 


Example: 


Calculating Momentum: A Football Player and a Football 

(a) Calculate the momentum of a 110-kg football player running at 8.00 
m/s. (b) Compare the player’s momentum with the momentum of a hard- 
thrown 0.410-kg football that has a speed of 25.0 m/s. 

Strategy 

No information is given regarding direction, and so we can calculate only 
the magnitude of the momentum, p. (As usual, a symbol that is in italics is 
a magnitude, whereas one that is italicized, boldfaced, and has an arrow is 
a vector.) In both parts of this example, the magnitude of momentum can 
be calculated directly from the definition of momentum given in the 
equation, which becomes 

Equation: 


p=mv 


when only magnitudes are considered. 

Solution for (a) 

To determine the momentum of the player, substitute the known values for 
the player’s mass and speed into the equation. 

Equation: 


Pplayer = (110 kg)(8.00 m/s) = 880 kg - m/s 


Solution for (b) 

To determine the momentum of the ball, substitute the known values for 
the ball’s mass and speed into the equation. 

Equation: 


Poall = (0.410 kg)(25.0 m/s) = 10.3 kg- m/s 


The ratio of the player’s momentum to that of the ball is 
Equation: 


Pplayer = 880 — 859. 
Pball 10.3 


Discussion 


Although the ball has greater velocity, the player has a much greater mass. 
Thus the momentum of the player is much greater than the momentum of 
the football, as you might guess. As a result, the player’s motion is only 
slightly affected if he catches the ball. We shall quantify what happens in 
such collisions in terms of momentum in later sections. 


Momentum and Newton’s Second Law 


The importance of momentum, unlike the importance of energy, was 
recognized early in the development of classical physics. Momentum was 
deemed so important that it was called the “quantity of motion.” Newton 
actually stated his second law of motion in terms of momentum: The net 
external force equals the change in momentum of a system divided by the 
time over which it changes. Using symbols, this law is 

Equation: 


where F ,¢ is the net external force, Ap is the change in momentum, and 
At is the change in time. 


Note: 

Newton’s Second Law of Motion in Terms of Momentum 
The net external force equals the change in momentum of a system divided 
by the time over which it changes. 
Equation: 


Note: 

Making Connections: Force and Momentum 

Force and momentum are intimately related. Force acting over time can 
change momentum, and Newton’s second law of motion, can be stated in 
its most broadly applicable form in terms of momentum. Momentum 
continues to be a key concept in the study of atomic and subatomic 
particles in quantum mechanics. 


This statement of Newton’s second law of motion includes the more 
familiar F,.-,=ma as a special case. We can derive this form as follows. 
First, note that the change in momentum Ap is given by 

Equation: 


Ap = A(mv). 


If the mass of the system is constant, then 
Equation: 


A(mv) = mAv. 


So that for constant mass, Newton’s second law of motion becomes 
Equation: 


Ap _ mAv 


Bnet = Ae = Ay 


Because —— = a, we get the familiar equation 
Equation: 


F=ma 


when the mass of the system is constant. 


Newton’s second law of motion stated in terms of momentum is more 
generally applicable because it can be applied to systems where the mass is 
changing, such as rockets, as well as to systems of constant mass. We will 
consider systems with varying mass in some detail; however, the 
relationship between momentum and force remains useful when mass is 
constant, such as in the following example. 


Example: 

Calculating Force: Venus Williams’ Racquet 

During the 2007 French Open, Venus Williams hit the fastest recorded 
serve in a premier women’s match, reaching a speed of 58 m/s (209 km/h). 
What is the average force exerted on the 0.057-kg tennis ball by Venus 
Williams’ racquet, assuming that the ball’s speed just after impact is 58 
m/s, that the initial horizontal component of the velocity before impact is 
negligible, and that the ball remained in contact with the racquet for 5.0 ms 
(milliseconds)? 

Strategy 

This problem involves only one dimension because the ball starts from 
having no horizontal velocity component before impact. Newton’s second 
law stated in terms of momentum is then written as 

Equation: 


F =e 
net — At 0 
As noted above, when mass is constant, the change in momentum is given 


by 
Equation: 


Ap = mAv = m(vz — Uj). 


In this example, the velocity just after impact and the change in time are 
given; thus, once Ap is calculated, Fret = = can be used to find the 


force. 
Solution 


To determine the change in momentum, substitute the values for the initial 
and final velocities into the equation above. 
Equation: 


Ap m(vs—- 0; ) 
(0.057 kg)(58 m/s—0 m/s) 


3.306 kg - m/s © 3.3 kg- m/s 


Now the magnitude of the net external force can determined by using 


Ap . 
Pret = At: 
Equation: 
F _ Ap _ 3.306 kg-m/s 
at At ~  5.0x10-3s 


— 661N ~ 660N, 


where we have retained only two significant figures in the final step. 
Discussion 

This quantity was the average force exerted by Venus Williams’ racquet on 
the tennis ball during its brief impact (note that the ball also experienced 
the 0.56-N force of gravity, but that force was not due to the racquet). This 
problem could also be solved by first finding the acceleration and then 
using Pye, = ma, but one additional step would be required compared with 
the strategy used in this example. 


Section Summary 


e Linear momentum (momentum for brevity) is defined as the product of 
a system’s mass multiplied by its velocity. 

e In symbols, linear momentum p is defined to be 
Equation: 


pI, 


where m is the mass of the system and v is its velocity. 
e The SI unit for momentum is kg - m/s. 


e Newton’s second law of motion in terms of momentum states that the 


net external force equals the change in momentum of a system divided 


by the time over which it changes. 
e In symbols, Newton’s second law of motion is defined to be 
Equation: 


Ap 


| ee yer ae, 
eA 


F net is the net external force, Ap is the change in momentum, and At 


is the change time. 


Conceptual Questions 


Exercise: 
Problem: 
An object that has a small mass and an object that has a large mass 


have the same momentum. Which object has the largest kinetic 
energy? 


Exercise: 
Problem: 
An object that has a small mass and an object that has a large mass 
have the same kinetic energy. Which mass has the largest momentum? 


Exercise: 


Problem: Professional Application 


Football coaches advise players to block, hit, and tackle with their feet 
on the ground rather than by leaping through the air. Using the 
concepts of momentum, work, and energy, explain how a football 
player can be more effective with his feet on the ground. 


Exercise: 


Problem: 


How can a small force impart the same momentum to an object as a 
large force? 


Problems & Exercises 


Exercise: 
Problem: 
(a) Calculate the momentum of a 2000-kg elephant charging a hunter 
at a speed of 7.50 m/s. (b) Compare the elephant’s momentum with 
the momentum of a 0.0400-kg tranquilizer dart fired at a speed of 


600 m/s. (c) What is the momentum of the 90.0-kg hunter running at 
7.40 m/s after missing the elephant? 


Solution: 

(a) 1.50 x 104 kg - m/s 

(b) 625 to 1 

(c) 6.66 x 10? kg - m/s 
Exercise: 


Problem: 


(a) What is the mass of a large ship that has a momentum of 

1.60 x 10° kg - m/s, when the ship is moving at a speed of 

48.0 km/h? (b) Compare the ship’s momentum to the momentum of a 
1100-kg artillery shell fired at a speed of 1200 m/s. 


Exercise: 


Problem: 


(a) At what speed would a 2.00 x 10*-kg airplane have to fly to have 
a momentum of 1.60 x 10° kg-m /s (the same as the ship’s 
momentum in the problem above)? (b) What is the plane’s momentum 
when it is taking off at a speed of 60.0 m/s? (c) If the ship is an 
aircraft carrier that launches these airplanes with a catapult, discuss the 
implications of your answer to (b) as it relates to recoil effects of the 
catapult on the ship. 


Solution: 
(a) 8.00 x 104 m/s 
(b) 1.20 x 10° kg - m/s 


(c) Because the momentum of the airplane is 3 orders of magnitude 
smaller than of the ship, the ship will not recoil very much. The recoil 
would be —0.0100 m/s, which is probably not noticeable. 


Exercise: 
Problem: 
(a) What is the momentum of a garbage truck that is 1.20 x 104 kg 


and is moving at 10.0 m/s? (b) At what speed would an 8.00-kg trash 
can have the same momentum as the truck? 


Exercise: 


Problem: 


A runaway train car that has a mass of 15,000 kg travels at a speed of 
5.4 m/s down a track. Compute the time required for a force of 1500 
N to bring the car to rest. 


Solution: 


545 


Exercise: 


Problem: 


The mass of Earth is 5.972 x 1074 kg and its orbital radius is an 
average of 1.496 x 10" m. Calculate its linear momentum. 


Glossary 


linear momentum 
the product of mass and velocity 


second law of motion 
physical law that states that the net external force equals the change in 
momentum of a system divided by the time over which it changes 


Impulse 


e Define impulse. 

¢ Describe effects of impulses in everyday life. 

e Determine the average effective force using graphical representation. 
¢ Calculate average force and impulse given mass, velocity, and time. 


The effect of a force on an object depends on how long it acts, as well as 
how great the force is. In [link], a very large force acting for a short time 
had a great effect on the momentum of the tennis ball. A small force could 
cause the same change in momentum, but it would have to act for a much 
longer time. For example, if the ball were thrown upward, the gravitational 
force (which is much smaller than the tennis racquet’s force) would 
eventually reverse the momentum of the ball. Quantitatively, the effect we 
are talking about is the change in momentum Ap. 


By rearranging the equation Fy = ap to be 
Equation: 


Ap — Fret At, 


we can see how the change in momentum equals the average net external 
force multiplied by the time this force acts. The quantity F,,., At is given 
the name impulse. Impulse is the same as the change in momentum. 


Note: 

Impulse: Change in Momentum 

Change in momentum equals the average net external force multiplied by 
the time this force acts. 

Equation: 


Ap = eee NG 


The quantity F,,, At is given the name impulse. 


There are many ways in which an understanding of impulse can save lives, 
or at least limbs. The dashboard padding in a car, and certainly the airbags, 
allow the net force on the occupants in the car to act over a much longer 
time when there is a sudden stop. The momentum change is the same for 
an occupant, whether an air bag is deployed or not, but the force (to bring 
the occupant to a stop) will be much less if it acts over a larger time. Cars 
today have many plastic components. One advantage of plastics is their 
lighter weight, which results in better gas mileage. Another advantage is 
that a car will crumple in a collision, especially in the event of a head-on 
collision. A longer collision time means the force on the car will be less. 
Deaths during car races decreased dramatically when the rigid frames of 
racing cars were replaced with parts that could crumple or collapse in the 
event of an accident. 

Bones in a body will fracture if the force on them is too large. If you jump 
onto the floor from a table, the force on your legs can be immense if you 
land stiff-legged on a hard surface. Rolling on the ground after jumping 
from the table, or landing with a parachute, extends the time over which 
the force (on you from the ground) acts. 


Example: 

Calculating Magnitudes of Impulses: Two Billiard Balls Striking a 
Rigid Wall 

Two identical billiard balls strike a rigid wall with the same speed, and are 
reflected without any change of speed. The first ball strikes perpendicular 
to the wall. The second ball strikes the wall at an angle of 30° from the 
perpendicular, and bounces off at an angle of 30° from perpendicular to the 
wall. 

(a) Determine the direction of the force on the wall due to each ball. 

(b) Calculate the ratio of the magnitudes of impulses on the two balls by 
the wall. 

Strategy for (a) 

In order to determine the force on the wall, consider the force on the ball 
due to the wall using Newton’s second law and then apply Newton’s third 
law to determine the direction. Assume the z-axis to be normal to the wall 
and to be positive in the initial direction of motion. Choose the y-axis to be 


along the wall in the plane of the second ball’s motion. The momentum 
direction and the velocity direction are the same. 

Solution for (a) 

The first ball bounces directly into the wall and exerts a force on it in the 
+z direction. Therefore the wall exerts a force on the ball in the —x 
direction. The second ball continues with the same momentum component 
in the y direction, but reverses its x-component of momentum, as seen by 
sketching a diagram of the angles involved and keeping in mind the 
proportionality between velocity and momentum. 

These changes mean the change in momentum for both balls is in the —z 
direction, so the force of the wall on each ball is along the —2 direction. 
Strategy for (b) 

Calculate the change in momentum for each ball, which is equal to the 
impulse imparted to the ball. 

Solution for (b) 

Let u be the speed of each ball before and after collision with the wall, and 
m the mass of each ball. Choose the z-axis and y-axis as previously 
described, and consider the change in momentum of the first ball which 
strikes perpendicular to the wall. 

Equation: 


Pxi = MU; Pyi = 0 
Equation: 
Pxf = —MUu, Pyf = 0 


Impulse is the change in momentum vector. Therefore the z-component of 
impulse is equal to —2mu and the y-component of impulse is equal to 
zero. 

Now consider the change in momentum of the second ball. 

Equation: 


Pxi = Mu cos 30°; py; = —mu sin 30° 
Equation: 


Pxt =— mu cos 30°; pys = —mu sin 30° 


It should be noted here that while p, changes sign after the collision, py 
does not. Therefore the x-component of impulse is equal to —2mu cos 30° 
and the y-component of impulse is equal to zero. 

The ratio of the magnitudes of the impulse imparted to the balls is 
Equation: 


2mu 2 
2mucos 30° 4/3 


Discussion 

The direction of impulse and force is the same as in the case of (a); it is 
normal to the wall and along the negative x-direction. Making use of 
Newton’s third law, the force on the wall due to each ball is normal to the 
wall along the positive x -direction. 


Our definition of impulse includes an assumption that the force is constant 
over the time interval At. Forces are usually not constant. Forces vary 
considerably even during the brief time intervals considered. It is, however, 
possible to find an average effective force Fre that produces the same result 
as the corresponding time-varying force. [link] shows a graph of what an 
actual force looks like as a function of time for a ball bouncing off the floor. 
The area under the curve has units of momentum and is equal to the 
impulse or change in momentum between times f; and f2. That area is 
equal to the area inside the rectangle bounded by Fe, ¢1, and t2. Thus the 
impulses and their effects are the same for both the actual and effective 
forces. 


Faeries 


Fe 


A graph of force versus time 
with time along the x-axis and 
force along the y-axis for an 
actual force and an equivalent 
effective force. The areas under 
the two curves are equal. 


Note: 

Making Connections: Take-Home Investigation—Hand Movement and 
Impulse 

Try catching a ball while “giving” with the ball, pulling your hands toward 
your body. Then, try catching a ball while keeping your hands still. Hit 
water in a tub with your full palm. After the water has settled, hit the water 
again by diving your hand with your fingers first into the water. (Your full 
palm represents a swimmer doing a belly flop and your diving hand 
represents a swimmer doing a dive.) Explain what happens in each case 
and why. Which orientations would you advise people to avoid and why? 


Note: 

Making Connections: Constant Force and Constant Acceleration 

The assumption of a constant force in the definition of impulse is 
analogous to the assumption of a constant acceleration in kinematics. In 
both cases, nature is adequately described without the use of calculus. 


Section Summary 


e Impulse, or change in momentum, equals the average net external 
force multiplied by the time this force acts: 
Equation: 


Ap = F net NT: 


e Forces are usually not constant over a period of time. 


Conceptual Questions 


Exercise: 


Problem: Professional Application 


Explain in terms of impulse how padding reduces forces in a collision. 
State this in terms of a real example, such as the advantages of a 
carpeted vs. tile floor for a day care center. 


Exercise: 
Problem: 
While jumping on a trampoline, sometimes you land on your back and 


other times on your feet. In which case can you reach a greater height 
and why? 


Exercise: 
Problem: Professional Application 
Tennis racquets have “sweet spots.” If the ball hits a sweet spot then 
the player's arm is not jarred as much as it would be otherwise. Explain 
why this is the case. 


Problems & Exercises 


Exercise: 


Problem: 


A bullet is accelerated down the barrel of a gun by hot gases produced 
in the combustion of gun powder. What is the average force exerted on 
a 0.0300-kg bullet to accelerate it to a speed of 600 m/s in a time of 
2.00 ms (milliseconds)? 


Solution: 


9.00 x 102? N 


Exercise: 


Problem: Professional Application 


A car moving at 10 m/s crashes into a tree and stops in 0.26 s. 
Calculate the force the seat belt exerts on a passenger in the car to 
bring him to a halt. The mass of the passenger is 70 kg. 


Exercise: 
Problem: 
A person slaps her leg with her hand, bringing her hand to rest in 2.50 
milliseconds from an initial speed of 4.00 m/s. (a) What is the average 
force exerted on the leg, taking the effective mass of the hand and 
forearm to be 1.50 kg? (b) Would the force be any different if the 


woman clapped her hands together at the same speed and brought them 
to rest in the same time? Explain why or why not. 


Solution: 
a) 2.40 x 10° N toward the leg 


b) The force on each hand would have the same magnitude as that 
found in part (a) (but in opposite directions by Newton’s third law) 
because the change in momentum and the time interval are the same. 


Exercise: 


Problem: Professional Application 


A professional boxer hits his opponent with a 1000-N horizontal blow 
that lasts for 0.150 s. (a) Calculate the impulse imparted by this blow. 
(b) What is the opponent’s final velocity, if his mass is 105 kg and he 
is motionless in midair when struck near his center of mass? (c) 
Calculate the recoil velocity of the opponent’s 10.0-kg head if hit in 
this manner, assuming the head does not initially transfer significant 
momentum to the boxer’s body. (d) Discuss the implications of your 
answers for parts (b) and (c). 


Exercise: 


Problem: Professional Application 


Suppose a child drives a bumper car head on into the side rail, which 
exerts a force of 4000 N on the car for 0.200 s. (a) What impulse is 
imparted by this force? (b) Find the final velocity of the bumper car if 
its initial velocity was 2.80 m/s and the car plus driver have a mass of 
200 kg. You may neglect friction between the car and floor. 


Solution: 
a) 800 kg - m/s away from the wall 


b) 1.20 m/s away from the wall 


Exercise: 


Problem: Professional Application 


One hazard of space travel is debris left by previous missions. There 
are several thousand objects orbiting Earth that are large enough to be 
detected by radar, but there are far greater numbers of very small 
objects, such as flakes of paint. Calculate the force exerted by a 0.100- 
mg chip of paint that strikes a spacecraft window at a relative speed of 
4.00 x 10° m/s, given the collision lasts 6.00 x 10° s. 


Exercise: 


Problem: Professional Application 


A 75.0-kg person is riding in a car moving at 20.0 m/s when the car 
runs into a bridge abutment. (a) Calculate the average force on the 
person if he is stopped by a padded dashboard that compresses an 
average of 1.00 cm. (b) Calculate the average force on the person if he 
is stopped by an air bag that compresses an average of 15.0 cm. 


Solution: 
(a) 1.50 x 10° N away from the dashboard 


(b) 1.00 x 10° N away from the dashboard 


Exercise: 


Problem: Professional Application 


Military rifles have a mechanism for reducing the recoil forces of the 
gun on the person firing it. An internal part recoils over a relatively 
large distance and is stopped by damping mechanisms in the gun. The 
larger distance reduces the average force needed to stop the internal 
part. (a) Calculate the recoil velocity of a 1.00-kg plunger that directly 
interacts with a 0.0200-kg bullet fired at 600 m/s from the gun. (b) If 
this part is stopped over a distance of 20.0 cm, what average force is 
exerted upon it by the gun? (c) Compare this to the force exerted on 
the gun if the bullet is accelerated to its velocity in 10.0 ms 
(milliseconds). 


Exercise: 


Problem: 


A cruise ship with a mass of 1.00 x 10’ kg strikes a pier at a speed of 
0.750 m/s. It comes to rest 6.00 m later, damaging the ship, the pier, 
and the tugboat captain’s finances. Calculate the average force exerted 
on the pier using the concept of impulse. (Hint: First calculate the time 
it took to bring the ship to rest.) 


Solution: 


4.69 x 10° N in the boat’s original direction of motion 
Exercise: 

Problem: 

Calculate the final speed of a 110-kg rugby player who is initially 


running at 8.00 m/s but collides head-on with a padded goalpost and 
experiences a backward force of 1.76 x 10* N for 5.50 x 10s. 


Exercise: 
Problem: 
Water from a fire hose is directed horizontally against a wall at a rate 
of 50.0 kg/s and a speed of 42.0 m/s. Calculate the magnitude of the 


force exerted on the wall, assuming the water’s horizontal momentum 
is reduced to zero. 


Solution: 


2.10 x 10° N away from the wall 
Exercise: 
Problem: 
A 0.450-kg hammer is moving horizontally at 7.00 m/s when it strikes 
a nail and comes to rest after driving the nail 1.00 cm into a board. (a) 


Calculate the duration of the impact. (b) What was the average force 
exerted on the nail? 


Exercise: 
Problem: 
Starting with the definitions of momentum and kinetic energy, derive 


an equation for the kinetic energy of a particle expressed as a function 
of its momentum. 


Solution: 
Equation: 
2 
p=mv=> p=mv => > = mv 
2 
= £. = imv’? = KE 
KE= 2 
2m 
Exercise: 
Problem: 


A ball with an initial velocity of 10 m/s moves at an angle 60° above 
the + x-direction. The ball hits a vertical wall and bounces off so that it 
is moving 60° above the —z-direction with the same speed. What is 
the impulse delivered by the wall? 


Exercise: 
Problem: 
When serving a tennis ball, a player hits the ball when its velocity is 
zero (at the highest point of a vertical toss). The racquet exerts a force 


of 540 N on the ball for 5.00 ms, giving it a final velocity of 45.0 m/s. 
Using these data, find the mass of the ball. 


Solution: 


60.0 g 


Exercise: 


Problem: 


A punter drops a ball from rest vertically 1 meter down onto his foot. 
The ball leaves the foot with a speed of 18 m/s at an angle 55° above 
the horizontal. What is the impulse delivered by the foot (magnitude 
and direction)? 


Glossary 


change in momentum 
the difference between the final and initial momentum; the mass times 
the change in velocity 


impulse 
the average net external force times the time it acts; equal to the 
change in momentum 


Conservation of Momentum 


e Describe the principle of conservation of momentum. 

e Derive an expression for the conservation of momentum. 

e Explain conservation of momentum with examples. 

e Explain the principle of conservation of momentum as it relates to 
atomic and subatomic particles. 


Momentum is an important quantity because it is conserved. Yet it was not 
conserved in the examples in Impulse and Linear Momentum and Force, 
where large changes in momentum were produced by forces acting on the 
system of interest. Under what circumstances is momentum conserved? 


The answer to this question entails considering a sufficiently large system. 
It is always possible to find a larger system in which total momentum is 
constant, even if momentum changes for components of the system. If a 
football player runs into the goalpost in the end zone, there will be a force 
on him that causes him to bounce backward. However, the Earth also 
recoils —conserving momentum—because of the force applied to it through 
the goalpost. Because Earth is many orders of magnitude more massive 
than the player, its recoil is immeasurably small and can be neglected in any 
practical sense, but it is real nevertheless. 


Consider what happens if the masses of two colliding objects are more 
similar than the masses of a football player and Earth—for example, one car 
bumping into another, as shown in [link]. Both cars are coasting in the same 
direction when the lead car (labeled mz) is bumped by the trailing car 
(labeled m ,). The only unbalanced force on each car is the force of the 
collision. (Assume that the effects due to friction are negligible.) Car 1 
slows down as a result of the collision, losing some momentum, while car 2 
speeds up and gains some momentum. We shall now show that the total 
momentum of the two-car system remains constant. 


Before net F = 0 


System 
of interest 


System 
of interest 


After 


Pi + Po = Prot 


A car of mass m, moving with a velocity of v; bumps into 
another car of mass mz and velocity v2 that it is following. As 
a result, the first car slows down to a velocity of v/; and the 
second speeds up to a velocity of v/z. The momentum of each 
car is changed, but the total momentum piot of the two cars is 
the same before and after the collision (if you assume friction 
is negligible). 


Using the definition of impulse, the change in momentum of car 1 is given 


by 
Equation: 


Ap, = FiAt, 


where F; is the force on car 1 due to car 2, and At is the time the force acts 
(the duration of the collision). Intuitively, it seems obvious that the collision 
time is the same for both cars, but it is only true for objects traveling at 
ordinary speeds. This assumption must be modified for objects travelling 


near the speed of light, without affecting the result that momentum is 
conserved. 


Similarly, the change in momentum of car 2 is 
Equation: 


Ap» = FL,At, 


where fF» is the force on car 2 due to car 1, and we assume the duration of 
the collision At is the same for both cars. We know from Newton’s third 
law that Fy = —F, and so 

Equation: 


Ap2 = —F,At = —Ap}. 


Thus, the changes in momentum are equal and opposite, and 
Equation: 


Ap; + Ap, = 0. 


Because the changes in momentum add to zero, the total momentum of the 
two-car system is constant. That is, 
Equation: 


pi + po = constant, 
Equation: 

Pi + po = ply + plo, 
where p/, and p/, are the momenta of cars 1 and 2 after the collision. (We 
often use primes to denote the final state.) 


This result—that momentum is conserved—has validity far beyond the 
preceding one-dimensional case. It can be similarly shown that total 
momentum is conserved for any isolated system, with any number of 


objects in it. In equation form, the conservation of momentum principle 
for an isolated system is written 
Equation: 


Ptot = constant, 


or 
Equation: 


Ptot = Phtots 


where Pitot is the total momentum (the sum of the momenta of the 
individual objects in the system) and p/,,, is the total momentum some time 
later. (The total momentum can be shown to be the momentum of the center 
of mass of the system.) An isolated system is defined to be one for which 
the net external force is zero (Fyet = 0). 


Note: 
Conservation of Momentum Principle 
Equation: 
Piop = constant 
Piot = Plio¢ (isolated system) 
Note: 


Isolated System 
An isolated system is defined to be one for which the net external force is 
Zero (Fret = 0). 


Perhaps an easier way to see that momentum is conserved for an isolated 
system is to consider Newton’s second law in terms of momentum, 


| AP it . For an isolated system, (Fnet = 0); thus, Apiot = 0, and 
Ptot 1S constant. 


We have noted that the three length dimensions in nature—z, y, and z—are 
independent, and it is interesting to note that momentum can be conserved 
in different ways along each dimension. For example, during projectile 
motion and where air resistance is negligible, momentum is conserved in 
the horizontal direction because horizontal forces are zero and momentum 
is unchanged. But along the vertical direction, the net vertical force is not 
zero and the momentum of the projectile is not conserved. (See [link].) 
However, if the momentum of the projectile-Earth system is considered in 


the vertical direction, we find that the total momentum is conserved. 
After 


net F,=0 P, = const 


net F,= mg#0 py # const 


The horizontal component of a projectile’s momentum is 
conserved if air resistance is negligible, even in this case 
where a space probe separates. The forces causing the 
separation are internal to the system, so that the net external 
horizontal force Fy, net is still zero. The vertical component 
of the momentum is not conserved, because the net vertical 
force Fyy-net is not zero. In the vertical direction, the space 
probe-Earth system needs to be considered and we find that 
the total momentum is conserved. The center of mass of the 


space probe takes the same path it would if the separation 
did not occur. 


The conservation of momentum principle can be applied to systems as 
different as a comet striking Earth and a gas containing huge numbers of 
atoms and molecules. Conservation of momentum is violated only when the 
net external force is not zero. But another larger system can always be 
considered in which momentum is conserved by simply including the 
source of the external force. For example, in the collision of two cars 
considered above, the two-car system conserves momentum while each 
one-car system does not. 


Note: 

Making Connections: Take-Home Investigation—Drop of Tennis Ball and 
a Basketball 

Hold a tennis ball side by side and in contact with a basketball. Drop the 
balls together. (Be careful!) What happens? Explain your observations. 
Now hold the tennis ball above and in contact with the basketball. What 
happened? Explain your observations. What do you think will happen if 
the basketball ball is held above and in contact with the tennis ball? 


Note: 

Making Connections: Take-Home Investigation—Two Tennis Balls in a 
Ballistic Trajectory 

Tie two tennis balls together with a string about a foot long. Hold one ball 
and let the other hang down and throw it in a ballistic trajectory. Explain 
your observations. Now mark the center of the string with bright ink or 
attach a brightly colored sticker to it and throw again. What happened? 
Explain your observations. 

Some aquatic animals such as jellyfish move around based on the 
principles of conservation of momentum. A jellyfish fills its umbrella 
section with water and then pushes the water out resulting in motion in the 
opposite direction to that of the jet of water. Squids propel themselves in a 


similar manner but, in contrast with jellyfish, are able to control the 
direction in which they move by aiming their nozzle forward or backward. 
Typical squids can move at speeds of 8 to 12 km/h. 

The ballistocardiograph (BCG) was a diagnostic tool used in the second 
half of the 20th century to study the strength of the heart. About once a 
second, your heart beats, forcing blood into the aorta. A force in the 
opposite direction is exerted on the rest of your body (recall Newton’s third 
law). A ballistocardiograph is a device that can measure this reaction force. 
This measurement is done by using a sensor (resting on the person) or by 
using a moving table suspended from the ceiling. This technique can gather 
information on the strength of the heart beat and the volume of blood 
passing from the heart. However, the electrocardiogram (ECG or EKG) 
and the echocardiogram (cardiac ECHO or ECHO; a technique that uses 
ultrasound to see an image of the heart) are more widely used in the 
practice of cardiology. 


Note: 

Making Connections: Conservation of Momentum and Collision 
Conservation of momentum is quite useful in describing collisions. 
Momentum is crucial to our understanding of atomic and subatomic 
particles because much of what we know about these particles comes from 
collision experiments. 


Subatomic Collisions and Momentum 


The conservation of momentum principle not only applies to the 
macroscopic objects, it is also essential to our explorations of atomic and 
subatomic particles. Giant machines hurl subatomic particles at one another, 
and researchers evaluate the results by assuming conservation of 
momentum (among other things). 


On the small scale, we find that particles and their properties are invisible to 
the naked eye but can be measured with our instruments, and models of 
these subatomic particles can be constructed to describe the results. 


Momentum is found to be a property of all subatomic particles including 
massless particles such as photons that compose light. Momentum being a 
property of particles hints that momentum may have an identity beyond the 
description of an object’s mass multiplied by the object’s velocity. Indeed, 
momentum relates to wave properties and plays a fundamental role in what 
measurements are taken and how we take these measurements. 
Furthermore, we find that the conservation of momentum principle is valid 
when considering systems of particles. We use this principle to analyze the 
masses and other properties of previously undetected particles, such as the 
nucleus of an atom and the existence of quarks that make up particles of 
nuclei. [link] below illustrates how a particle scattering backward from 
another implies that its target is massive and dense. Experiments seeking 
evidence that quarks make up protons (one type of particle that makes up 
nuclei) scattered high-energy electrons off of protons (nuclei of hydrogen 
atoms). Electrons occasionally scattered straight backward in a manner that 
implied a very small and very dense particle makes up the proton—this 
observation is considered nearly direct evidence of quarks. The analysis 
was based partly on the same conservation of momentum principle that 
works so well on the large scale. 


Macroscopic 


Proton 


A subatomic particle scatters straight 
backward from a target particle. In 
experiments seeking evidence for 


quarks, electrons were observed to 
occasionally scatter straight backward 
from a proton. 


Section Summary 


e The conservation of momentum principle is written 
Equation: 


Ptot = constant 


or 
Equation: 


Prot =P/ict (isolated system), 


Ptot is the initial total momentum and p/,,;, is the total momentum 
some time later. 

e An isolated system is defined to be one for which the net external force 
is zero (Fret = 0). 

e During projectile motion and where air resistance is negligible, 
momentum is conserved in the horizontal direction because horizontal 
forces are zero. 

e Conservation of momentum applies only when the net external force is 
zero. 

e The conservation of momentum principle is valid when considering 
systems of particles. 


Conceptual Questions 


Exercise: 


Problem: Professional Application 


If you dive into water, you reach greater depths than if you do a belly 
flop. Explain this difference in depth using the concept of conservation 
of energy. Explain this difference in depth using what you have learned 
in this chapter. 


Exercise: 


Problem: Under what circumstances is momentum conserved? 
Exercise: 

Problem: 

Can momentum be conserved for a system if there are external forces 

acting on the system? If so, under what conditions? If not, why not? 
Exercise: 

Problem: 

Momentum for a system can be conserved in one direction while not 


being conserved in another. What is the angle between the directions? 
Give an example. 


Exercise: 


Problem: Professional Application 


Explain in terms of momentum and Newton’s laws how a car’s air 
resistance is due in part to the fact that it pushes air in its direction of 
motion. 


Exercise: 
Problem: 
Can objects in a system have momentum while the momentum of the 
system is zero? Explain your answer. 


Exercise: 


Problem: 


Must the total energy of a system be conserved whenever its 
momentum is conserved? Explain why or why not. 


Problems & Exercises 


Exercise: 


Problem: Professional Application 


Train cars are coupled together by being bumped into one another. 
Suppose two loaded train cars are moving toward one another, the first 
having a mass of 150,000 kg and a velocity of 0.300 m/s, and the 
second having a mass of 110,000 kg and a velocity of —0.120 m/s. 
(The minus indicates direction of motion.) What is their final velocity? 


Solution: 


0.122 m/s 
Exercise: 
Problem: 
Suppose a clay model of a koala bear has a mass of 0.200 kg and slides 
on ice at a speed of 0.750 m/s. It runs into another clay model, which 


is initially motionless and has a mass of 0.350 kg. Both being soft clay, 
they naturally stick together. What is their final velocity? 


Exercise: 


Problem: Professional Application 


Consider the following question: A car moving at 10 m/s crashes into 
a tree and stops in 0.26 s. Calculate the force the seatbelt exerts on a 

passenger in the car to bring him to a halt. The mass of the passenger 
is 70 kg. Would the answer to this question be different if the car with 


the 70-kg passenger had collided with a car that has a mass equal to 
and is traveling in the opposite direction and at the same speed? 
Explain your answer. 


Solution: 


In a collision with an identical car, momentum is conserved. 
Afterwards vs = 0 for both cars. The change in momentum will be the 
same as in the crash with the tree. However, the force on the body is 
not determined since the time is not known. A padded stop will reduce 
injurious force on body. 


Exercise: 
Problem: 
What is the velocity of a 900-kg car initially moving at 30.0 m/s, just 


after it hits a 150-kg deer initially running at 12.0 m/s in the same 
direction? Assume the deer remains on the car. 


Exercise: 


Problem: 


A 1.80-kg falcon catches a 0.650-kg dove from behind in midair. What 
is their velocity after impact if the falcon’s velocity is initially 28.0 m/s 
and the dove’s velocity is 7.00 m/s in the same direction? 


Solution: 


22.4 m/s in the same direction as the original motion 


Glossary 


conservation of momentum principle 
when the net external force is zero, the total momentum of the system 
is conserved or constant 


isolated system 


a system in which the net external force is zero 


quark 
fundamental constituent of matter and an elementary particle 


Elastic Collisions in One Dimension 


e Describe an elastic collision of two objects in one dimension. 

e Define internal kinetic energy. 

e Derive an expression for conservation of internal kinetic energy in a one 
dimensional collision. 

e Determine the final velocities in an elastic collision given masses and 
initial velocities. 


Let us consider various types of two-object collisions. These collisions are the 
easiest to analyze, and they illustrate many of the physical principles involved 
in collisions. The conservation of momentum principle is very useful here, 
and it can be used whenever the net external force on a system is zero. 


We start with the elastic collision of two objects moving along the same line 
—a one-dimensional problem. An elastic collision is one that also conserves 
internal kinetic energy. Internal kinetic energy is the sum of the kinetic 
energies of the objects in the system. [link] illustrates an elastic collision in 
which internal kinetic energy and momentum are conserved. 


Truly elastic collisions can only be achieved with subatomic particles, such as 
electrons striking nuclei. Macroscopic collisions can be very nearly, but not 
quite, elastic—some kinetic energy is always converted into other forms of 
energy such as heat transfer due to friction and sound. One macroscopic 
collision that is nearly elastic is that of two steel blocks on ice. Another nearly 
elastic collision is that between two carts with spring bumpers on an air track. 
Icy surfaces and air tracks are nearly frictionless, more readily allowing 
nearly elastic collisions on them. 


Note: 
Elastic Collision 
An elastic collision is one that conserves internal kinetic energy. 


Note: 
Internal Kinetic Energy 


Internal kinetic energy is the sum of the kinetic energies of the objects in 
the system. 


System of interest 
net F =0 


Frictionless surface 


System of interest 
Elastic => KE; + KE5 = KE, + KE, 
After 


Pi + P2 = Prot 


Frictionless surface 


An elastic one-dimensional 
two-object collision. 
Momentum and internal kinetic 
energy are conserved. 


Now, to solve problems involving one-dimensional elastic collisions between 
two objects we can use the equations for conservation of momentum and 
conservation of internal kinetic energy. First, the equation for conservation of 
momentum for two objects in a one-dimensional collision is 

Equation: 


pit p,=phyt+ ply (Fret = 0) 


Or 


Equation: 


MV, + Mev. = Mv + Mylo (Fret = 0), 


where the primes (') indicate values after the collision. By definition, an 
elastic collision conserves internal kinetic energy, and so the sum of kinetic 
energies before the collision equals the sum after the collision. Thus, 
Equation: 


1 1 1 1 
zyme + 5 Marr’ = zm + a movie’ (two-object elastic collision) 


expresses the equation for conservation of internal kinetic energy in a one- 
dimensional collision. 


Example: 

Calculating Velocities Following an Elastic Collision 

Calculate the velocities of two objects following an elastic collision, given 
that 

Equation: 


m, = 0.500 kg, m2 = 3.50 kg, v1; = 4.00 m/s, and v2 = 0. 


Strategy and Concept 

First, visualize what the initial conditions mean—a small object strikes a 
larger object that is initially at rest. This situation is slightly simpler than the 
situation shown in [link] where both objects are initially moving. We are 
asked to find two unknowns (the final velocities v/; and v/2). To find two 
unknowns, we must use two independent equations. Because this collision is 
elastic, we can use the above two equations. Both can be simplified by the 
fact that object 2 is initially at rest, and thus v2 = 0. Once we simplify these 
equations, we combine them algebraically to solve for the unknowns. 
Solution 

For this problem, note that v2 = 0 and use conservation of momentum. Thus, 
Equation: 


Pi = ph + plo 


or 
Equation: 


MV, = MV, + MyVlo. 


Using conservation of internal kinetic energy and that v2 = 0, 
Equation: 


i Su, yall pe 1 RY 
Si = Sah —MyvI9". 
es as a 2! 
Solving the first equation (momentum equation) for v/2, we obtain 
Equation: 


Oj — mG = vl). 
ms 


Substituting this expression into the second equation (internal kinetic energy 
equation) eliminates the variable v/2, leaving only v/; as an unknown (the 
algebra is left as an exercise for the reader). There are two solutions to any 
quadratic equation; in this example, they are 


Equation: 
vl, = 4.00 m/s 
and 
Equation: 
vl, = —3.00 m/s. 


As noted when quadratic equations were encountered in earlier chapters, both 
solutions may or may not be meaningful. In this case, the first solution is the 
same as the initial condition. The first solution thus represents the situation 
before the collision and is discarded. The second solution 

(v4; = —3.00 m/s) is negative, meaning that the first object bounces 
backward. When this negative value of v/; is used to find the velocity of the 
second object after the collision, we get 

Equation: 


0.500 kg 


tine —*(v4 a) = [4.00 — (—3.00)] m/s 


3.50 kg 
or 
Equation: 
vlog = 1.00 m/s. 
Discussion 


The result of this example is intuitively reasonable. A small object strikes a 
larger one at rest and bounces backward. The larger one is knocked forward, 
but with a low speed. (This is like a compact car bouncing backward off a 
full-size SUV that is initially at rest.) As a check, try calculating the internal 
kinetic energy before and after the collision. You will see that the internal 
kinetic energy is unchanged at 4.00 J. Also check the total momentum before 
and after the collision; you will find it, too, is unchanged. 

The equations for conservation of momentum and internal kinetic energy as 
written above can be used to describe any one-dimensional elastic collision 
of two objects. These equations can be extended to more objects if needed. 


Note: 

Making Connections: Take-Home Investigation—Ice Cubes and Elastic 
Collision 

Find a few ice cubes which are about the same size and a smooth kitchen 
tabletop or a table with a glass top. Place the ice cubes on the surface several 
centimeters away from each other. Flick one ice cube toward a stationary ice 
cube and observe the path and velocities of the ice cubes after the collision. 
Try to avoid edge-on collisions and collisions with rotating ice cubes. Have 
you created approximately elastic collisions? Explain the speeds and 
directions of the ice cubes using momentum. 


Note: 

PhET Explorations: Collision Lab 

Investigate collisions on an air hockey table. Set up your own experiments: 
vary the number of discs, masses and initial conditions. Is momentum 


conserved? Is kinetic energy conserved? Vary the elasticity and see what 
happens. 
https://phet.colorado.edu/sims/collision-lab/collision-lab_en.html 


Section Summary 


e An elastic collision is one that conserves internal kinetic energy. 

e Conservation of kinetic energy and momentum together allow the final 
velocities to be calculated in terms of initial velocities and masses in one 
dimensional two-body collisions. 


Conceptual Questions 


Exercise: 


Problem: What is an elastic collision? 


Problems & Exercises 


Exercise: 
Problem: 
Two identical objects (such as billiard balls) have a one-dimensional 
collision in which one is initially motionless. After the collision, the 
moving object is stationary and the other moves with the same speed as 


the other originally had. Show that both momentum and kinetic energy 
are conserved. 


Exercise: 
Problem: Professional Application 


Two manned satellites approach one another at a relative speed of 0.250 
m/s, intending to dock. The first has a mass of 4.00 x 10? kg, and the 


second a mass of 7.50 x 10° kg. If the two satellites collide elastically 
rather than dock, what is their final relative velocity? 


Solution: 


0.250 m/s 

Exercise: 
Problem: 
A 70.0-kg ice hockey goalie, originally at rest, catches a 0.150-kg 
hockey puck slapped at him at a velocity of 35.0 m/s. Suppose the goalie 
and the ice puck have an elastic collision and the puck is reflected back 


in the direction from which it came. What would their final velocities be 
in this case? 


Glossary 


elastic collision 
a collision that also conserves internal kinetic energy 


internal kinetic energy 
the sum of the kinetic energies of the objects in a system 


Inelastic Collisions in One Dimension 


¢ Define inelastic collision. 

e Explain perfectly inelastic collision. 

e Apply an understanding of collisions to sports. 

e Determine recoil velocity and loss in kinetic energy given mass and 
initial velocity. 


We have seen that in an elastic collision, internal kinetic energy is 
conserved. An inelastic collision is one in which the internal kinetic energy 
changes (it is not conserved). This lack of conservation means that the 
forces between colliding objects may remove or add internal kinetic energy. 
Work done by internal forces may change the forms of energy within a 
system. For inelastic collisions, such as when colliding objects stick 
together, this internal work may transform some internal kinetic energy into 
heat transfer. Or it may convert stored energy into internal kinetic energy, 
such as when exploding bolts separate a satellite from its launch vehicle. 


Note: 

Inelastic Collision 

An inelastic collision is one in which the internal kinetic energy changes (it 
is not conserved). 


[link] shows an example of an inelastic collision. Two objects that have 
equal masses head toward one another at equal speeds and then stick 
together. Their total internal kinetic energy is initially 

$mv? + +mv* = mv”. The two objects come to rest after sticking 
together, conserving momentum. But the internal kinetic energy is zero 
after the collision. A collision in which the objects stick together is 
sometimes called a perfectly inelastic collision because it reduces internal 
kinetic energy more than does any other type of inelastic collision. In fact, 
such a collision reduces internal kinetic energy to the minimum it can have 
while still conserving momentum. 


Note: 

Perfectly Inelastic Collision 

A collision in which the objects stick together is sometimes called 
“perfectly inelastic.” 


System of interest 


KE, = mv? 


System of interest 


net F = 0 


v=0 


Frictionless surface 


(b) 


An inelastic one-dimensional two-object collision. 
Momentum is conserved, but internal kinetic energy is not 
conserved. (a) Two objects of equal mass initially head 
directly toward one another at the same speed. (b) The 
objects stick together (a perfectly inelastic collision), and 
so their final velocity is zero. The internal kinetic energy of 
the system changes in any inelastic collision and is reduced 
to zero in this example. 


Example: 

Calculating Velocity and Change in Kinetic Energy: Inelastic Collision 
of a Puck and a Goalie 

(a) Find the recoil velocity of a 70.0-kg ice hockey goalie, originally at 
rest, who catches a 0.150-kg hockey puck slapped at him at a velocity of 
35.0 m/s. (b) How much kinetic energy is lost during the collision? 
Assume friction between the ice and the puck-goalie system is negligible. 
(See [link] ) 


After System of interest 


Before System of interest 
net F = 0 KE ‘int < KE int 
Py’ + Po = Prot ee 


Ps = Prot —— 


ae 


Frictionless ice surface Frictionless ice surface 


An ice hockey goalie catches a hockey puck and recoils 
backward. The initial kinetic energy of the puck is 
almost entirely converted to thermal energy and sound in 
this inelastic collision. 


Strategy 

Momentum is conserved because the net external force on the puck-goalie 
system is zero. We can thus use conservation of momentum to find the 
final velocity of the puck and goalie system. Note that the initial velocity 
of the goalie is zero and that the final velocity of the puck and goalie are 
the same. Once the final velocity is found, the kinetic energies can be 
calculated before and after the collision and compared as requested. 
Solution for (a) 

Momentum is conserved because the net external force on the puck-goalie 
system is zero. 

Conservation of momentum is 

Equation: 


Pi + p2 = ph, + pl, 


or 
Equation: 


MV, +Mv2 = MV + MygVv!o. 


Because the goalie is initially at rest, we know v2 = 0. Because the goalie 
catches the puck, the final velocities are equal, or v/) = v/yg = v/. Thus, the 


conservation of momentum equation simplifies to 
Equation: 


MV, = (M1 + M2)". 


Solving for v/ yields 
Equation: 
my 


v= ————1. 
m, +m 


Entering known values in this equation, we get 
Equation: 


0.150 kg _2 
Ul= ( 0.150 ke + 70.0 ke ) (35.0 m/s) = 7.48 x 10 “ m/s. 
Discussion for (a) 
This recoil velocity is small and in the same direction as the puck’s original 
velocity, as we might expect. 
Solution for (b) 
Before the collision, the internal kinetic energy KE; of the system is that 
of the hockey puck, because the goalie is initially at rest. Therefore, K Bint 
is initially 


Equation: 
KEin, = mv? = +(0.150 kg)(35.0 m/s)? 
91.9 J. 
After the collision, the internal kinetic energy is 
Equation: 
KEfin, = +(m-+M)v? = 4(70.15 kg) (7.48 x 10~? m/s)” 


0.196 J. 


The change in internal kinetic energy is thus 
Equation: 


KEfn, — KE = 0.196 J — 91.9 J 
Sue! 


where the minus sign indicates that the energy was lost. 

Discussion for (b) 

Nearly all of the initial internal kinetic energy is lost in this perfectly 
inelastic collision. KE;,4 is mostly converted to thermal energy and sound. 
During some collisions, the objects do not stick together and less of the 
internal kinetic energy is removed—such as happens in most automobile 
accidents. Alternatively, stored energy may be converted into internal 
kinetic energy during a collision. [link] shows a one-dimensional example 
in which two carts on an air track collide, releasing potential energy from a 


compressed spring. [link] deals with data from such a collision. 
Before net F = 0 System of interest 


Py + Po = Prot 


Frictionless 
surface 


After KEin > KEint System of interest 
Pi + P2 = Prot 


1/\ f\ P\ 


: ; Frictionless surface 
Uncoiled spring 


An air track is nearly frictionless, so that 
momentum is conserved. Motion is one- 
dimensional. In this collision, examined in [link], 
the potential energy of a compressed spring is 
released during the collision and is converted to 
internal kinetic energy. 


Collisions are particularly important in sports and the sporting and leisure 
industry utilizes elastic and inelastic collisions. Let us look briefly at 
tennis. Recall that in a collision, it is momentum and not force that is 
important. So, a heavier tennis racquet will have the advantage over a 
lighter one. This conclusion also holds true for other sports—a lightweight 
bat (such as a softball bat) cannot hit a hardball very far. 

The location of the impact of the tennis ball on the racquet is also 
important, as is the part of the stroke during which the impact occurs. A 
smooth motion results in the maximizing of the velocity of the ball after 
impact and reduces sports injuries such as tennis elbow. A tennis player 
tries to hit the ball on the “sweet spot” on the racquet, where the vibration 
and impact are minimized and the ball is able to be given more velocity. 
Sports science and technologies also use physics concepts such as 
momentum and rotational motion and vibrations. 


Note: 
Take-Home Experiment—Bouncing of Tennis Ball 


1. Find a racquet (a tennis, badminton, or other racquet will do). Place 
the racquet on the floor and stand on the handle. Drop a tennis ball on 
the strings from a measured height. Measure how high the ball 
bounces. Now ask a friend to hold the racquet firmly by the handle 
and drop a tennis ball from the same measured height above the 
racquet. Measure how high the ball bounces and observe what 
happens to your friend’s hand during the collision. Explain your 
observations and measurements. 

2. The coefficient of restitution (c) is a measure of the elasticity of a 
collision between a ball and an object, and is defined as the ratio of 
the speeds after and before the collision. A perfectly elastic collision 
has ac of 1. For a ball bouncing off the floor (or a racquet on the 
floor), c can be shown to be c = (h/H)*/2 where h is the height to 
which the ball bounces and # is the height from which the ball is 
dropped. Determine c for the cases in Part 1 and for the case of a 
tennis ball bouncing off a concrete or wooden floor (c = 0.85 for new 
tennis balls used on a tennis court). 


Example: 

Calculating Final Velocity and Energy Release: Two Carts Collide 

In the collision pictured in [link], two carts collide inelastically. Cart 1 
(denoted ™, carries a spring which is initially compressed. During the 
collision, the spring releases its potential energy and converts it to internal 
kinetic energy. The mass of cart 1 and the spring is 0.350 kg, and the cart 
and the spring together have an initial velocity of 2.00 m/s. Cart 2 
(denoted mz in [link]) has a mass of 0.500 kg and an initial velocity of 
—0.500 m/s. After the collision, cart 1 is observed to recoil with a 
velocity of —4.00 m/s. (a) What is the final velocity of cart 2? (b) How 
much energy was released by the spring (assuming all of it was converted 
into internal kinetic energy)? 

Strategy 

We can use conservation of momentum to find the final velocity of cart 2, 
because F’,,, = 0 (the track is frictionless and the force of the spring is 
internal). Once this velocity is determined, we can compare the internal 
kinetic energy before and after the collision to see how much energy was 
released by the spring. 

Solution for (a) 

As before, the equation for conservation of momentum in a two-object 
system is 

Equation: 


M1V1 + MyV2q = MV + MyV!o. 


The only unknown in this equation is v/y. Solving for v/y and substituting 
known values into the previous equation yields 


Equation: 
vi M1V1+M2Vg—M1vVNh 
my 
(0.350 kg) (2.00 m/s)+(0.500 kg)(—0.500 m/s) (0.350 kg)(—4.00 m/s) 


0.500 kg 0.500 kg 
==) Sh (Ady cy 


Solution for (b) 
The internal kinetic energy before the collision is 


Equation: 
KE = FMV; oe FM2V5 
= 41(0.350 kg)(2.00 m/s)* + +(0.500 kg)(—0.500 m/s)’ 
0.763 J. 
After the collision, the internal kinetic energy is 
Equation: 
KE = $myvl + +myv!5 
= 4(0.350 kg)(—4.00 m/s)? + (0.500 kg)(3.70 m/s)’ 
6.22 J. 
The change in internal kinetic energy is thus 
Equation: 
KE), —~ Kin — 6:22) — 0.763 J 
5.46 J. 
Discussion 


The final velocity of cart 2 is large and positive, meaning that it is moving 
to the right after the collision. The internal kinetic energy in this collision 
increases by 5.46 J. That energy was released by the spring. 


Section Summary 


e An inelastic collision is one in which the internal kinetic energy 
changes (it is not conserved). 

e A collision in which the objects stick together is sometimes called 
perfectly inelastic because it reduces internal kinetic energy more than 
does any other type of inelastic collision. 


e Sports science and technologies also use physics concepts such as 
momentum and rotational motion and vibrations. 


Conceptual Questions 


Exercise: 


Problem: 


What is an inelastic collision? What is a perfectly inelastic collision? 
Exercise: 


Problem: 


Mixed-pair ice skaters performing in a show are standing motionless at 
arms length just before starting a routine. They reach out, clasp hands, 
and pull themselves together by only using their arms. Assuming there 
is no friction between the blades of their skates and the ice, what is 
their velocity after their bodies meet? 


Exercise: 
Problem: 
A small pickup truck that has a camper shell slowly coasts toward a 
red light with negligible friction. Two dogs in the back of the truck are 
moving and making various inelastic collisions with each other and the 
walls. What is the effect of the dogs on the motion of the center of 


mass of the system (truck plus entire load)? What is their effect on the 
motion of the truck? 


Problems & Exercises 


Exercise: 


Problem: 


A 0.240-kg billiard ball that is moving at 3.00 m/s strikes the bumper 
of a pool table and bounces straight back at 2.40 m/s (80% of its 
original speed). The collision lasts 0.0150 s. (a) Calculate the average 
force exerted on the ball by the bumper. (b) How much kinetic energy 
in joules is lost during the collision? (c) What percent of the original 
energy is left? 


Solution: 
(a) 86.4 N perpendicularly away from the bumper 
(b) 0.389 J 


(c) 64.0% 
Exercise: 


Problem: 


During an ice show, a 60.0-kg skater leaps into the air and is caught by 
an initially stationary 75.0-kg skater. (a) What is their final velocity 
assuming negligible friction and that the 60.0-kg skater’s original 
horizontal velocity is 4.00 m/s? (b) How much kinetic energy is lost? 


Exercise: 


Problem: Professional Application 


Using mass and speed data from [link] and assuming that the football 
player catches the ball with his feet off the ground with both of them 
moving horizontally, calculate: (a) the final velocity if the ball and 
player are going in the same direction and (b) the loss of kinetic energy 
in this case. (c) Repeat parts (a) and (b) for the situation in which the 
ball and the player are going in opposite directions. Might the loss of 
kinetic energy be related to how much it hurts to catch the pass? 


Solution: 


(a) 8.06 m/s 
(b) -56.0 J 


(c)(i) 7.88 m/s; (ii) -223 J 
Exercise: 


Problem: 


A battleship that is 6.00 x 10’ kg and is originally at rest fires a 1100- 
kg artillery shell horizontally with a velocity of 575 m/s. (a) If the shell 
is fired straight aft (toward the rear of the ship), there will be 
negligible friction opposing the ship’s recoil. Calculate its recoil 
velocity. (b) Calculate the increase in internal kinetic energy (that is, 
for the ship and the shell). This energy is less than the energy released 
by the gun powder—significant heat transfer occurs. 


Exercise: 


Problem: Professional Application 


Two manned satellites approaching one another, at a relative speed of 
0.250 m/s, intending to dock. The first has a mass of 4.00 x 10° kg, 
and the second a mass of 7.50 x 10° kg. (a) Calculate the final 
velocity (after docking) by using the frame of reference in which the 
first satellite was originally at rest. (b) What is the loss of kinetic 
energy in this inelastic collision? (c) Repeat both parts by using the 
frame of reference in which the second satellite was originally at rest. 
Explain why the change in velocity is different in the two frames, 
whereas the change in kinetic energy is the same in both. 


Solution: 
(a) 0.163 m/s in the direction of motion of the more massive satellite 


(b) 81.6 J 


(c) 8.70 x 10~? m/s in the direction of motion of the less massive 
satellite, 81.5 J. Because there are no extemal forces, the velocity of 
the center of mass of the two-satellite system is unchanged by the 
collision. The two velocities calculated above are the velocity of the 
center of mass in each of the two different individual reference frames. 
The loss in KE is the same in both reference frames because the KE 
lost to internal forces (heat, friction, etc.) is the same regardless of the 
coordinate system chosen. 


Exercise: 


Problem: Professional Application 


A 30,000-kg freight car is coasting at 0.850 m/s with negligible 
friction under a hopper that dumps 110,000 kg of scrap metal into it. 
(a) What is the final velocity of the loaded freight car? (b) How much 
kinetic energy is lost? 


Exercise: 


Problem: Professional Application 


Space probes may be separated from their launchers by exploding 
bolts. (They bolt away from one another.) Suppose a 4800-kg satellite 
uses this method to separate from the 1500-kg remains of its launcher, 
and that 5000 J of kinetic energy is supplied to the two parts. What are 
their subsequent velocities using the frame of reference in which they 
were at rest before separation? 


Solution: 
0.704 m/s 


—2.25 m/s 


Exercise: 


Problem: 


A 0.0250-kg bullet is accelerated from rest to a speed of 550 m/s ina 
3.00-kg rifle. The pain of the rifle’s kick is much worse if you hold the 
gun loosely a few centimeters from your shoulder rather than holding 
it tightly against your shoulder. (a) Calculate the recoil velocity of the 
rifle if it is held loosely away from the shoulder. (b) How much kinetic 
energy does the rifle gain? (c) What is the recoil velocity if the rifle is 
held tightly against the shoulder, making the effective mass 28.0 kg? 
(d) How much kinetic energy is transferred to the rifle-shoulder 
combination? The pain is related to the amount of kinetic energy, 
which is significantly less in this latter situation. (e) Calculate the 
momentum of a 110-kg football player running at 8.00 m/s. Compare 
the player’s momentum with the momentum of a hard-thrown 0.410- 
kg football that has a speed of 25.0 m/s. Discuss its relationship to this 
problem. 


Solution: 

(a) 4.58 m/s away from the bullet 
(b) 31.5 J 

(c) -0.491 m/s 


(d) 3.38 J 


Exercise: 


Problem: Professional Application 


One of the waste products of a nuclear reactor is plutonium-239 
(Pu): This nucleus is radioactive and decays by splitting into a 
helium-4 nucleus and a uranium-235 nucleus (“He -- ml , the latter 
of which is also radioactive and will itself decay some time later. The 
energy emitted in the plutonium decay is 8.40 x 10°!° J and is 
entirely converted to kinetic energy of the helium and uranium nuclei. 


The mass of the helium nucleus is 6.68 x 1072? kg, while that of the 
uranium is 3.92 x 10° kg (note that the ratio of the masses is 4 to 
235). (a) Calculate the velocities of the two nuclei, assuming the 
plutonium nucleus is originally at rest. (b) How much kinetic energy 
does each nucleus carry away? Note that the data given here are 
accurate to three digits only. 


Exercise: 


Problem: Professional Application 


The Moon’s craters are remnants of meteorite collisions. Suppose a 
fairly large asteroid that has a mass of 5.00 x 10! kg (about a 
kilometer across) strikes the Moon at a speed of 15.0 km/s. (a) At what 
speed does the Moon recoil after the perfectly inelastic collision (the 
mass of the Moon is 7.36 x 107? kg) ? (b) How much kinetic energy 
is lost in the collision? Such an event may have been observed by 
medieval English monks who reported observing a red glow and 
subsequent haze about the Moon. (c) In October 2009, NASA crashed 
a rocket into the Moon, and analyzed the plume produced by the 
impact. (Significant amounts of water were detected.) Answer part (a) 
and (b) for this real-life experiment. The mass of the rocket was 2000 
kg and its speed upon impact was 9000 km/h. How does the plume 
produced alter these results? 


Solution: 
(a) 1.02 x 10° m/s 
(b) 5.63 x 107° J (almost all KE lost) 


(c) Recoil speed is 6.79 x 10-1" m/s, energy lost is 6.25 x 1024): 
The plume will not affect the momentum result because the plume is 
still part of the Moon system. The plume may affect the kinetic energy 
result because a significant part of the initial kinetic energy may be 
transferred to the kinetic energy of the plume particles. 


Exercise: 


Problem: Professional Application 


Two football players collide head-on in midair while trying to catch a 
thrown football. The first player is 95.0 kg and has an initial velocity 
of 6.00 m/s, while the second player is 115 kg and has an initial 
velocity of —3.50 m/s. What is their velocity just after impact if they 
cling together? 


Exercise: 
Problem: 
What is the speed of a garbage truck that is 1.20 x 104 kg and is 


initially moving at 25.0 m/s just after it hits and adheres to a trash can 
that is 80.0 kg and is initially at rest? 


Solution: 


24.8 m/s 
Exercise: 


Problem: 


During a circus act, an elderly performer thrills the crowd by catching 
a cannon ball shot at him. The cannon ball has a mass of 10.0 kg and 
the horizontal component of its velocity is 8.00 m/s when the 65.0-kg 
performer catches it. If the performer is on nearly frictionless roller 
skates, what is his recoil velocity? 


Exercise: 


Problem: 


(a) During an ice skating performance, an initially motionless 80.0-kg 
clown throws a fake barbell away. The clown’s ice skates allow her to 
recoil frictionlessly. If the clown recoils with a velocity of 0.500 m/s 
and the barbell is thrown with a velocity of 10.0 m/s, what is the mass 
of the barbell? (b) How much kinetic energy is gained by this 
maneuver? (c) Where does the kinetic energy come from? 


Solution: 
(a) 4.00 kg 
(b) 210 J 


(c) The clown does work to throw the barbell, so the kinetic energy 
comes from the muscles of the clown. The muscles convert the 
chemical potential energy of ATP into kinetic energy. 


Glossary 


inelastic collision 
a collision in which internal kinetic energy is not conserved 


perfectly inelastic collision 
a collision in which the colliding objects stick together 


Introduction to Rocket Propulsion 


e State Newton’s third law of motion. 

e Explain the principle involved in propulsion of rockets and jet engines. 

e Derive an expression for the acceleration of the rocket and discuss the 
factors that affect the acceleration. 

e Describe the function of a space shuttle. 


Rockets range in size from fireworks so small that ordinary people use them 
to immense Saturn Vs that once propelled massive payloads toward the 
Moon. The propulsion of all rockets, jet engines, deflating balloons, and 
even squids and octopuses is explained by the same physical principle— 
Newton’s third law of motion. Matter is forcefully ejected from a system, 
producing an equal and opposite reaction on what remains. Another 
common example is the recoil of a gun. The gun exerts a force on a bullet to 
accelerate it and consequently experiences an equal and opposite force, 
causing the gun’s recoil or kick. 


Note: 

Making Connections: Take-Home Experiment—Propulsion of a Balloon 
Hold a balloon and fill it with air. Then, let the balloon go. In which 
direction does the air come out of the balloon and in which direction does 
the balloon get propelled? If you fill the balloon with water and then let the 
balloon go, does the balloon’s direction change? Explain your answer. 


[link] shows a rocket accelerating straight up. In part (a), the rocket has a 
mass m and a velocity v relative to Earth, and hence a momentum mv. In 
part (b), a time At has elapsed in which the rocket has ejected a mass Am 
of hot gas at a velocity v, relative to the rocket. The remainder of the mass 
(m — Am) now has a greater velocity (v + Av). The momentum of the 
entire system (rocket plus expelled gas) has actually decreased because the 
force of gravity has acted for a time At, producing a negative impulse 

Ap = —mgAt. (Remember that impulse is the net external force on a 
system multiplied by the time it acts, and it equals the change in momentum 


of the system.) So, the center of mass of the system is in free fall but, by 
rapidly expelling mass, part of the system can accelerate upward. It is a 
commonly held misconception that the rocket exhaust pushes on the 
ground. If we consider thrust; that is, the force exerted on the rocket by the 
exhaust gases, then a rocket’s thrust is greater in outer space than in the 
atmosphere or on the launch pad. In fact, gases are easier to expel into a 
vacuum. 


By calculating the change in momentum for the entire system over At, and 
equating this change to the impulse, the following expression can be shown 
to be a good approximation for the acceleration of the rocket. 

Equation: 


“The rocket” is that part of the system remaining after the gas is ejected, 
and g is the acceleration due to gravity. 


Note: 

Acceleration of a Rocket 
Acceleration of a rocket is 
Equation: 


ve Am 
a= — —— — 
ay ke 


where a is the acceleration of the rocket, ve is the exhaust velocity, m is 
the mass of the rocket, Am is the mass of the ejected gas, and At is the 
time in which the gas is ejected. 
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(a) (b) 


(a) This rocket has a mass m 
and an upward velocity v. The 
net external force on the system 
is —mg, if air resistance is 
neglected. (b) A time At later 
the system has two main parts, 
the ejected gas and the 
remainder of the rocket. The 
reaction force on the rocket is 
what overcomes the 
gravitational force and 
accelerates it upward. 


A rocket’s acceleration depends on three major factors, consistent with the 
equation for acceleration of a rocket . First, the greater the exhaust velocity 
of the gases relative to the rocket, ve, the greater the acceleration is. The 


practical limit for ve is about 2.5 x 10° m/s for conventional (non-nuclear) 
hot-gas propulsion systems. The second factor is the rate at which mass is 
ejected from the rocket. This is the factor Am/At in the equation. The 
quantity (Am/At)ve, with units of newtons, is called "thrust.” The faster 
the rocket burns its fuel, the greater its thrust, and the greater its 
acceleration. The third factor is the mass m of the rocket. The smaller the 
mass is (all other factors being the same), the greater the acceleration. The 
rocket mass ™m decreases dramatically during flight because most of the 
rocket is fuel to begin with, so that acceleration increases continuously, 
reaching a maximum just before the fuel is exhausted. 


Note: 
Factors Affecting a Rocket’s Acceleration 


e The greater the exhaust velocity v, of the gases relative to the rocket, 
the greater the acceleration. 

e The faster the rocket burns its fuel, the greater its acceleration. 

e The smaller the rocket’s mass (all other factors being the same), the 
greater the acceleration. 


Example: 

Calculating Acceleration: Initial Acceleration of a Moon Launch 

A Saturn V’s mass at liftoff was 2.80 x 10° kg, its fuel-burn rate was 

1.40 x 104 kg/s, and the exhaust velocity was 2.40 x 10° m/s. Calculate 
its initial acceleration. 

Strategy 

This problem is a straightforward application of the expression for 
acceleration because a is the unknown and all of the terms on the right side 
of the equation are given. 

Solution 

Substituting the given values into the equation for acceleration yields 
Equation: 


Ve Am 


eee yaa Tes 
__ 2.40x10? m/s 4 2 
= 2.20 m/s’. 
Discussion 


This value is fairly small, even for an initial acceleration. The acceleration 
does increase steadily as the rocket burns fuel, because m decreases while 
Ve and am remain constant. Knowing this acceleration and the mass of the 


rocket, you can show that the thrust of the engines was 3.36 x 10’ N. 


To achieve the high speeds needed to hop continents, obtain orbit, or escape 
Earth’s gravity altogether, the mass of the rocket other than fuel must be as 
small as possible. It can be shown that, in the absence of air resistance and 
neglecting gravity, the final velocity of a one-stage rocket initially at rest is 
Equation: 


™o 
v=veln ; 
My 


where In(mo/m,) is the natural logarithm of the ratio of the initial mass of 
the rocket (7m) to what is left (m,) after all of the fuel is exhausted. (Note 
that v is actually the change in velocity, so the equation can be used for any 
segment of the flight. If we start from rest, the change in velocity equals the 
final velocity.) For example, let us calculate the mass ratio needed to escape 
Earth’s gravity starting from rest, given that the escape velocity from Earth 
is about 11.2 x 10° m /s, and assuming an exhaust velocity 
Ve = 2.5 x 10° m/s. 
Equation: 

mo v 11.2 x 10° m/s 


In — = — = —— = 4.48 
Mr Ue 2.5 x 10° m/s 


Solving for mo/m;, gives 
Equation: 


Thus, the mass of the rocket is 
Equation: 


mo 
mM, = —. 
88 
This result means that only 1/88 of the mass is left when the fuel is burnt, 
and 87/88 of the initial mass was fuel. Expressed as percentages, 98.9% of 
the rocket is fuel, while payload, engines, fuel tanks, and other components 
make up only 1.10%. Taking air resistance and gravitational force into 
account, the mass m, remaining can only be about mo/180. It is difficult to 
build a rocket in which the fuel has a mass 180 times everything else. The 
solution is multistage rockets. Each stage only needs to achieve part of the 
final velocity and is discarded after it burns its fuel. The result is that each 
successive stage can have smaller engines and more payload relative to its 
fuel. Once out of the atmosphere, the ratio of payload to fuel becomes more 
favorable, too. 


The space shuttle was an attempt at an economical vehicle with some 
reusable parts, such as the solid fuel boosters and the craft itself. (See 
[link]) The shuttle’s need to be operated by humans, however, made it at 
least as costly for launching satellites as expendable, unmanned rockets. 
Ideally, the shuttle would only have been used when human activities were 
required for the success of a mission, such as the repair of the Hubble space 
telescope. Rockets with satellites can also be launched from airplanes. 
Using airplanes has the double advantage that the initial velocity is 
significantly above zero and a rocket can avoid most of the atmosphere’s 
resistance. 


The space shuttle had a 
number of reusable parts. 
Solid fuel boosters on 
either side were 
recovered and refueled 
after each flight, and the 
entire orbiter returned to 
Earth for use in 
subsequent flights. The 
large liquid fuel tank was 
expended. The space 
shuttle was a complex 
assemblage of 
technologies, employing 
both solid and liquid fuel 
and pioneering ceramic 
tiles as reentry heat 
shields. As a result, it 
permitted multiple 
launches as opposed to 


single-use rockets. 
(credit: NASA) 


Note: 

PhET Explorations: Lunar Lander 

Can you avoid the boulder field and land safely, just before your fuel runs 
out, as Neil Armstrong did in 1969? Our version of this classic video game 
accurately simulates the real motion of the lunar lander with the correct 
mass, thrust, fuel consumption rate, and lunar gravity. The real lunar lander 
is very hard to control. 
https://phet.colorado.edu/sims/lunar-lander/lunar-lander_en.html 


Section Summary 


e Newton’s third law of motion states that to every action, there is an 
equal and opposite reaction. 
: : _ Ve Am 
e Acceleration of a rocket isa = — >, —g. 


e A rocket’s acceleration depends on three main factors. They are 


1. The greater the exhaust velocity of the gases, the greater the 
acceleration. 

2. The faster the rocket burns its fuel, the greater its acceleration. 

3. The smaller the rocket's mass, the greater the acceleration. 


Conceptual Questions 


Exercise: 


Problem: Professional Application 


Suppose a fireworks shell explodes, breaking into three large pieces 
for which air resistance is negligible. How is the motion of the center 


of mass affected by the explosion? How would it be affected if the 
pieces experienced significantly more air resistance than the intact 
shell? 


Exercise: 


Problem: Professional Application 


During a visit to the International Space Station, an astronaut was 
positioned motionless in the center of the station, out of reach of any 
solid object on which he could exert a force. Suggest a method by 
which he could move himself away from this position, and explain the 
physics involved. 


Exercise: 
Problem: Professional Application 
It is possible for the velocity of a rocket to be greater than the exhaust 
velocity of the gases it ejects. When that is the case, the gas velocity 
and gas momentum are in the same direction as that of the rocket. How 
is the rocket still able to obtain thrust by ejecting the gases? 


Problems & Exercises 


Exercise: 


Problem: Professional Application 


Antiballistic missiles (ABMs) are designed to have very large 
accelerations so that they may intercept fast-moving incoming missiles 
in the short time available. What is the takeoff acceleration of a 
10,000-kg ABM that expels 196 kg of gas per second at an exhaust 
velocity of 2.50 x 10° m/s? 


Solution: 


39.2 m/s” 


Exercise: 


Problem: Professional Application 


What is the acceleration of a 5000-kg rocket taking off from the Moon, 
where the acceleration due to gravity is only 1.6 m/ s, if the rocket 


expels 8.00 kg of gas per second at an exhaust velocity of 
2.20 x 10° m/s? 


Exercise: 


Problem: Professional Application 


Calculate the increase in velocity of a 4000-kg space probe that expels 
3500 kg of its mass at an exhaust velocity of 2.00 x 10° m/s. You 
may assume the gravitational force is negligible at the probe’s location. 


Solution: 


4.16 x 10° m/s 


Exercise: 


Problem: Professional Application 


Ion-propulsion rockets have been proposed for use in space. They 
employ atomic ionization techniques and nuclear energy sources to 
produce extremely high exhaust velocities, perhaps as great as 

8.00 x 10° m/s. These techniques allow a much more favorable 
payload-to-fuel ratio. To illustrate this fact: (a) Calculate the increase 
in velocity of a 20,000-kg space probe that expels only 40.0-kg of its 
mass at the given exhaust velocity. (b) These engines are usually 
designed to produce a very small thrust for a very long time—the type 
of engine that might be useful on a trip to the outer planets, for 
example. Calculate the acceleration of such an engine if it expels 


4.50 x 10 ° kg /s at the given velocity, assuming the acceleration due 
to gravity is negligible. 


Exercise: 


Problem: Derive the equation for the vertical acceleration of a rocket. 
Solution: 


The force needed to give a small mass Am an acceleration aam is 
F = Amaa,,. To accelerate this mass in the small time interval Az at 
a speed ve requires ve = AAmAt, so F = ve — By Newton’s third 
law, this force is equal in magnitude to the thrust force acting on the 
rocket, so Fiprast = veo, where all quantities are positive. Applying 
Newton’s second law to the root gives 

Ve m 


Finrust — Ng = ma => a = — =? — g, where m is the mass of the 
rocket and unburnt fuel. 


Exercise: 


Problem: Professional Application 


(a) Calculate the maximum rate at which a rocket can expel gases if its 
acceleration cannot exceed seven times that of gravity. The mass of the 
rocket just as it runs out of fuel is 75,000-kg, and its exhaust velocity 

is: 2.40.10" mm /s. Assume that the acceleration of gravity is the same 


as on Earth’s surface (9.80 m/ s’) . (b) Why might it be necessary to 
limit the acceleration of a rocket? 


Exercise: 


Problem: 


Given the following data for a fire extinguisher-toy wagon rocket 
experiment, calculate the average exhaust velocity of the gases 
expelled from the extinguisher. Starting from rest, the final velocity is 
10.0 m/s. The total mass is initially 75.0 kg and is 70.0 kg after the 
extinguisher is fired. 


Exercise: 


Problem: 


How much of a single-stage rocket that is 100,000 kg can be anything 
but fuel if the rocket is to have a final speed of 8.00 km/s, given that 
it expels gases at an exhaust velocity of 2.20 x 10° m /s? 


Solution: 


2.63 x 10° kg 


Exercise: 


Problem: Professional Application 


(a) A 5.00-kg squid initially at rest ejects 0.250-kg of fluid with a 
velocity of 10.0 m/s. What is the recoil velocity of the squid if the 
ejection is done in 0.100 s and there is a 5.00-N frictional force 
opposing the squid’s movement. (b) How much energy is lost to work 
done against friction? 


Solution: 


(a) 0.421 m/s away from the ejected fluid. 
(b) 0.237 J. 


Exercise: 


Problem: Unreasonable Results 


Squids have been reported to jump from the ocean and travel 30.0 m 
(measured horizontally) before re-entering the water. (a) Calculate the 
initial speed of the squid if it leaves the water at an angle of 20.0°, 
assuming negligible lift from the air and negligible air resistance. (b) 
The squid propels itself by squirting water. What fraction of its mass 
would it have to eject in order to achieve the speed found in the 
previous part? The water is ejected at 12.0 m/s; gravitational force 
and friction are neglected. (c) What is unreasonable about the results? 
(d) Which premise is unreasonable, or which premises are 
inconsistent? 


Exercise: 


Problem: Construct Your Own Problem 


Consider an astronaut in deep space cut free from her space ship and 
needing to get back to it. The astronaut has a few packages that she can 
throw away to move herself toward the ship. Construct a problem in 
which you calculate the time it takes her to get back by throwing all 
the packages at one time compared to throwing them one at a time. 
Among the things to be considered are the masses involved, the force 
she can exert on the packages through some distance, and the distance 
to the ship. 


Exercise: 


Problem: Construct Your Own Problem 


Consider an artillery projectile striking armor plating. Construct a 
problem in which you find the force exerted by the projectile on the 
plate. Among the things to be considered are the mass and speed of the 
projectile and the distance over which its speed is reduced. Your 
instructor may also wish for you to consider the relative merits of 
depleted uranium versus lead projectiles based on the greater density 
of uranium. 


Introduction to Temperature, Kinetic Theory, and the Gas Laws 
class="introduction" 


The welder’s 
gloves and 
helmet 
protect him 
from the 
electric arc 
that transfers 
enough 
thermal 
energy to 
melt the rod, 
spray sparks, 
and burn the 
retina of an 
unprotected 
eye. The 
thermal 
energy can 
be felt on 
exposed skin 
a few meters 
away, and its 
light can be 
seen for 
kilometers. 
(credit: 
Kevin S. 
O’Brien/U.S 
. Navy) 


Heat is something familiar to each of us. We feel the warmth of the summer 
Sun, the chill of a clear summer night, the heat of coffee after a winter 
stroll, and the cooling effect of our sweat. Heat transfer is maintained by 
temperature differences. Manifestations of heat transfer—the movement of 
heat energy from one place or material to another—are apparent throughout 
the universe. Heat from beneath Earth’s surface is brought to the surface in 
flows of incandescent lava. The Sun warms Earth’s surface and is the 
source of much of the energy we find on it. Rising levels of atmospheric 
carbon dioxide threaten to trap more of the Sun’s energy, perhaps 
fundamentally altering the ecosphere. In space, supernovas explode, briefly 
radiating more heat than an entire galaxy does. 


What is heat? How do we define it? How is it related to temperature? What 
are heat’s effects? How is it related to other forms of energy and to work? 
We will find that, in spite of the richness of the phenomena, there is a small 
set of underlying physical principles that unite the subjects and tie them to 
other fields. 


In a typical thermometer 
like this one, the alcohol, 
with a red dye, expands 


more rapidly than the 
glass containing it. When 
the thermometer’s 
temperature increases, the 
liquid from the bulb is 
forced into the narrow 
tube, producing a large 
change in the length of 
the column for a small 
change in temperature. 
(credit: Chemical 
Engineer, Wikimedia 
Commons) 


Temperature 


e Define temperature. 

¢ Convert temperatures between the Celsius, Fahrenheit, and Kelvin scales. 
¢ Define thermal equilibrium. 

e State the zeroth law of thermodynamics. 


The concept of temperature has evolved from the common concepts of hot and cold. Human 
perception of what feels hot or cold is a relative one. For example, if you place one hand in 
hot water and the other in cold water, and then place both hands in tepid water, the tepid 
water will feel cool to the hand that was in hot water, and warm to the one that was in cold 
water. The scientific definition of temperature is less ambiguous than your senses of hot and 
cold. Temperature is operationally defined to be what we measure with a thermometer. 
(Many physical quantities are defined solely in terms of how they are measured. We shall see 
later how temperature is related to the kinetic energies of atoms and molecules, a more 
physical explanation.) Two accurate thermometers, one placed in hot water and the other in 
cold water, will show the hot water to have a higher temperature. If they are then placed in 
the tepid water, both will give identical readings (within measurement uncertainties). In this 
section, we discuss temperature, its measurement by thermometers, and its relationship to 
thermal equilibrium. Again, temperature is the quantity measured by a thermometer. 


Note: 

Misconception Alert: Human Perception vs. Reality 

On a cold winter morning, the wood on a porch feels warmer than the metal of your bike. 
The wood and bicycle are in thermal equilibrium with the outside air, and are thus the same 
temperature. They feel different because of the difference in the way that they conduct heat 
away from your skin. The metal conducts heat away from your body faster than the wood 
does (see more about conductivity in Conduction). This is just one example demonstrating 
that the human sense of hot and cold is not determined by temperature alone. 

Another factor that affects our perception of temperature is humidity. Most people feel much 
hotter on hot, humid days than on hot, dry days. This is because on humid days, sweat does 
not evaporate from the skin as efficiently as it does on dry days. It is the evaporation of 
sweat (or water from a sprinkler or pool) that cools us off. 


Any physical property that depends on temperature, and whose response to temperature is 
reproducible, can be used as the basis of a thermometer. Because many physical properties 
depend on temperature, the variety of thermometers is remarkable. For example, volume 
increases with temperature for most substances. This property is the basis for the common 
alcohol thermometer, the old mercury thermometer, and the bimetallic strip ({link]). Other 
properties used to measure temperature include electrical resistance and color, as shown in 
[link], and the emission of infrared radiation, as shown in [link]. 


To TS Ty 


(a) (b) 


The 
curvature of 
a bimetallic 

strip depends 
on 
temperature. 
(a) The strip 
is straight at 
the starting 
temperature, 
where its two 
components 
have the 
same length. 
(b) Ata 
higher 
temperature, 
this strip 
bends to the 
right, 
because the 
metal on the 
left has 
expanded 
more than the 
metal on the 
right. 
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Each of the six squares on 
this plastic (liquid crystal) 


thermometer contains a 
film of a different heat- 
sensitive liquid crystal 
material. Below 95°F, all 
six squares are black. 
When the plastic 
thermometer is exposed 
to temperature that 
increases to 95°F, the 
first liquid crystal square 
changes color. When the 
temperature increases 
above 96.8°F the second 
liquid crystal square also 
changes color, and so 
forth. (credit: Arkrishna, 
Wikimedia Commons) 


Fireman Jason 
Ormand uses a 


pyrometer to 
check the 
temperature of 
an aircraft 
carrier’s 
ventilation 
system. Infrared 
radiation (whose 
emission varies 
with 
temperature) 


from the vent is 
measured and a 
temperature 
readout is 
quickly 
produced. 
Infrared 
measurements 
are also 
frequently used 
as a Measure of 
body 
temperature. 
These modern 
thermometers, 
placed in the ear 
canal, are more 
accurate than 
alcohol 
thermometers 
placed under the 
tongue or in the 
armpit. (credit: 
Lamel J. 
Hinton/U.S. 
Navy) 


Temperature Scales 


Thermometers are used to measure temperature according to well-defined scales of 
measurement, which use pre-defined reference points to help compare quantities. The three 
most common temperature scales are the Fahrenheit, Celsius, and Kelvin scales. A 
temperature scale can be created by identifying two easily reproducible temperatures. The 
freezing and boiling temperatures of water at standard atmospheric pressure are commonly 
used. 


The Celsius scale (which replaced the slightly different centigrade scale) has the freezing 
point of water at 0°C and the boiling point at 100°C. Its unit is the degree Celsius(°C). On 
the Fahrenheit scale (still the most frequently used in the United States), the freezing point 
of water is at 32°F and the boiling point is at 212°F. The unit of temperature on this scale is 
the degree Fahrenheit(°F'). Note that a temperature difference of one degree Celsius is 
greater than a temperature difference of one degree Fahrenheit. Only 100 Celsius degrees 


span the same range as 180 Fahrenheit degrees, thus one degree on the Celsius scale is 1.8 
times larger than one degree on the Fahrenheit scale 180/100 = 9/5. 


The Kelvin scale is the temperature scale that is commonly used in science. It is an absolute 
temperature scale defined to have 0 K at the lowest possible temperature, called absolute 
zero. The official temperature unit on this scale is the kelvin, which is abbreviated K, and is 
not accompanied by a degree sign. The freezing and boiling points of water are 273.15 K and 
373.15 K, respectively. Thus, the magnitude of temperature differences is the same in units 
of kelvins and degrees Celsius. Unlike other temperature scales, the Kelvin scale is an 
absolute scale. It is used extensively in scientific work because a number of physical 
quantities, such as the volume of an ideal gas, are directly related to absolute temperature. 
The kelvin is the SI unit used in scientific work. 
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Relationships between the Fahrenheit, Celsius, 

and Kelvin temperature scales, rounded to the 

nearest degree. The relative sizes of the scales 
are also shown. 


The relationships between the three common temperature scales is shown in [link]. 
Temperatures on these scales can be converted using the equations in [link]. 


To 
convert 
from... Use this equation ... Also written as... 


To 


convert 

from... Use this equation ... Also written as... 
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Temperature Conversions 


Notice that the conversions between Fahrenheit and Kelvin look quite complicated. In fact, 
they are simple combinations of the conversions between Fahrenheit and Celsius, and the 
conversions between Celsius and Kelvin. 


Example: 

Converting between Temperature Scales: Room Temperature 

“Room temperature” is generally defined to be 25°C. (a) What is room temperature in °F? 
(b) What is it in K? 

Strategy 

To answer these questions, all we need to do is choose the correct conversion equations and 
plug in the known values. 


Solution for (a) 
1. Choose the right equation. To convert from °C to °F, use the equation 
Equation: 


9 


2. Plug the known value into the equation and solve: 
Equation: 


) 
Top = Bene 32 — ik, 


Solution for (b) 
1. Choose the right equation. To convert from °C to K, use the equation 
Equation: 


T= Tg 273, 15. 


2. Plug the known value into the equation and solve: 
Equation: 


Tk = 25°C + 273.15 = 298K. 


Example: 

Converting between Temperature Scales: the Reaumur Scale 

The Reaumur scale is a temperature scale that was used widely in Europe in the 18th and 
19th centuries. On the Reaumur temperature scale, the freezing point of water is 0°R and the 
boiling temperature is 80°R. If “room temperature” is 25°C on the Celsius scale, what is it 
on the Reaumur scale? 

Strategy 

To answer this question, we must compare the Reaumur scale to the Celsius scale. The 
difference between the freezing point and boiling point of water on the Reaumur scale is 
80°R. On the Celsius scale it is 100°C. Therefore 100°C = 80°R. Both scales start at 0° for 
freezing, so we can derive a simple formula to convert between temperatures on the two 
scales. 


Solution 
1. Derive a formula to convert from one scale to the other: 
Equation: 
0.8°R 
Tor — x Too. 
2 e: 


2. Plug the known value into the equation and solve: 


Equation: 
_ 0.8°R 
= a 


ies x 25°C = 20°R. 


Temperature Ranges in the Universe 


[link] shows the wide range of temperatures found in the universe. Human beings have been 
known to survive with body temperatures within a small range, from 24°C to 44°C (75°F to 
111°F). The average normal body temperature is usually given as 37.0°C (98.6°F), and 

variations in this temperature can indicate a medical condition: a fever, an infection, a tumor, 


or circulatory problems (see [link]). 


: . 
(ERIE vrecmograms @ standard 8° C color range 


This image of radiation 
from a person’s body (an 
infrared thermograph) 
shows the location of 
temperature abnormalities 
in the upper body. Dark 
blue corresponds to cold 
areas and red to white 
corresponds to hot areas. 
An elevated temperature 
might be an indication of 
malignant tissue (a 
cancerous tumor in the 
breast, for example), while 
a depressed temperature 


might be due to a decline in 
blood flow from a clot. In 
this case, the abnormalities 
are caused by a condition 
called hyperhidrosis. 
(credit: Porcelina81, 
Wikimedia Commons) 


The lowest temperatures ever recorded have been measured during laboratory experiments: 
4.5 x 10°!° K at the Massachusetts Institute of Technology (USA), and 1.0 x 10°!° K at 
Helsinki University of Technology (Finland). In comparison, the coldest recorded place on 
Earth’s surface is Vostok, Antarctica at 183 K (—89°C), and the coldest place (outside the 
lab) known in the universe is the Boomerang Nebula, with a temperature of 1 K. 


1012 experiments at the Relativistic 
Heavy Ion Collider (RHIC) 


10° Interior neutron star 


108 Rapid hydrogen fusion 


107 Solar interior 

106 Solar corona 
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Center of Earth 

104 
Solar surface 

103 Fireplace fire 
Water boils 

5 Water freezes 
10 Vostok, Antarctica 


Liquid nitrogen 


Liquid helium 
10° Boomerang Nebula 


temperarure, T (K) 
= 


10-10 Lowest temperature 
achieved 


Each increment on 
this logarithmic 
scale indicates an 
increase by a factor 
of ten, and thus 
illustrates the 
tremendous range 
of temperatures in 
nature. Note that 
zero ona 
logarithmic scale 
would occur off the 
bottom of the page 

at infinity. 


Note: 
Making Connections: Absolute Zero 


What is absolute zero? Absolute zero is the temperature at which all molecular motion has 
ceased. The concept of absolute zero arises from the behavior of gases. [link] shows how the 
pressure of gases at a constant volume decreases as temperature decreases. Various scientists 
have noted that the pressures of gases extrapolate to zero at the same temperature, 
—273.15°C. This extrapolation implies that there is a lowest temperature. This temperature 
is called absolute zero. Today we know that most gases first liquefy and then freeze, and it is 


not actually possible to reach absolute zero. The numerical value of absolute zero 
temperature is —-273.15°C or 0 K. 


Pressure, P 


—200—-100 0 100 
Ff ENS) 


Temperature, 7(°C) 


Graph of pressure versus 
temperature for various 


gases kept at a constant 
volume. Note that all of 
the graphs extrapolate to 
zero pressure at the same 
temperature. 


Thermal Equilibrium and the Zeroth Law of Thermodynamics 


Thermometers actually take their own temperature, not the temperature of the object they are 
measuring. This raises the question of how we can be certain that a thermometer measures 
the temperature of the object with which it is in contact. It is based on the fact that any two 
systems placed in thermal contact (meaning heat transfer can occur between them) will reach 
the same temperature. That is, heat will flow from the hotter object to the cooler one until 
they have exactly the same temperature. The objects are then in thermal equilibrium, and 
no further changes will occur. The systems interact and change because their temperatures 
differ, and the changes stop once their temperatures are the same. Thus, if enough time is 
allowed for this transfer of heat to run its course, the temperature a thermometer registers 
does represent the system with which it is in thermal equilibrium. Thermal equilibrium is 
established when two bodies are in contact with each other and can freely exchange energy. 


Furthermore, experimentation has shown that if two systems, A and B, are in thermal 
equilibrium with each another, and B is in thermal equilibrium with a third system C, then A 
is also in thermal equilibrium with C. This conclusion may seem obvious, because all three 
have the same temperature, but it is basic to thermodynamics. It is called the zeroth law of 
thermodynamics. 


Note: 

The Zeroth Law of Thermodynamics 

If two systems, A and B, are in thermal equilibrium with each other, and B is in thermal 
equilibrium with a third system, C, then A is also in thermal equilibrium with C. 


This law was postulated in the 1930s, after the first and second laws of thermodynamics had 
been developed and named. It is called the zeroth law because it comes logically before the 
first and second laws (discussed in Thermodynamics). An example of this law in action is 
seen in babies in incubators: babies in incubators normally have very few clothes on, so to an 
observer they look as if they may not be warm enough. However, the temperature of the air, 
the cot, and the baby is the same, because they are in thermal equilibrium, which is 
accomplished by maintaining air temperature to keep the baby comfortable. 


Exercise: 
Check Your Understanding 


Problem: Does the temperature of a body depend on its size? 


Solution: 


No, the system can be divided into smaller parts each of which is at the same 
temperature. We say that the temperature is an intensive quantity. Intensive quantities 


are independent of size. 


Section Summary 


¢ Temperature is the quantity measured by a thermometer. 

e Temperature is related to the average kinetic energy of atoms and molecules in a system. 
e Absolute zero is the temperature at which there is no molecular motion. 

e There are three main temperature scales: Celsius, Fahrenheit, and Kelvin. 

¢ Temperatures on one scale can be converted to temperatures on another scale using the 


following equations: 
Equation: 


Equation: 


Equation: 


Equation: 


Tk = Tes OTS 15 


[ee we ees 


e Systems are in thermal equilibrium when they have the same temperature. 
e Thermal equilibrium occurs when two bodies are in contact with each other and can 


freely exchange energy. 


The zeroth law of thermodynamics states that when two systems, A and B, are in 


thermal equilibrium with each other, and B is in thermal equilibrium with a third 
system, C, then A is also in thermal equilibrium with C. 


Conceptual Questions 


Exercise: 


Problem: What does it mean to say that two systems are in thermal equilibrium? 
Exercise: 
Problem: 
Give an example of a physical property that varies with temperature and describe how it 
is used to measure temperature. 
Exercise: 
Problem: 
When a cold alcohol thermometer is placed in a hot liquid, the column of alcohol goes 
down slightly before going up. Explain why. 
Exercise: 
Problem: 
If you add boiling water to a cup at room temperature, what would you expect the final 


equilibrium temperature of the unit to be? You will need to include the surroundings as 
part of the system. Consider the zeroth law of thermodynamics. 


Problems & Exercises 


Exercise: 


Problem: What is the Fahrenheit temperature of a person with a 39.0°C fever? 


Solution: 
102°F 
Exercise: 


Problem: 


Frost damage to most plants occurs at temperatures of 28.0°F or lower. What is this 
temperature on the Kelvin scale? 


Exercise: 


Problem: 


To conserve energy, room temperatures are kept at 68.0°F in the winter and 78.0°F in 
the summer. What are these temperatures on the Celsius scale? 


Solution: 


20.0°C and 25.6°C 
Exercise: 
Problem: 
A tungsten light bulb filament may operate at 2900 K. What is its Fahrenheit 
temperature? What is this on the Celsius scale? 
Exercise: 
Problem: 


The surface temperature of the Sun is about 5750 K. What is this temperature on the 
Fahrenheit scale? 


Solution: 


9890°F 
Exercise: 
Problem: 
One of the hottest temperatures ever recorded on the surface of Earth was 134°F in 


Death Valley, CA. What is this temperature in Celsius degrees? What is this temperature 
in Kelvin? 


Exercise: 
Problem: 
(a) Suppose a cold front blows into your locale and drops the temperature by 40.0 
Fahrenheit degrees. How many degrees Celsius does the temperature decrease when 


there is a 40.0°F decrease in temperature? (b) Show that any change in temperature in 
Fahrenheit degrees is nine-fifths the change in Celsius degrees. 


Solution: 


(a) 22.2°C 


AT(*F) = T2(°F) — TiCF) 
(b) = 272(°C) + 32.0°— (27,(°C) + 32.0°) 
= #(%(C) —T,(C)) = ATCC) 
Exercise: 


Problem: 


(a) At what temperature do the Fahrenheit and Celsius scales have the same numerical 
value? (b) At what temperature do the Fahrenheit and Kelvin scales have the same 
numerical value? 


Glossary 


temperature 
the quantity measured by a thermometer 


Celsius scale 
temperature scale in which the freezing point of water is 0°C and the boiling point of 
water is 100°C 


degree Celsius 
unit on the Celsius temperature scale 


Fahrenheit scale 
temperature scale in which the freezing point of water is 32°F and the boiling point of 
water is 212°F 


degree Fahrenheit 
unit on the Fahrenheit temperature scale 


Kelvin scale 
temperature scale in which 0 K is the lowest possible temperature, representing absolute 
Zero 


absolute zero 
the lowest possible temperature; the temperature at which all molecular motion ceases 


thermal equilibrium 
the condition in which heat no longer flows between two objects that are in contact; the 
two objects have the same temperature 


zeroth law of thermodynamics 
law that states that if two objects are in thermal equilibrium, and a third object is in 
thermal equilibrium with one of those objects, it is also in thermal equilibrium with the 
other object 


Introduction to Heat and Heat Transfer Methods 
class="introduction" 


(a) The 
chilling effect 
of a clear 
breezy night 
is produced 
by the wind 
and by 
radiative heat 
transfer to 
cold outer 
space. (b) 
There was 
once great 
controversy 
about the 
Earth’s age, 
but it is now 
generally 
accepted to 
be about 4.5 
billion years 
old. Much of 
the debate is 
centered on 
the Earth’s 
molten 
interior. 
According to 
our 
understandin 
g of heat 
transfer, if the 
Earth is really 
that old, its 


center should 
have cooled 
off long ago. 
The 
discovery of 
radioactivity 
in rocks 
revealed the 
source of 
energy that 
keeps the 
Earth’s 
interior 
molten, 
despite heat 
transfer to the 
surface, and 
from there to 
cold outer 
space. 


(b) 


Energy can exist in many forms and heat is one of the most intriguing. Heat 
is often hidden, as it only exists when in transit, and is transferred by a 
number of distinctly different methods. Heat transfer touches every aspect 
of our lives and helps us understand how the universe functions. It explains 
the chill we feel on a clear breezy night, or why Earth’s core has yet to cool. 
This chapter defines and explores heat transfer, its effects, and the methods 
by which heat is transferred. These topics are fundamental, as well as 
practical, and will often be referred to in the chapters ahead. 


Heat 
¢ Define heat as transfer of energy. 


In Work, Energy, and Energy Resources, we defined work as force times 
distance and learned that work done on an object changes its kinetic energy. 
We also saw in Temperature, Kinetic Theory, and the Gas Laws that 
temperature is proportional to the (average) kinetic energy of atoms and 
molecules. We say that a thermal system has a certain internal energy: its 
internal energy is higher if the temperature is higher. If two objects at 
different temperatures are brought in contact with each other, energy is 
transferred from the hotter to the colder object until equilibrium is reached 
and the bodies reach thermal equilibrium (i.e., they are at the same 
temperature). No work is done by either object, because no force acts 
through a distance. The transfer of energy is caused by the temperature 
difference, and ceases once the temperatures are equal. These observations 
lead to the following definition of heat: Heat is the spontaneous transfer of 
energy due to a temperature difference. 


As noted in Temperature, Kinetic Theory, and the Gas Laws, heat is often 
confused with temperature. For example, we may say the heat was 
unbearable, when we actually mean that the temperature was high. Heat is a 
form of energy, whereas temperature is not. The misconception arises 
because we are sensitive to the flow of heat, rather than the temperature. 


Owing to the fact that heat is a form of energy, it has the SI unit of joule (J). 
The calorie (cal) is a common unit of energy, defined as the energy needed 
to change the temperature of 1.00 g of water by 1.00°C —specifically, 
between 14.5°C and 15.5°C, since there is a slight temperature dependence. 
Perhaps the most common unit of heat is the kilocalorie (kcal), which is the 
energy needed to change the temperature of 1.00 kg of water by 1.00°C. 
Since mass is most often specified in kilograms, kilocalorie is commonly 
used. Food calories (given the notation Cal, and sometimes called “big 
calorie”) are actually kilocalories (1 kilocalorie = 1000 calories), a fact 
not easily determined from package labeling. 


In figure (a) the soft drink 
and the ice have different 
temperatures, 7; and 75, 
and are not in thermal 
equilibrium. In figure (b), 
when the soft drink and 
ice are allowed to 
interact, energy is 
transferred until they 
reach the same 
temperature T’, achieving 
equilibrium. Heat transfer 
occurs due to the 
difference in 
temperatures. In fact, 
since the soft drink and 
ice are both in contact 
with the surrounding air 
and bench, the 
equilibrium temperature 
will be the same for both. 


Mechanical Equivalent of Heat 


It is also possible to change the temperature of a substance by doing work. 
Work can transfer energy into or out of a system. This realization helped 
establish the fact that heat is a form of energy. James Prescott Joule (1818— 
1889) performed many experiments to establish the mechanical equivalent 
of heat—the work needed to produce the same effects as heat transfer. In 
terms of the units used for these two terms, the best modern value for this 
equivalence is 

Equation: 


1.000 kcal = 4186 J. 


We consider this equation as the conversion between two different units of 
energy. 


Measured Pr 
height of | 
descent 


Schematic depiction of Joule’s 
experiment that established the 
equivalence of heat and work. 


The figure above shows one of Joule’s most famous experimental setups for 
demonstrating the mechanical equivalent of heat. It demonstrated that work 
and heat can produce the same effects, and helped establish the principle of 
conservation of energy. Gravitational potential energy (PE) (work done by 
the gravitational force) is converted into kinetic energy (KE), and then 
randomized by viscosity and turbulence into increased average kinetic 
energy of atoms and molecules in the system, producing a temperature 
increase. His contributions to the field of thermodynamics were so 
significant that the SI unit of energy was named after him. 


Heat added or removed from a system changes its internal energy and thus 
its temperature. Such a temperature increase is observed while cooking. 
However, adding heat does not necessarily increase the temperature. An 
example is melting of ice; that is, when a substance changes from one phase 
to another. Work done on the system or by the system can also change the 
internal energy of the system. Joule demonstrated that the temperature of a 
system can be increased by stirring. If an ice cube is rubbed against a rough 
surface, work is done by the frictional force. A system has a well-defined 
internal energy, but we cannot say that it has a certain “heat content” or 
“work content”. We use the phrase “heat transfer” to emphasize its nature. 
Exercise: 

Check Your Understanding 


Problem: 


Two samples (A and B) of the same substance are kept in a lab. 
Someone adds 10 kilojoules (kJ) of heat to one sample, while 10 kJ of 
work is done on the other sample. How can you tell to which sample 
the heat was added? 


Solution: 


Heat and work both change the internal energy of the substance. 
However, the properties of the sample only depend on the internal 
energy so that it is impossible to tell whether heat was added to sample 
A or B. 


Summary 


e Heat and work are the two distinct methods of energy transfer. 

e Heat is energy transferred solely due to a temperature difference. 

e Any energy unit can be used for heat transfer, and the most common 
are kilocalorie (kcal) and joule (J). 

e Kilocalorie is defined to be the energy needed to change the 
temperature of 1.00 kg of water between 14.5°C and 15.5°C. 

e The mechanical equivalent of this heat transfer is 
1.00 kcal = 4186 J. 


Conceptual Questions 


Exercise: 


Problem: How is heat transfer related to temperature? 
Exercise: 
Problem: 
Describe a situation in which heat transfer occurs. What are the 
resulting forms of energy? 
Exercise: 
Problem: 


When heat transfers into a system, is the energy stored as heat? 
Explain briefly. 


Glossary 


heat 
the spontaneous transfer of energy due to a temperature difference 


kilocalorie 
1 kilocalorie = 1000 calories 


mechanical equivalent of heat 
the work needed to produce the same effects as heat transfer 


Temperature Change and Heat Capacity 


¢ Observe heat transfer and change in temperature and mass. 
e Calculate final temperature after heat transfer between two objects. 


One of the major effects of heat transfer is temperature change: heating increases the 
temperature while cooling decreases it. We assume that there is no phase change and that 
no work is done on or by the system. Experiments show that the transferred heat depends 
on three factors—the change in temperature, the mass of the system, and the substance 


and phase of the substance. 
a 
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The heat Q transferred to 
cause a temperature 
change depends on the 
magnitude of the 
temperature change, the 
mass of the system, and 
the substance and phase 
involved. (a) The amount 
of heat transferred is 
directly proportional to 
the temperature change. 
To double the 
temperature change of a 
mass m, you need to add 
twice the heat. (b) The 
amount of heat 
transferred is also directly 
proportional to the mass. 
To cause an equivalent 
temperature change in a 


doubled mass, you need 
to add twice the heat. (c) 
The amount of heat 
transferred depends on 
the substance and its 
phase. If it takes an 
amount Q of heat to 
cause a temperature 
change AT in a given 
mass of copper, it will 
take 10.8 times that 
amount of heat to cause 
the equivalent 
temperature change in the 
same mass of water 
assuming no phase 
change in either 
substance. 


The dependence on temperature change and mass are easily understood. Owing to the 
fact that the (average) kinetic energy of an atom or molecule is proportional to the 
absolute temperature, the internal energy of a system is proportional to the absolute 
temperature and the number of atoms or molecules. Owing to the fact that the transferred 
heat is equal to the change in the internal energy, the heat is proportional to the mass of 
the substance and the temperature change. The transferred heat also depends on the 
substance so that, for example, the heat necessary to raise the temperature is less for 
alcohol than for water. For the same substance, the transferred heat also depends on the 
phase (gas, liquid, or solid). 


Note: 

Heat Transfer and Temperature Change 

The quantitative relationship between heat transfer and temperature change contains all 
three factors: 

Equation: 


Q=mcAT, 


where Q is the symbol for heat transfer, m is the mass of the substance, and AT is the 
change in temperature. The symbol c stands for specific heat and depends on the 
material and phase. The specific heat is the amount of heat necessary to change the 


temperature of 1.00 kg of mass by 1.00°C. The specific heat c is a property of the 
substance; its SI unit is J/(kg - K) or J/(kg -°C). Recall that the temperature change 
(AT) is the same in units of kelvin and degrees Celsius. If heat transfer is measured in 
kilocalories, then the unit of specific heat is kcal/(kg -°C). 


Values of specific heat must generally be looked up in tables, because there is no simple 
way to calculate them. In general, the specific heat also depends on the temperature. 
[link] lists representative values of specific heat for various substances. Except for gases, 
the temperature and volume dependence of the specific heat of most substances is weak. 
We see from this table that the specific heat of water is five times that of glass and ten 
times that of iron, which means that it takes five times as much heat to raise the 
temperature of water the same amount as for glass and ten times as much heat to raise the 
temperature of water as for iron. In fact, water has one of the largest specific heats of any 
material, which is important for sustaining life on Earth. 


Example: 

Calculating the Required Heat: Heating Water in an Aluminum Pan 

A 0.500 kg aluminum pan on a stove is used to heat 0.250 liters of water from 20.0°C to 
80.0°C. (a) How much heat is required? What percentage of the heat is used to raise the 
temperature of (b) the pan and (c) the water? 

Strategy 

The pan and the water are always at the same temperature. When you put the pan on the 
stove, the temperature of the water and the pan is increased by the same amount. We use 
the equation for the heat transfer for the given temperature change and mass of water and 
aluminum. The specific heat values for water and aluminum are given in [link]. 
Solution 

Because water is in thermal contact with the aluminum, the pan and the water are at the 
same temperature. 


1. Calculate the temperature difference: 
Equation: 


AT = ] — T; = 60.0°C. 


2. Calculate the mass of water. Because the density of water is 1000 kg/ m’, one liter 
of water has a mass of 1 kg, and the mass of 0.250 liters of water is 
Mwy = 0.250 kg. 

3. Calculate the heat transferred to the water. Use the specific heat of water in [link]: 
Equation: 


OF nt eg AF S050 ke) (4186) J/keeC) (600°C) 62s ks 
4. Calculate the heat transferred to the aluminum. Use the specific heat for aluminum 
in [link]: 
Equation: 


Quy = Macy AT = (0.500 kg) (900 J/kg°C)(60.0°C)= 27.0 x 1043 = 27.0 kJ. 


5. Compare the percentage of heat going into the pan versus that going into the water. 
First, find the total transferred heat: 
Equation: 


QTota = Ow + Qa = 62.8 kJ + 27.0 kJ = 89.8 kJ. 


Thus, the amount of heat going into heating the pan is 


Equation: 
27.0 kJ 
100% = 30.1%, 
398kJ : 
and the amount going into heating the water is 
Equation: 
62.8 kJ 
x 100% = 69.9%. 
89.8kJ : ° 
Discussion 


In this example, the heat transferred to the container is a significant fraction of the total 
transferred heat. Although the mass of the pan is twice that of the water, the specific heat 
of water is over four times greater than that of aluminum. Therefore, it takes a bit more 
than twice the heat to achieve the given temperature change for the water as compared to 
the aluminum pan. 


The smoking brakes on this 
truck are a visible evidence of 
the mechanical equivalent of 
heat. 


Example: 

Calculating the Temperature Increase from the Work Done on a Substance: Truck 
Brakes Overheat on Downhill Runs 

Truck brakes used to control speed on a downhill run do work, converting gravitational 
potential energy into increased internal energy (higher temperature) of the brake 
material. This conversion prevents the gravitational potential energy from being 
converted into kinetic energy of the truck. The problem is that the mass of the truck is 
large compared with that of the brake material absorbing the energy, and the temperature 
increase may occur too fast for sufficient heat to transfer from the brakes to the 
environment. 

Calculate the temperature increase of 100 kg of brake material with an average specific 
heat of 800 J/kg - °C if the material retains 10% of the energy from a 10,000-kg truck 
descending 75.0 m (in vertical displacement) at a constant speed. 

Strategy 

If the brakes are not applied, gravitational potential energy is converted into kinetic 
energy. When brakes are applied, gravitational potential energy is converted into internal 
energy of the brake material. We first calculate the gravitational potential energy (Mgh) 
that the entire truck loses in its descent and then find the temperature increase produced 
in the brake material alone. 

Solution 


1. Calculate the change in gravitational potential energy as the truck goes downhill 
Equation: 


Meh = (10,000 kg) (9.80 m/s”) (75.0 m) = 7.35 x 10° J. 


N 


Equation: 


. Calculate the temperature from the heat transferred using Q=Mgh and 


where ™ is the mass of the brake material. Insert the values m = 100 kg and 


c = 800 J/kg - °C to find 


Equation: 


Discussion 


(isos 10) 
(100 kg)(800 J/kg°C) 


— ee 


This same idea underlies the recent hybrid technology of cars, where mechanical energy 
(gravitational potential energy) is converted by the brakes into electrical energy 


(battery). 


Substances 


Solids 


Aluminum 

Asbestos 

Concrete, granite (average) 
Copper 


Glass 


Specific heat (c) 
kcal/kg-°C[ footnote] 
Ikg-°C These values are 


900 


800 


840 


387 


840 


identical in units of 
cal/g -°C. 


0.215 
0.19 
0.20 
0.0924 


0.20 


Substances 

Gold 

Human body (average at 37 °C) 

Ice (average, -50°C to 0°C) 

Iron, steel 

Lead 

Silver 

Wood 

Liquids 

Benzene 

Ethanol 

Glycerin 

Mercury 

Water (15.0 °C) 

Gases [footnote] 

Cy at constant volume and at 20.0°C, except 
as noted, and at 1.00 atm average pressure. 


Values in parentheses are c,, at a constant 
pressure of 1.00 atm. 


Air (dry) 


Ammonia 


Carbon dioxide 


Specific heat (c) 


129 
3500 
2090 
452 
128 
235 


1700 


1740 
2450 
2410 
139 


4186 


721 
(1015) 


1670 
(2190) 


638 
(833) 


0.0308 
0.83 
0.50 
0.108 
0.0305 
0.0562 


0.4 


0.415 
0.586 
0.576 
0.0333 


1.000 


0.172 (0.242) 


0.399 (0.523) 


0.152 (0.199) 


Substances Specific heat (c) 


Nitrogen io 40) 0.177 (0.248) 
Oxygen a i 0.156 (0.218) 
Steam (100°C) on 0.363 (0.482) 


Specific Heats[footnote| of Various Substances 
The values for solids and liquids are at constant volume and at 25°C, except as noted. 


Note that [link] is an illustration of the mechanical equivalent of heat. Alternatively, the 
temperature increase could be produced by a blow torch instead of mechanically. 


Example: 

Calculating the Final Temperature When Heat Is Transferred Between Two Bodies: 
Pouring Cold Water in a Hot Pan 

Suppose you pour 0.250 kg of 20.0°C water (about a cup) into a 0.500-kg aluminum pan 
off the stove with a temperature of 150°C. Assume that the pan is placed on an insulated 
pad and that a negligible amount of water boils off. What is the temperature when the 
water and pan reach thermal equilibrium a short time later? 

Strategy 

The pan is placed on an insulated pad so that little heat transfer occurs with the 
surroundings. Originally the pan and water are not in thermal equilibrium: the pan is at a 
higher temperature than the water. Heat transfer then restores thermal equilibrium once 
the water and pan are in contact. Because heat transfer between the pan and water takes 
place rapidly, the mass of evaporated water is negligible and the magnitude of the heat 
lost by the pan is equal to the heat gained by the water. The exchange of heat stops once 
a thermal equilibrium between the pan and the water is achieved. The heat exchange can 
be written as | Qhot |= Qeold- 

Solution 


1. Use the equation for heat transfer Q = mcAT to express the heat lost by the 
aluminum pan in terms of the mass of the pan, the specific heat of aluminum, the 
initial temperature of the pan, and the final temperature: 

Equation: 


Qhot = Marca (Ts — 150°C). 


2. Express the heat gained by the water in terms of the mass of the water, the specific 
heat of water, the initial temperature of the water and the final temperature: 
Equation: 


Qeold = my cw(T¢—20.0°C). 


3. Note that hot < 0 and Q¢oiq > O and that they must sum to zero because the heat 
lost by the hot pan must be the same as the heat gained by the cold water: 
Equation: 


Cieeticn he Es 0, 
eid = eis 
mwycw(T¢ — 20.0°C) = —majca)(Te — 150°C.) 


4. This an equation for the unknown final temperature, 7+ 

5. Bring all terms involving 7; on the left hand side and all other terms on the right 
hand side. Solve for T¢, 
Equation: 


rT MajCaj (150°C) + mwew(20.0°C) 
a hE Aa SSN 
MaiCal + Mwew 


and insert the numerical values: 


Equation: 
oe (0.500 kg)(900 J/kg°C)(150°C) +-(0.250 kg) (4186 J/kg°C) (20.0°C) 
(0.500 kg)(900 J/kg°C)+ (0.250 kg)(4186 J/kg°C) 
88430 J 
1496.5 J/°C 
= 5O01°C:. 
Discussion 


This is a typical calorimetry problem—two bodies at different temperatures are brought 
in contact with each other and exchange heat until a common temperature is reached. 
Why is the final temperature so much closer to 20.0°C than 150°C? The reason is that 
water has a greater specific heat than most common substances and thus undergoes a 
small temperature change for a given heat transfer. A large body of water, such as a lake, 
requires a large amount of heat to increase its temperature appreciably. This explains 
why the temperature of a lake stays relatively constant during a day even when the 
temperature change of the air is large. However, the water temperature does change over 
longer times (e.g., summer to winter). 


Note: 

Take-Home Experiment: Temperature Change of Land and Water 
What heats faster, land or water? 

To study differences in heat capacity: 


e Place equal masses of dry sand (or soil) and water at the same temperature into two 
small jars. (The average density of soil or sand is about 1.6 times that of water, so 
you can achieve approximately equal masses by using 50% more water by volume.) 

e Heat both (using an oven or a heat lamp) for the same amount of time. 

e Record the final temperature of the two masses. 

¢ Now bring both jars to the same temperature by heating for a longer period of time. 

e Remove the jars from the heat source and measure their temperature every 5 
minutes for about 30 minutes. 


Which sample cools off the fastest? This activity replicates the phenomena responsible 
for land breezes and sea breezes. 


Exercise: 
Check Your Understanding 


Problem: 


If 25 kJ is necessary to raise the temperature of a block from 25°C to 30°C, how 
much heat is necessary to heat the block from 45°C to 50°C? 


Solution: 


The heat transfer depends only on the temperature difference. Since the temperature 
differences are the same in both cases, the same 25 kJ is necessary in the second 
case. 

Summary 
e The transfer of heat Q that leads to a change AT in the temperature of a body with 


mass m is Q = mcAT, where c is the specific heat of the material. This 
relationship can also be considered as the definition of specific heat. 


Conceptual Questions 


Exercise: 


Problem: 


What three factors affect the heat transfer that is necessary to change an object’s 
temperature? 

Exercise: 
Problem: 
The brakes in a car increase in temperature by AT when bringing the car to rest 
from a speed v. How much greater would AT be if the car initially had twice the 


speed? You may assume the car to stop sufficiently fast so that no heat transfers out 
of the brakes. 


Problems & Exercises 


Exercise: 


Problem: 


On a hot day, the temperature of an 80,000-L swimming pool increases by 1.50°C. 
What is the net heat transfer during this heating? Ignore any complications, such as 
loss of water by evaporation. 


Solution: 
Equation: 


5.02 x 10° J 


Exercise: 


Problem:Show that 1 cal/g -°C = 1 kcal/kg-°C. 
Exercise: 


Problem: 


To sterilize a 50.0-g glass baby bottle, we must raise its temperature from 22.0°C to 
95.0°C. How much heat transfer is required? 


Solution: 
Equation: 


3.07 x 10° J 


Exercise: 
Problem: 
The same heat transfer into identical masses of different substances produces 
different temperature changes. Calculate the final temperature when 1.00 kcal of 


heat transfers into 1.00 kg of the following, originally at 20.0°C: (a) water; (b) 
concrete; (c) steel; and (d) mercury. 


Exercise: 
Problem: 
Rubbing your hands together warms them by converting work into thermal energy. 
If a woman rubs her hands back and forth for a total of 20 rubs, at a distance of 7.50 
cm per rub, and with an average frictional force of 40.0 N, what is the temperature 


increase? The mass of tissues warmed is only 0.100 kg, mostly in the palms and 
fingers. 


Solution: 
Equation: 


0.171°C 


Exercise: 
Problem: 
A 0.250-kg block of a pure material is heated from 20.0°C to 65.0°C by the addition 


of 4.35 kJ of energy. Calculate its specific heat and identify the substance of which it 
is most likely composed. 


Exercise: 
Problem: 
Suppose identical amounts of heat transfer into different masses of copper and 


water, causing identical changes in temperature. What is the ratio of the mass of 
copper to water? 


Solution: 


10.8 


Exercise: 


Problem: 


(a) The number of kilocalories in food is determined by calorimetry techniques in 
which the food is burned and the amount of heat transfer is measured. How many 
kilocalories per gram are there in a 5.00-g peanut if the energy from burning it is 
transferred to 0.500 kg of water held in a 0.100-kg aluminum cup, causing a 54.9°C 
temperature increase? (b) Compare your answer to labeling information found on a 
package of peanuts and comment on whether the values are consistent. 


Exercise: 


Problem: 


Following vigorous exercise, the body temperature of an 80.0-kg person is 40.0°C. 

At what rate in watts must the person transfer thermal energy to reduce the the body 
temperature to 37.0°C in 30.0 min, assuming the body continues to produce energy 
at the rate of 150 W? (1 watt = 1 joule/second or 1 W=1J/s). 


Solution: 


617 W 
Exercise: 


Problem: 


Even when shut down after a period of normal use, a large commercial nuclear 
reactor transfers thermal energy at the rate of 150 MW by the radioactive decay of 
fission products. This heat transfer causes a rapid increase in temperature if the 
cooling system fails 

(1 watt = 1 joule/second or 1 W = 1J/s and 1 MW = 1 megawatt). (a) 
Calculate the rate of temperature increase in degrees Celsius per second (°C/s) if 
the mass of the reactor core is 1.60 x 10° kg and it has an average specific heat of 
0.3349 kJ/kg° - C. (b) How long would it take to obtain a temperature increase of 
2000°C, which could cause some metals holding the radioactive materials to melt? 
(The initial rate of temperature increase would be greater than that calculated here 
because the heat transfer is concentrated in a smaller mass. Later, however, the 
temperature increase would slow down because the 5 x 10°-kg steel containment 
vessel would also begin to heat up.) 


Radioactive spent- 
fuel pool at a 
nuclear power 
plant. Spent fuel 
stays hot for a long 
time. (credit: U.S. 
Department of 
Energy) 


Glossary 


specific heat 
the amount of heat necessary to change the temperature of 1.00 kg of a substance by 
1.00 °C 


Phase Change and Latent Heat 


e Examine heat transfer. 
e Calculate final temperature from heat transfer. 


So far we have discussed temperature change due to heat transfer. No temperature change occurs from 
heat transfer if ice melts and becomes liquid water (i.e., during a phase change). For example, consider 
water dripping from icicles melting on a roof warmed by the Sun. Conversely, water freezes in an ice tray 
cooled by lower-temperature surroundings. 


Heat from the air transfers to 
the ice causing it to melt. 
(credit: Mike Brand) 


Energy is required to melt a solid because the cohesive bonds between the molecules in the solid must be 
broken apart such that, in the liquid, the molecules can move around at comparable kinetic energies; thus, 
there is no rise in temperature. Similarly, energy is needed to vaporize a liquid, because molecules in a 
liquid interact with each other via attractive forces. There is no temperature change until a phase change is 
complete. The temperature of a cup of soda initially at 0°C stays at 0°C until all the ice has melted. 
Conversely, energy is released during freezing and condensation, usually in the form of thermal energy. 
Work is done by cohesive forces when molecules are brought together. The corresponding energy must be 
given off (dissipated) to allow them to stay together [link]. 


The energy involved in a phase change depends on two major factors: the number and strength of bonds or 
force pairs. The number of bonds is proportional to the number of molecules and thus to the mass of the 
sample. The strength of forces depends on the type of molecules. The heat Q required to change the phase 
of a sample of mass ™ is given by 

Equation: 


Q = mL+ (melting/freezing), 
Equation: 
Q = mL, (vaporization/condensation), 


where the latent heat of fusion, L¢, and latent heat of vaporization, L,, are material constants that are 
determined experimentally. See ([link]). 
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(b) 


(a) Energy is required to partially overcome the 
attractive forces between molecules in a solid to 
form a liquid. That same energy must be 
removed for freezing to take place. (b) 
Molecules are separated by large distances 
when going from liquid to vapor, requiring 
significant energy to overcome molecular 
attraction. The same energy must be removed 
for condensation to take place. There is no 
temperature change until a phase change is 
complete. 


Latent heat is measured in units of J/kg. Both L¢ and L, depend on the substance, particularly on the 
strength of its molecular forces as noted earlier. L¢ and Ly are collectively called latent heat coefficients. 
They are latent, or hidden, because in phase changes, energy enters or leaves a system without causing a 
temperature change in the system; so, in effect, the energy is hidden. [link] lists representative values of 
L¢ and L,, together with melting and boiling points. 


The table shows that significant amounts of energy are involved in phase changes. Let us look, for 
example, at how much energy is needed to melt a kilogram of ice at 0°C to produce a kilogram of water at 
0°C. Using the equation for a change in temperature and the value for water from [link], we find that 

Q = mL; = (1.0 kg) (334 kJ/kg) = 334 kJ is the energy to melt a kilogram of ice. This is a lot of 
energy as it represents the same amount of energy needed to raise the temperature of 1 kg of liquid water 
from 0°C to 79.8°C. Even more energy is required to vaporize water; it would take 2256 kJ to change 1 kg 
of liquid water at the normal boiling point (100°C at atmospheric pressure) to steam (water vapor). This 
example shows that the energy for a phase change is enormous compared to energy associated with 
temperature changes without a phase change. 


Substance 


Helium 
Hydrogen 
Nitrogen 
Oxygen 
Ethanol 
Ammonia 


Mercury 


Water 


Sulfur 
Lead 
Antimony 
Aluminum 
Silver 
Gold 
Copper 
Uranium 


Tungsten 


Heats of Fusion and Vaporization [footnote] 


Melting 
point 
(°C) 
-269.7 
-259.3 
-210.0 
-218.8 
-114 


=795 


—38.9 


0.00 


119 
327 
631 
660 
961 
1063 
1083 
1133 


3410 


11.8 


334 


38.1 


24.5 


165 


380 


88.3 


64.5 


134 


84 


184 


kcal/kg 


79.8 


32.0 


Boiling 
point 
(°C) 
-268.9 
-252.9 
-195.8 
-183.0 
78.3 


—33.4 


357 


100.0 


444.6 
1750 
1440 
2450 
2193 
2660 
2595 
3900 


5900 


2256[footnote] 
At 37.0°C 
(body 
temperature), 
the heat of 
vaporization 
Ly, for water is 
2430 kJ/kg or 
580 kcal/kg 


326 


871 


561 


11400 


2336 


1578 


5069 


1900 


4810 


kcal/kg 


4.99 

108 

48.0 

50.9 

204 

327 

65.0 
539[footnote] 
At 37.0°C 
(body 
temperature), 
the heat of 
vaporization 
L,, for water 
is 2430 kJ/kg 
or 580 
kcal/kg 

77.9 

208 

134 

2720 

558 

377 

1211 

454 


1150 


Values quoted at the normal melting and boiling temperatures at standard atmospheric pressure (1 atm). 


Phase changes can have a tremendous stabilizing effect even on temperatures that are not near the melting 
and boiling points, because evaporation and condensation (conversion of a gas into a liquid state) occur 
even at temperatures below the boiling point. Take, for example, the fact that air temperatures in humid 
climates rarely go above 35.0°C, which is because most heat transfer goes into evaporating water into the 
air. Similarly, temperatures in humid weather rarely fall below the dew point because enormous heat is 
released when water vapor condenses. 


We examine the effects of phase change more precisely by considering adding heat into a sample of ice at 
—20°C ([link]). The temperature of the ice rises linearly, absorbing heat at a constant rate of 

0.50 cal/g -° C until it reaches 0°C. Once at this temperature, the ice begins to melt until all the ice has 
melted, absorbing 79.8 cal/g of heat. The temperature remains constant at 0°C during this phase change. 
Once all the ice has melted, the temperature of the liquid water rises, absorbing heat at a new constant rate 
of 1.00 cal/g -° C. At 100°C, the water begins to boil and the temperature again remains constant while 
the water absorbs 539 cal/g of heat during this phase change. When all the liquid has become steam vapor, 


the temperature rises again, absorbing heat at a rate of 0.482 cal/g -° C. 
4 
1204 


Steam 


100 4 
Water + Steam 


0 100 200 300 400 500 600 700 800 
AQ/m (cal/g) 


A graph of temperature versus energy 
added. The system is constructed so that no 
vapor evaporates while ice warms to become 
liquid water, and so that, when vaporization 
occurs, the vapor remains in of the system. 
The long stretches of constant temperature 
values at 0°C and 100°C reflect the large 
latent heat of melting and vaporization, 
respectively. 


Water can evaporate at temperatures below the boiling point. More energy is required than at the boiling 
point, because the kinetic energy of water molecules at temperatures below 100°C is less than that at 
100°C, hence less energy is available from random thermal motions. Take, for example, the fact that, at 
body temperature, perspiration from the skin requires a heat input of 2428 kJ/kg, which is about 10 
percent higher than the latent heat of vaporization at 100°C. This heat comes from the skin, and thus 
provides an effective cooling mechanism in hot weather. High humidity inhibits evaporation, so that body 
temperature might rise, leaving unevaporated sweat on your brow. 


Example: 
Calculate Final Temperature from Phase Change: Cooling Soda with Ice Cubes 


Three ice cubes are used to chill a soda at 20°C with mass mgoga = 0.25 kg. The ice is at 0°C and each 
ice cube has a mass of 6.0 g. Assume that the soda is kept in a foam container so that heat loss can be 
ignored. Assume the soda has the same heat capacity as water. Find the final temperature when all ice has 
melted. 

Strategy 

The ice cubes are at the melting temperature of 0°C. Heat is transferred from the soda to the ice for 
melting. Melting of ice occurs in two steps: first the phase change occurs and solid (ice) transforms into 
liquid water at the melting temperature, then the temperature of this water rises. Melting yields water at 
0°C, so more heat is transferred from the soda to this water until the water plus soda system reaches 
thermal equilibrium, 


Equation: 
Qice — —Qsoda: 
The heat transferred to the ice is Qice = MiceL¢ + Micecw(T — 0°C). The heat given off by the soda is 
Qsoda = Msodacw(T¢ — 20°C). Since no heat is lost, Qice = —Qsoda, SO that 
Equation: 


Mice Lg = Micecw(Tt = 0°C) = —Msodacw (Tt =; 20°C). 


Bring all terms involving 7; on the left-hand-side and all other terms on the right-hand-side. Solve for the 
unknown quantity T¢: 

Equation: 

Msodacw (20°C) — MiceLt 


T; = 
(isos Sle Mice) CW 


Solution 


1. Identify the known quantities. The mass of ice is Mice = 36.0 g = 0.018 kg and the mass of soda 
iS Mgoda = 0.25 kg. 

2. Calculate the terms in the numerator: 
Equation: 


Mesodacw(20°C) = (0.25 kg) (4186 J/kg -° C)(20°C) = 20,930 J 


and 
Equation: 


MiceLt = (0.018 kg) (334,000 J/kg)=6012 J. 


3. Calculate the denominator: 
Equation: 


(Msoda + Mice )Cw = (0.25 kg + 0.018 kg) (4186 K/(kg -° C)=1122 J/°C. 


4. Calculate the final temperature: 
Equation: 


__ 20,930 J — 6012 J 
TRE LS 


Sak. 


f 


Discussion 

This example illustrates the enormous energies involved during a phase change. The mass of ice is about 
7 percent the mass of water but leads to a noticeable change in the temperature of soda. Although we 
assumed that the ice was at the freezing temperature, this is incorrect: the typical temperature is —6°C. 
However, this correction gives a final temperature that is essentially identical to the result we found. Can 


you explain why? 


We have seen that vaporization requires heat transfer to a liquid from the surroundings, so that energy is 
released by the surroundings. Condensation is the reverse process, increasing the temperature of the 
surroundings. This increase may seem surprising, since we associate condensation with cold objects—the 
glass in the figure, for example. However, energy must be removed from the condensing molecules to 
make a vapor condense. The energy is exactly the same as that required to make the phase change in the 
other direction, from liquid to vapor, and so it can be calculated from Q = mLy,. 


Condensation forms on this 
glass of iced tea because the 
temperature of the nearby air 
is reduced to below the dew 
point. The rate at which water 
molecules join together 
exceeds the rate at which they 
separate, and so water 
condenses. Energy is released 
when the water condenses, 
speeding the melting of the 
ice in the glass. (credit: Jenny 
Downing) 


Note: 

Real-World Application 

Energy is also released when a liquid freezes. This phenomenon is used by fruit growers in Florida to 
protect oranges when the temperature is close to the freezing point (0°C). Growers spray water on the 


plants in orchards so that the water freezes and heat is released to the growing oranges on the trees. This 


prevents the temperature inside the orange from dropping below freezing, which would damage the fruit. 
Gee Se Nighi (Saye 


The ice on these trees released large 
amounts of energy when it froze, 
helping to prevent the temperature of 
the trees from dropping below 0°C. 
Water is intentionally sprayed on 
orchards to help prevent hard frosts. 
(credit: Hermann Hammer) 


Sublimation is the transition from solid to vapor phase. You may have noticed that snow can disappear 
into thin air without a trace of liquid water, or the disappearance of ice cubes in a freezer. The reverse is 
also true: Frost can form on very cold windows without going through the liquid stage. A popular effect is 
the making of “smoke” from dry ice, which is solid carbon dioxide. Sublimation occurs because the 
equilibrium vapor pressure of solids is not zero. Certain air fresheners use the sublimation of a solid to 
inject a perfume into the room. Moth balls are a slightly toxic example of a phenol (an organic compound) 
that sublimates, while some solids, such as osmium tetroxide, are so toxic that they must be kept in sealed 
containers to prevent human exposure to their sublimation-produced vapors. 


Direct transitions 
between solid and 


vapor are common, 
sometimes useful, 
and even beautiful. 
(a) Dry ice 
sublimates directly to 
carbon dioxide gas. 
The visible vapor is 
made of water 
droplets. (credit: 
Windell Oskay) (b) 
Frost forms patterns 
on a very cold 
window, an example 
of a solid formed 
directly from a 
vapor. (credit: Liz 
West) 


All phase transitions involve heat. In the case of direct solid-vapor transitions, the energy required is given 
by the equation Q = mLs, where L, is the heat of sublimation, which is the energy required to change 
1.00 kg of a substance from the solid phase to the vapor phase. L, is analogous to Ly and Ly, and its value 
depends on the substance. Sublimation requires energy input, so that dry ice is an effective coolant, 
whereas the reverse process (i.e., frosting) releases energy. The amount of energy required for sublimation 
is of the same order of magnitude as that for other phase transitions. 


The material presented in this section and the preceding section allows us to calculate any number of 
effects related to temperature and phase change. In each case, it is necessary to identify which temperature 
and phase changes are taking place and then to apply the appropriate equation. Keep in mind that heat 
transfer and work can cause both temperature and phase changes. 


Problem-Solving Strategies for the Effects of Heat Transfer 


= 


. Examine the situation to determine that there is a change in the temperature or phase. Is there heat 
transfer into or out of the system? When the presence or absence of a phase change is not obvious, 
you may wish to first solve the problem as if there were no phase changes, and examine the 
temperature change obtained. If it is sufficient to take you past a boiling or melting point, you should 
then go back and do the problem in steps—temperature change, phase change, subsequent 
temperature change, and so on. 

. Identify and list all objects that change temperature and phase. 

. Identify exactly what needs to be determined in the problem (identify the unknowns). A written list is 
useful. 

4. Make a list of what is given or what can be inferred from the problem as stated (identify the knowns). 

5. Solve the appropriate equation for the quantity to be determined (the unknown). If there is a 

temperature change, the transferred heat depends on the specific heat (see [link]) whereas, for a phase 
change, the transferred heat depends on the latent heat. See [Link]. 

6. Substitute the knowns along with their units into the appropriate equation and obtain numerical 

solutions complete with units. You will need to do this in steps if there is more than one stage to the 

process (such as a temperature change followed by a phase change). 


WN 


7. Check the answer to see if it is reasonable: Does it make sense? As an example, be certain that the 
temperature change does not also cause a phase change that you have not taken into account. 


Exercise: 
Check Your Understanding 
Problem: 


Why does snow remain on mountain slopes even when daytime temperatures are higher than the 
freezing temperature? 


Solution: 


Snow is formed from ice crystals and thus is the solid phase of water. Because enormous heat is 
necessary for phase changes, it takes a certain amount of time for this heat to be accumulated from 
the air, even if the air is above 0°C. The warmer the air is, the faster this heat exchange occurs and 
the faster the snow melts. 


Summary 


e Most substances can exist either in solid, liquid, and gas forms, which are referred to as “phases.” 
e Phase changes occur at fixed temperatures for a given substance at a given pressure, and these 
temperatures are called boiling and freezing (or melting) points. 
e During phase changes, heat absorbed or released is given by: 
Equation: 


Q=nmlL, 


where L is the latent heat coefficient. 


Conceptual Questions 


Exercise: 


Problem: 


Heat transfer can cause temperature and phase changes. What else can cause these changes? 
Exercise: 

Problem: 

How does the latent heat of fusion of water help slow the decrease of air temperatures, perhaps 


preventing temperatures from falling significantly below 0°C, in the vicinity of large bodies of 
water? 


Exercise: 


Problem: What is the temperature of ice right after it is formed by freezing water? 


Exercise: 


Problem: 
If you place 0°C ice into 0°C water in an insulated container, what will happen? Will some ice melt, 
will more water freeze, or will neither take place? 
Exercise: 
Problem: 
What effect does condensation on a glass of ice water have on the rate at which the ice melts? Will 
the condensation speed up the melting process or slow it down? 
Exercise: 
Problem: 
In very humid climates where there are numerous bodies of water, such as in Florida, it is unusual for 


temperatures to rise above about 35°C (95°F). In deserts, however, temperatures can rise far above 
this. Explain how the evaporation of water helps limit high temperatures in humid climates. 


Exercise: 
Problem: 
In winters, it is often warmer in San Francisco than in nearby Sacramento, 150 km inland. In 


summers, it is nearly always hotter in Sacramento. Explain how the bodies of water surrounding San 
Francisco moderate its extreme temperatures. 


Exercise: 
Problem: 
Putting a lid on a boiling pot greatly reduces the heat transfer necessary to keep it boiling. Explain 
why. 
Exercise: 
Problem: 
Freeze-dried foods have been dehydrated in a vacuum. During the process, the food freezes and must 


be heated to facilitate dehydration. Explain both how the vacuum speeds up dehydration and why the 
food freezes as a result. 


Exercise: 
Problem: 
When still air cools by radiating at night, it is unusual for temperatures to fall below the dew point. 
Explain why. 

Exercise: 
Problem: 
In a physics classroom demonstration, an instructor inflates a balloon by mouth and then cools it in 
liquid nitrogen. When cold, the shrunken balloon has a small amount of light blue liquid in it, as well 
as some snow-like crystals. As it warms up, the liquid boils, and part of the crystals sublimate, with 


some crystals lingering for awhile and then producing a liquid. Identify the blue liquid and the two 
solids in the cold balloon. Justify your identifications using data from [link]. 


Problems & Exercises 


Exercise: 
Problem: 


How much heat transfer (in kilocalories) is required to thaw a 0.450-kg package of frozen vegetables 
originally at 0°C if their heat of fusion is the same as that of water? 


Solution: 


35.9 kcal 
Exercise: 
Problem: 


A bag containing 0°C ice is much more effective in absorbing energy than one containing the same 
amount of 0°C water. 


a. How much heat transfer is necessary to raise the temperature of 0.800 kg of water from 0°C to 
30.0°C? 

b. How much heat transfer is required to first melt 0.800 kg of 0°C ice and then raise its 
temperature? 

c. Explain how your answer supports the contention that the ice is more effective. 


Exercise: 
Problem: 
(a) How much heat transfer is required to raise the temperature of a 0.750-kg aluminum pot 
containing 2.50 kg of water from 30.0°C to the boiling point and then boil away 0.750 kg of water? 


(b) How long does this take if the rate of heat transfer is 500 W 
1 watt = 1 joule/second (1W=1J/s)? 


Solution: 
(a) 591 kcal 


(b) 4.94 x 10° s 
Exercise: 
Problem: 
The formation of condensation on a glass of ice water causes the ice to melt faster than it would 


otherwise. If 8.00 g of condensation forms on a glass containing both water and 200 g of ice, how 
many grams of the ice will melt as a result? Assume no other heat transfer occurs. 


Exercise: 
Problem: 
On a trip, you notice that a 3.50-kg bag of ice lasts an average of one day in your cooler. What is the 


average power in watts entering the ice if it starts at O°C and completely melts to 0°C water in 
exactly one day 1 watt = 1 joule/second (1W=1J/s)? 


Solution: 


13.5 W 
Exercise: 
Problem: 
On a certain dry sunny day, a swimming pool’s temperature would rise by 1.50°C if not for 


evaporation. What fraction of the water must evaporate to carry away precisely enough energy to 
keep the temperature constant? 


Exercise: 


Problem: 


(a) How much heat transfer is necessary to raise the temperature of a 0.200-kg piece of ice from 
—20.0°C to 130°C, including the energy needed for phase changes? 

(b) How much time is required for each stage, assuming a constant 20.0 kJ/s rate of heat transfer? 
(c) Make a graph of temperature versus time for this process. 


Solution: 
(a) 148 kcal 


(b) 0.418 s, 3.34 s, 4.19 s, 22.6 s, 0.456 s 
Exercise: 


Problem: 


In 1986, a gargantuan iceberg broke away from the Ross Ice Shelf in Antarctica. It was 
approximately a rectangle 160 km long, 40.0 km wide, and 250 m thick. 


(a) What is the mass of this iceberg, given that the density of ice is 917 kg/ m*? 
(b) How much heat transfer (in joules) is needed to melt it? 


(c) How many years would it take sunlight alone to melt ice this thick, if the ice absorbs an average 
of 100 W/m”, 12.00 h per day? 

Exercise: 
Problem: 
How many grams of coffee must evaporate from 350 g of coffee in a 100-g glass cup to cool the 
coffee from 95.0°C to 45.0°C? You may assume the coffee has the same thermal properties as water 


and that the average heat of vaporization is 2340 kJ/kg (560 cal/g). (You may neglect the change in 
mass of the coffee as it cools, which will give you an answer that is slightly larger than correct.) 


Solution: 


33.0 g 


Exercise: 


Problem: 


(a) It is difficult to extinguish a fire on a crude oil tanker, because each liter of crude oil releases 

2.80 x 10’ J of energy when burned. To illustrate this difficulty, calculate the number of liters of 
water that must be expended to absorb the energy released by burning 1.00 L of crude oil, if the water 
has its temperature raised from 20.0°C to 100°C, it boils, and the resulting steam is raised to 300°C. 
(b) Discuss additional complications caused by the fact that crude oil has a smaller density than 
water. 


Solution: 
(a) 9.67 L 


(b) Crude oil is less dense than water, so it floats on top of the water, thereby exposing it to the 
oxygen in the air, which it uses to burn. Also, if the water is under the oil, it is less efficient in 
absorbing the heat generated by the oil. 


Exercise: 
Problem: 
The energy released from condensation in thunderstorms can be very large. Calculate the energy 


released into the atmosphere for a small storm of radius 1 km, assuming that 1.0 cm of rain is 
precipitated uniformly over this area. 


Exercise: 
Problem:To help prevent frost damage, 4.00 kg of 0°C water is sprayed onto a fruit tree. 


(a) How much heat transfer occurs as the water freezes? 


(b) How much would the temperature of the 200-kg tree decrease if this amount of heat transferred 
from the tree? Take the specific heat to be 3.35 kJ/kg -° C, and assume that no phase change occurs. 


Solution: 
a) 319 kcal 
b) 2.00°C 


Exercise: 


Problem: 


A 0.250-kg aluminum bowl holding 0.800 kg of soup at 25.0°C is placed in a freezer. What is the 
final temperature if 377 kJ of energy is transferred from the bowl and soup, assuming the soup’s 
thermal properties are the same as that of water? Explicitly show how you follow the steps in 
Problem-Solving Strategies for the Effects of Heat Transfer. 


Exercise: 


Problem: 


A 0.0500-kg ice cube at —30.0°C is placed in 0.400 kg of 35.0°C water in a very well-insulated 
container. What is the final temperature? 


Solution: 


20.6°C 
Exercise: 
Problem: 
If you pour 0.0100 kg of 20.0°C water onto a 1.20-kg block of ice (which is initially at —15.0°C), 


what is the final temperature? You may assume that the water cools so rapidly that effects of the 
surroundings are negligible. 


Exercise: 
Problem: 
Indigenous people sometimes cook in watertight baskets by placing hot rocks into water to bring it to 
a boil. What mass of 500°C rock must be placed in 4.00 kg of 15.0°C water to bring its temperature 


to 100°C, if 0.0250 kg of water escapes as vapor from the initial sizzle? You may neglect the effects 
of the surroundings and take the average specific heat of the rocks to be that of granite. 


Solution: 


4.38 kg 
Exercise: 
Problem: 
What would be the final temperature of the pan and water in Calculating the Final Temperature When 
Heat Is Transferred Between Two Bodies: Pouring Cold Water in a Hot Pan if 0.260 kg of water was 


placed in the pan and 0.0100 kg of the water evaporated immediately, leaving the remainder to come 
to a common temperature with the pan? 


Exercise: 


Problem: 


In some countries, liquid nitrogen is used on dairy trucks instead of mechanical refrigerators. A 3.00- 
hour delivery trip requires 200 L of liquid nitrogen, which has a density of 808 kg/ m*, 


(a) Calculate the heat transfer necessary to evaporate this amount of liquid nitrogen and raise its 
temperature to 3.00°C. (Use c, and assume it is constant over the temperature range.) This value is 
the amount of cooling the liquid nitrogen supplies. 


(b) What is this heat transfer rate in kilowatt-hours? 


(c) Compare the amount of cooling obtained from melting an identical mass of 0°C ice with that from 
evaporating the liquid nitrogen. 


Solution: 
(a) 1.57 x 104 kcal 
(b) 18.3kW -h 


(c) 1.29 x 104 kcal 


Exercise: 
Problem: 


Some gun fanciers make their own bullets, which involves melting and casting the lead slugs. How 


much heat transfer is needed to raise the temperature and melt 0.500 kg of lead, starting from 25.0°C 
? 


Glossary 


heat of sublimation 
the energy required to change a substance from the solid phase to the vapor phase 


latent heat coefficient 
a physical constant equal to the amount of heat transferred for every 1 kg of a substance during the 
change in phase of the substance 


sublimation 
the transition from the solid phase to the vapor phase 


Heat Transfer Methods 
e Discuss the different methods of heat transfer. 


Equally as interesting as the effects of heat transfer on a system are the 
methods by which this occurs. Whenever there is a temperature difference, 
heat transfer occurs. Heat transfer may occur rapidly, such as through a 
cooking pan, or slowly, such as through the walls of a picnic ice chest. We 
can control rates of heat transfer by choosing materials (such as thick wool 
clothing for the winter), controlling air movement (such as the use of 
weather stripping around doors), or by choice of color (such as a white roof 
to reflect summer sunlight). So many processes involve heat transfer, so that 
it is hard to imagine a situation where no heat transfer occurs. Yet every 
process involving heat transfer takes place by only three methods: 


1. Conduction is heat transfer through stationary matter by physical 
contact. (The matter is stationary on a macroscopic scale—we know 
there is thermal motion of the atoms and molecules at any temperature 
above absolute zero.) Heat transferred between the electric burner of a 
stove and the bottom of a pan is transferred by conduction. 

2. Convection is the heat transfer by the macroscopic movement of a 
fluid. This type of transfer takes place in a forced-air furnace and in 
weather systems, for example. 

3. Heat transfer by radiation occurs when microwaves, infrared 
radiation, visible light, or another form of electromagnetic radiation is 
emitted or absorbed. An obvious example is the warming of the Earth 
by the Sun. A less obvious example is thermal radiation from the 
human body. 


Convection 
around windows 
and doors 
(cold air) 


Convection (hot air) 


Conduction 


In a fireplace, heat transfer 
occurs by all three methods: 
conduction, convection, and 

radiation. Radiation is 
responsible for most of the heat 
transferred into the room. Heat 
transfer also occurs through 
conduction into the room, but at 
a much slower rate. Heat 
transfer by convection also 
occurs through cold air entering 
the room around windows and 
hot air leaving the room by 
rising up the chimney. 


We examine these methods in some detail in the three following modules. 
Each method has unique and interesting characteristics, but all three do 
have one thing in common: they transfer heat solely because of a 
temperature difference [link]. 

Exercise: 

Check Your Understanding 


Problem: 


Name an example from daily life (different from the text) for each 
mechanism of heat transfer. 


Solution: 


Conduction: Heat transfers into your hands as you hold a hot cup of 
coffee. 


Convection: Heat transfers as the barista “steams” cold milk to make 
hot cocoa. 


Radiation: Reheating a cold cup of coffee in a microwave oven. 


Summary 


e Heat is transferred by three different methods: conduction, convection, 
and radiation. 


Conceptual Questions 


Exercise: 


Problem: 


What are the main methods of heat transfer from the hot core of Earth 
to its surface? From Earth’s surface to outer space? 


When our bodies get too warm, they respond by sweating and increasing 
blood circulation to the surface to transfer thermal energy away from the 
core. What effect will this have on a person in a 40.0°C hot tub? 


[link] shows a cut-away drawing of a thermos bottle (also known as a 
Dewar flask), which is a device designed specifically to slow down all 
forms of heat transfer. Explain the functions of the various parts, such as the 


vacuum, the silvering of the walls, the thin-walled long glass neck, the 
rubber support, the air layer, and the stopper. 


Glass walls 
with silvered 
surfaces 


Spring 
centering 


Hot or cold 
liquid 


Rubber support 


The construction 
of a thermos 
bottle is designed 
to inhibit all 
methods of heat 
transfer. 


Glossary 


conduction 
heat transfer through stationary matter by physical contact 


convection 
heat transfer by the macroscopic movement of fluid 


radiation 
heat transfer which occurs when microwaves, infrared radiation, 
visible light, or other electromagnetic radiation is emitted or absorbed 


Introduction to Oscillatory Motion and Waves 
class="introduction" 


There 
are at 
least 
four 
types 
of 
waves 
in this 
picture 
—only 
the 
water 
waves 
are 
evident 
. There 
are also 
sound 
waves, 
light 
waves, 
and 
waves 
on the 
guitar 
strings. 
(credit: 
John 
Norton 


) 


What do an ocean buoy, a child in a swing, the cone inside a speaker, a 
guitar, atoms in a crystal, the motion of chest cavities, and the beating of 
hearts all have in common? They all oscillate—-that is, they move back and 
forth between two points. Many systems oscillate, and they have certain 
characteristics in common. All oscillations involve force and energy. You 
push a child in a swing to get the motion started. The energy of atoms 
vibrating in a crystal can be increased with heat. You put energy into a 
guitar string when you pluck it. 


Some oscillations create waves. A guitar creates sound waves. You can 
make water waves in a swimming pool by slapping the water with your 
hand. You can no doubt think of other types of waves. Some, such as water 
waves, are visible. Some, such as sound waves, are not. But every wave is a 
disturbance that moves from its source and carries energy. Other examples 
of waves include earthquakes and visible light. Even subatomic particles, 
such as electrons, can behave like waves. 


By studying oscillatory motion and waves, we shall find that a small 
number of underlying principles describe all of them and that wave 
phenomena are more common than you have ever imagined. We begin by 
studying the type of force that underlies the simplest oscillations and waves. 
We will then expand our exploration of oscillatory motion and waves to 


include concepts such as simple harmonic motion, uniform circular motion, 
and damped harmonic motion. Finally, we will explore what happens when 
two or more waves share the same space, in the phenomena known as 
superposition and interference. 


Glossary 


oscillate 
moving back and forth regularly between two points 


wave 
a disturbance that moves from its source and carries energy 


Hooke’s Law: Stress and Strain Revisited 


e Explain Newton’s third law of motion with respect to stress and 
deformation. 

e Describe the restoration of force and displacement. 

¢ Calculate the energy in Hooke’s Law of deformation, and the stored 
energy in a spring. 


Equilibrium position 


When displaced 
from its vertical 
equilibrium 
position, this 
plastic ruler 
oscillates back and 
forth because of the 
restoring force 
opposing 
displacement. 
When the ruler is 
on the left, there is 
a force to the right, 
and vice versa. 


Newton’s first law implies that an object oscillating back and forth is 
experiencing forces. Without force, the object would move in a straight line 


at a constant speed rather than oscillate. Consider, for example, plucking a 
plastic ruler to the left as shown in [link]. The deformation of the ruler 
creates a force in the opposite direction, known as a restoring force. Once 
released, the restoring force causes the ruler to move back toward its stable 
equilibrium position, where the net force on it is zero. However, by the time 
the ruler gets there, it gains momentum and continues to move to the right, 
producing the opposite deformation. It is then forced to the left, back 
through equilibrium, and the process is repeated until dissipative forces 
dampen the motion. These forces remove mechanical energy from the 
system, gradually reducing the motion until the ruler comes to rest. 


The simplest oscillations occur when the restoring force is directly 
proportional to displacement. When stress and strain were covered in 
Newton’s Third Law of Motion, the name was given to this relationship 
between force and displacement was Hooke’s law: 

Equation: 


F = —kx. 


Here, F is the restoring force, x is the displacement from equilibrium or 
deformation, and k is a constant related to the difficulty in deforming the 
system. The minus sign indicates the restoring force is in the direction 
opposite to the displacement. 


Equilibrium position 


—_ 


, 
F : v F | ro ee 
(a) 


(b) (c) (d) (e) 


(a) The plastic ruler has been released, and the 
restoring force is returning the ruler to its equilibrium 
position. (b) The net force is zero at the equilibrium 
position, but the ruler has momentum and continues 
to move to the right. (c) The restoring force is in the 
opposite direction. It stops the ruler and moves it back 
toward equilibrium again. (d) Now the ruler has 
momentum to the left. (e) In the absence of damping 


(caused by frictional forces), the ruler reaches its 
original position. From there, the motion will repeat 
itself. 


The force constant k is related to the rigidity (or stiffness) of a system—the 
larger the force constant, the greater the restoring force, and the stiffer the 
system. The units of & are newtons per meter (N/m). For example, k is 
directly related to Young’s modulus when we stretch a string. [link] shows a 
graph of the absolute value of the restoring force versus the displacement 
for a system that can be described by Hooke’s law—a simple spring in this 
case. The slope of the graph equals the force constant k in newtons per 
meter. A common physics laboratory exercise is to measure restoring forces 
created by springs, determine if they follow Hooke’s law, and calculate their 
force constants if they do. 


F(N) 


0 
0 0.050 0.100 x (m) 
Displacement= x (m) 


(a) A graph of absolute value of the 
restoring force versus displacement is 


displayed. The fact that the graph is a 
straight line means that the system obeys 
Hooke’s law. The slope of the graph is 
the force constant k. (b) The data in the 
graph were generated by measuring the 
displacement of a spring from 
equilibrium while supporting various 
weights. The restoring force equals the 
weight supported, if the mass is 
stationary. 


Example: 
How Stiff Are Car Springs? 


The mass of a 


car increases 
due to the 
introduction of a 
passenger. This 
affects the 
displacement of 


the car on its 
suspension 
system. (credit: 
exfordy on 
Flickr) 


What is the force constant for the suspension system of a car that settles 
1.20 cm when an 80.0-kg person gets in? 

Strategy 

Consider the car to be in its equilibrium position x = 0 before the person 
gets in. The car then settles down 1.20 cm, which means it is displaced to a 
position « = —1.20 x 10°? m. At that point, the springs supply a 
restoring force F’ equal to the person’s weight 


w = mg = (80.0 kg) (9.80 m/s”) = 784 N. We take this force to be F 
in Hooke’s law. Knowing F' and x, we can then solve the force constant k. 
Solution 


1. Solve Hooke’s law, F' = —kx, for k: 
Equation: 


F 
k= ——. 
Lo 


Substitute known values and solve k: 


Equation: 
Pepe 784 N 
k =1- 20% 106" an 
= 6.53 x 104 N/m. 
Discussion 


Note that F’ and x have opposite signs because they are in opposite 
directions—the restoring force is up, and the displacement is down. Also, 
note that the car would oscillate up and down when the person got in if it 


were not for damping (due to frictional forces) provided by shock 
absorbers. Bouncing cars are a sure sign of bad shock absorbers. 


Energy in Hooke’s Law of Deformation 


In order to produce a deformation, work must be done. That is, a force must 
be exerted through a distance, whether you pluck a guitar string or 
compress a car spring. If the only result is deformation, and no work goes 
into thermal, sound, or kinetic energy, then all the work is initially stored in 
the deformed object as some form of potential energy. The potential energy 
stored in a spring is PE.) = Skx’, Here, we generalize the idea to elastic 
potential energy for a deformation of any system that can be described by 
Hooke’s law. Hence, 

Equation: 


1 
PE. = ak’, 


where PE, is the elastic potential energy stored in any deformed system 
that obeys Hooke’s law and has a displacement x from equilibrium and a 
force constant k. 


It is possible to find the work done in deforming a system in order to find 
the energy stored. This work is performed by an applied force Papp. The 
applied force is exactly opposite to the restoring force (action-reaction), and 
SO Papp = kx. [link] shows a graph of the applied force versus deformation 
x for a system that can be described by Hooke’s law. Work done on the 
system is force multiplied by distance, which equals the area under the 
curve or (1/2)kx?(Method A in the figure). Another way to determine the 
work is to note that the force increases linearly from 0 to kx, so that the 
average force is (1/2) kx, the distance moved is x, and thus 

W = Fappd = [(1/2)kx](x) = (1/2)kx” (Method B in the figure). 


Method A 


& 1 1 
3 W =— bh = — kxx 
Ww 2 2 
TT — 
8 i hae kx 
& 
no} 
co) 
S Method B 
<x 


Displacement = x 


A graph of applied force versus distance for the 
deformation of a system that can be described by 
Hooke’s law is displayed. The work done on the 
system equals the area under the graph or the area of 
the triangle, which is half its base multiplied by its 
height, or W = (1/2)kx’. 


Example: 

Calculating Stored Energy: A Tranquilizer Gun Spring 

We can use a toy gun’s spring mechanism to ask and answer two simple 
questions: (a) How much energy is stored in the spring of a tranquilizer 
gun that has a force constant of 50.0 N/m and is compressed 0.150 m? (b) 
If you neglect friction and the mass of the spring, at what speed will a 
2.00-g projectile be ejected from the gun? 


a) 


eed 


b) Work is done 
to compress 
spring 
aca 
PE, 
Cc) 


KE 
j : 


(a) In this image of the 
gun, the spring is 
uncompressed before 
being cocked. (b) The 
spring has been 
compressed a distance z, 
and the projectile is in 
place. (c) When released, 
the spring converts elastic 
potential energy PE 
into kinetic energy. 


Strategy fora 

(a): The energy stored in the spring can be found directly from elastic 
potential energy equation, because k and ~ are given. 

Solution for a 

Entering the given values for & and x yields 

Equation: 


PE. = +kx” = +(50.0 N/m)(0.150 m)* = 0.563 N-m 
0.563 J 


Strategy for b 


Because there is no friction, the potential energy is converted entirely into 
kinetic energy. The expression for kinetic energy can be solved for the 
projectile’s speed. 

Solution for b 


1. Identify known quantities: 
Equation: 


KE; = PEq or 1/2mv? = (1/2)kx? = PEa = 0.563 J 


2. Solve for v: 
Equation: 


= | We 
v= | —— 
m 


ES J) 


1/2 
— 23.7(J/ke)1/? 
0.002 kg | (J/kg) 


3. Convert units: 23.7 m/s 


Discussion 

(a) and (b): This projectile speed is impressive for a tranquilizer gun (more 
than 80 km/h). The numbers in this problem seem reasonable. The force 
needed to compress the spring is small enough for an adult to manage, and 
the energy imparted to the dart is small enough to limit the damage it might 
do. Yet, the speed of the dart is great enough for it to travel an acceptable 
distance. 


Exercise: 
Check your Understanding 


Problem: 


Envision holding the end of a ruler with one hand and deforming it 
with the other. When you let go, you can see the oscillations of the 
ruler. In what way could you modify this simple experiment to 
increase the rigidity of the system? 


Solution: 
Answer 


You could hold the ruler at its midpoint so that the part of the ruler that 
oscillates is half as long as in the original experiment. 


Exercise: 
Check your Understanding 


Problem: 


If you apply a deforming force on an object and let it come to 
equilibrium, what happened to the work you did on the system? 


Solution: 
Answer 


It was stored in the object as potential energy. 


Section Summary 


An oscillation is a back and forth motion of an object between two 
points of deformation. 

An oscillation may create a wave, which is a disturbance that 
propagates from where it was created. 

The simplest type of oscillations and waves are related to systems that 
can be described by Hooke’s law: 

Equation: 


F = —kx, 


where Fis the restoring force, x is the displacement from equilibrium 
or deformation, and k is the force constant of the system. 

Elastic potential energy PE,; stored in the deformation of a system 
that can be described by Hooke’s law is given by 

Equation: 


PE. = (1/2)kx?. 


Conceptual Questions 


Exercise: 


Problem: 


Describe a system in which elastic potential energy is stored. 


Problems & Exercises 


Exercise: 


Problem: 


Fish are hung on a spring scale to determine their mass (most 
fishermen feel no obligation to truthfully report the mass). 


(a) What is the force constant of the spring in such a scale if it the 
spring stretches 8.00 cm for a 10.0 kg load? 


(b) What is the mass of a fish that stretches the spring 5.50 cm? 
(c) How far apart are the half-kilogram marks on the scale? 
Solution: 

(a) 1.23 x 10°? N/m 

(b) 6.88 kg 


(c) 4.00 mm 


Exercise: 


Problem: 


It is weigh-in time for the local under-85-kg rugby team. The bathroom 
scale used to assess eligibility can be described by Hooke’s law and is 
depressed 0.75 cm by its maximum load of 120 kg. (a) What is the 
spring’s effective spring constant? (b) A player stands on the scales 
and depresses it by 0.48 cm. Is he eligible to play on this under-85 kg 
team? 


Exercise: 
Problem: 
One type of BB gun uses a spring-driven plunger to blow the BB from 
its barrel. (a) Calculate the force constant of its plunger’s spring if you 


must compress it 0.150 m to drive the 0.0500-kg plunger to a top speed 
of 20.0 m/s. (b) What force must be exerted to compress the spring? 


Solution: 
(a) 889 N/m 


(b) 133 N 
Exercise: 
Problem: 
(a) The springs of a pickup truck act like a single spring with a force 


constant of 1.30 x 10° N/m. By how much will the truck be 
depressed by its maximum load of 1000 kg? 


(b) If the pickup truck has four identical springs, what is the force 
constant of each? 

Exercise: 
Problem: 


When an 80.0-kg man stands on a pogo stick, the spring is compressed 
0.120 m. 


(a) What is the force constant of the spring? (b) Will the spring be 
compressed more when he hops down the road? 


Solution: 
(a) 6.53 x 10? N/m 


(b) Yes 
Exercise: 


Problem: 


A spring has a length of 0.200 m when a 0.300-kg mass hangs from it, 
and a length of 0.750 m when a 1.95-kg mass hangs from it. (a) What 

is the force constant of the spring? (b) What is the unloaded length of 

the spring? 


Glossary 


deformation 
displacement from equilibrium 


elastic potential energy 
potential energy stored as a result of deformation of an elastic object, 
such as the stretching of a spring 


force constant 
a constant related to the rigidity of a system: the larger the force 
constant, the more rigid the system; the force constant is represented 
by k 


restoring force 
force acting in opposition to the force caused by a deformation 


Period and Frequency in Oscillations 


¢ Observe the vibrations of a guitar string. 
e Determine the frequency of oscillations. 


The strings on this 
guitar vibrate at 
regular time intervals. 
(credit: JAR) 


When you pluck a guitar string, the resulting sound has a steady tone and 
lasts a long time. Each successive vibration of the string takes the same 
time as the previous one. We define periodic motion to be a motion that 
repeats itself at regular time intervals, such as exhibited by the guitar string 
or by an object on a spring moving up and down. The time to complete one 
oscillation remains constant and is called the period 7’. Its units are usually 
seconds, but may be any convenient unit of time. The word period refers to 
the time for some event whether repetitive or not; but we shall be primarily 
interested in periodic motion, which is by definition repetitive. A concept 
closely related to period is the frequency of an event. For example, if you 
get a paycheck twice a month, the frequency of payment is two per month 
and the period between checks is half a month. Frequency f is defined to be 
the number of events per unit time. For periodic motion, frequency is the 
number of oscillations per unit time. The relationship between frequency 
and period is 

Equation: 


1 
ai 


The SI unit for frequency is the cycle per second, which is defined to be a 
hertz (Hz): 
Equation: 


| 1 
_ or 1 Hz = — 
Sec S 


1Hz=1 


A cycle is one complete oscillation. Note that a vibration can be a single or 
multiple event, whereas oscillations are usually repetitive for a significant 
number of cycles. 


Example: 

Determine the Frequency of Two Oscillations: Medical Ultrasound 
and the Period of Middle C 

We can use the formulas presented in this module to determine both the 
frequency based on known oscillations and the oscillation based on a 
known frequency. Let’s try one example of each. (a) A medical imaging 
device produces ultrasound by oscillating with a period of 0.400 ps. What 
is the frequency of this oscillation? (b) The frequency of middle C ona 
typical musical instrument is 264 Hz. What is the time for one complete 
oscillation? 

Strategy 

Both questions (a) and (b) can be answered using the relationship between 
period and frequency. In question (a), the period 7’ is given and we are 
asked to find frequency f. In question (b), the frequency f is given and we 
are asked to find the period 7’. 

Solution a 


1. Substitute 0.400 us for T in f = 7: 
Equation: 


a im 1 
T 0.400 x 10s 


Solve to find 
Equation: 


PSO A ake 


Discussion a 

The frequency of sound found in (a) is much higher than the highest 
frequency that humans can hear and, therefore, is called ultrasound. 
Appropriate oscillations at this frequency generate ultrasound used for 
noninvasive medical diagnoses, such as observations of a fetus in the 
womb. 

Solution b 


AG 


Identify the known values: 


The time for one complete oscillation is the period T: 
Equation: 


1 
i 
. solve for 7’: 
Equation: 
if 
i 
u 
. Substitute the given value for the frequency into the resulting 
expression: 
Equation: 
il 1 1 
T=— ——___ = 3.79 x 10° s = 3.79 ms. 


f  264Hz 264 cycles/s 


Discussion 
The period found in (b) is the time per cycle, but this value is often quoted 
as simply the time in convenient units (ms or milliseconds in this case). 


Exercise: 
Check your Understanding 


Problem: 


Identify an event in your life (such as receiving a paycheck) that 
occurs regularly. Identify both the period and frequency of this event. 


Solution: 


I visit my parents for dinner every other Sunday. The frequency of my 
visits is 26 per calendar year. The period is two weeks. 


Section Summary 


e Periodic motion is a repetitious oscillation. 
e The time for one oscillation is the period 7’. 
e The number of oscillations per unit time is the frequency f. 
e These quantities are related by 
Equation: 


os, 
| 
| Foe 


Problems & Exercises 


Exercise: 


Problem: What is the period of 60.0 Hz electrical power? 


Solution: 


16.7 ms 
Exercise: 


Problem: 


If your heart rate is 150 beats per minute during strenuous exercise, 
what is the time per beat in units of seconds? 


Solution: 


0.400 s/beats 
Exercise: 


Problem: 


Find the frequency of a tuning fork that takes 2.50 x 107° s to 
complete one oscillation. 


Solution: 


400 Hz 
Exercise: 


Problem: 


A stroboscope is set to flash every 8.00 x 10~° s. What is the 
frequency of the flashes? 


Solution: 


12,500 Hz 


Exercise: 


Problem: 


A tire has a tread pattern with a crevice every 2.00 cm. Each crevice 
makes a single vibration as the tire moves. What is the frequency of 
these vibrations if the car moves at 30.0 m/s? 


Solution: 
1.50 kHz 
Exercise: 
Problem: Engineering Application 


Each piston of an engine makes a sharp sound every other revolution 
of the engine. (a) How fast is a race car going if its eight-cylinder 
engine emits a sound of frequency 750 Hz, given that the engine 
makes 2000 revolutions per kilometer? (b) At how many revolutions 
per minute is the engine rotating? 


Solution: 
(a) 93.8 m/s 


(b) 11.3 x 10° rev/min 


Glossary 


period 
time it takes to complete one oscillation 


periodic motion 
motion that repeats itself at regular time intervals 


frequency 
number of events per unit of time 


Simple Harmonic Motion: A Special Periodic Motion 


e Describe a simple harmonic oscillator. 
e Explain the link between simple harmonic motion and waves. 


The oscillations of a system in which the net force can be described by 
Hooke’s law are of special importance, because they are very common. 
They are also the simplest oscillatory systems. Simple Harmonic Motion 
(SHM) is the name given to oscillatory motion for a system where the net 
force can be described by Hooke’s law, and such a system is called a simple 
harmonic oscillator. If the net force can be described by Hooke’s law and 
there is no damping (by friction or other non-conservative forces), then a 
simple harmonic oscillator will oscillate with equal displacement on either 
side of the equilibrium position, as shown for an object on a spring in [link]. 
The maximum displacement from equilibrium is called the amplitude X. 
The units for amplitude and displacement are the same, but depend on the 
type of oscillation. For the object on the spring, the units of amplitude and 
displacement are meters; whereas for sound oscillations, they have units of 
pressure (and other types of oscillations have yet other units). Because 
amplitude is the maximum displacement, it is related to the energy in the 
oscillation. 


Note: 

Take-Home Experiment: SHM and the Marble 

Find a bowl or basin that is shaped like a hemisphere on the inside. Place a 
marble inside the bowl and tilt the bowl periodically so the marble rolls 
from the bottom of the bowl to equally high points on the sides of the 
bowl. Get a feel for the force required to maintain this periodic motion. 
What is the restoring force and what role does the force you apply play in 
the simple harmonic motion (SHM) of the marble? 
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An object attached to a spring sliding on a frictionless 
surface is an uncomplicated simple harmonic oscillator. 
When displaced from equilibrium, the object performs 
simple harmonic motion that has an amplitude X and a 
period T’. The object’s maximum speed occurs as it 
passes through equilibrium. The stiffer the spring is, the 
smaller the period 7’. The greater the mass of the object 
is, the greater the period T. 


What is so significant about simple harmonic motion? One special thing is 
that the period T' and frequency f of a simple harmonic oscillator are 
independent of amplitude. The string of a guitar, for example, will oscillate 
with the same frequency whether plucked gently or hard. Because the 
period is constant, a simple harmonic oscillator can be used as a clock. 


Two important factors do affect the period of a simple harmonic oscillator. 
The period is related to how stiff the system is. A very stiff object has a 
large force constant k, which causes the system to have a smaller period. 
For example, you can adjust a diving board’s stiffness—the stiffer it is, the 


faster it vibrates, and the shorter its period. Period also depends on the mass 
of the oscillating system. The more massive the system is, the longer the 
period. For example, a heavy person on a diving board bounces up and 
down more slowly than a light one. 


In fact, the mass m and the force constant k are the only factors that affect 
the period and frequency of simple harmonic motion. 


Note: 
Period of Simple Harmonic Oscillator 
The period of a simple harmonic oscillator is given by 


Equation: 
LT = 2n4/ see 
k 


and, because f = 1/T, the frequency of a simple harmonic oscillator is 


Equation: 
1 k 
P= on V on 


Note that neither 7’ nor f has any dependence on amplitude. 


Note: 

Take-Home Experiment: Mass and Ruler Oscillations 

Find two identical wooden or plastic rulers. Tape one end of each ruler 
firmly to the edge of a table so that the length of each ruler that protrudes 
from the table is the same. On the free end of one ruler tape a heavy object 
such as a few large coins. Pluck the ends of the rulers at the same time and 
observe which one undergoes more cycles in a time period, and measure 
the period of oscillation of each of the rulers. 


Example: 

Calculate the Frequency and Period of Oscillations: Bad Shock 
Absorbers in a Car 

If the shock absorbers in a car go bad, then the car will oscillate at the least 
provocation, such as when going over bumps in the road and after stopping 
(See [link]). Calculate the frequency and period of these oscillations for 
such a car if the car’s mass (including its load) is 900 kg and the force 
constant (k) of the suspension system is 6.53 x 104 N /m. 

Strategy 

The frequency of the car’s oscillations will be that of a simple harmonic 


oscillator as given in the equation f = = 4/ x The mass and the force 


constant are both given. 
Solution 


1. Enter the known values of k and m: 
Equation: 


f- a 2 = 1 /6.53 x 104N/m 
On Vm Qn 900 kg , 


2. Calculate the frequency: 
Equation: 


il Z 2 
5, V 72.6/s? = 1.3656/s ' = 1.36/s ' = 1.36 Hz. 
Tl 
3. You could use T’ = 2n,/ 7% to calculate the period, but it is simpler to 
use the relationship T = 1/f and substitute the value just found for f: 
Equation: 


il 


T=— = 
f 1.356 Hz 


— 0) (35.3: 


Discussion 


The values of J’ and f both seem about right for a bouncing car. You can 
observe these oscillations if you push down hard on the end of a car and let 


go. 


The Link between Simple Harmonic Motion and Waves 


If a time-exposure photograph of the bouncing car were taken as it drove 
by, the headlight would make a wavelike streak, as shown in [link]. 
Similarly, [link] shows an object bouncing on a spring as it leaves a 
wavelike "trace of its position on a moving strip of paper. Both waves are 
sine functions. All simple harmonic motion is intimately related to sine and 
cosine waves. 


The bouncing car makes a 
wavelike motion. If the 
restoring force in the suspension 
system can be described only by 
Hooke’s law, then the wave is a 
sine function. (The wave is the 
trace produced by the headlight 
as the car moves to the right.) 


The vertical 
position of an 
object bouncing 
on a spring is 
recorded on a 
strip of moving 
paper, leaving a 
sine wave. 


The displacement as a function of time ¢ in any simple harmonic motion— 
that is, one in which the net restoring force can be described by Hooke’s 
law, is given by 

Equation: 


where X is amplitude. At ¢ = 0, the initial position is x9 = X, and the 
displacement oscillates back and forth with a period 7’. (When t = T’, we 
get x = X again because cos 2m = 1.). Furthermore, from this expression 
for x, the velocity v as a function of time is given by: 


Equation: 
; 2nt 
u(t) = —Umax sin ae 


where Umax = 20nX/T = X s k,/m. The object has zero velocity at 
maximum displacement—for example, v = 0 when t = 0, and at that time 
x = X. The minus sign in the first equation for v(t) gives the correct 
direction for the velocity. Just after the start of the motion, for instance, the 
velocity is negative because the system is moving back toward the 
equilibrium point. Finally, we can get an expression for acceleration using 
Newton’s second law. [Then we have x(t), u(t), t, and a(t), the quantities 
needed for kinematics and a description of simple harmonic motion. | 
According to Newton’s second law, the acceleration is a = F'/m = kx/m. 
So, a(t) is also a cosine function: 

Equation: 


Hence, a(t) is directly proportional to and in the opposite direction to x(t). 


[link] shows the simple harmonic motion of an object on a spring and 
presents graphs of x(t),v(t ), and a(t) versus time. 


Onin 
Orlin 
@ioniiines 
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Graphs of x(t), v(t), and a(t) 
versus t for the motion of an object 
on a spring. The net force on the 
object can be described by Hooke’s 
law, and so the object undergoes 
simple harmonic motion. Note that 
the initial position has the vertical 
displacement at its maximum value 
X; vis initially zero and then 
negative as the object moves 
down; and the initial acceleration 


is negative, back toward the 
equilibrium position and becomes 
zero at that point. 


The most important point here is that these equations are mathematically 
straightforward and are valid for all simple harmonic motion. They are very 
useful in visualizing waves associated with simple harmonic motion, 
including visualizing how waves add with one another. 

Exercise: 

Check Your Understanding 


Problem: 
Suppose you pluck a banjo string. You hear a single note that starts out 
loud and slowly quiets over time. Describe what happens to the sound 


waves in terms of period, frequency and amplitude as the sound 
decreases in volume. 


Solution: 
Frequency and period remain essentially unchanged. Only amplitude 


decreases as volume decreases. 


Exercise: 
Check Your Understanding 


Problem: 
A babysitter is pushing a child on a swing. At the point where the 


swing reaches x, where would the corresponding point on a wave of 
this motion be located? 


Solution: 


x is the maximum deformation, which corresponds to the amplitude of 
the wave. The point on the wave would either be at the very top or the 
very bottom of the curve. 


Note: 

PhET Explorations: Masses and Springs 

A realistic mass and spring laboratory. Hang masses from springs and 
adjust the spring stiffness and damping. You can even slow time. Transport 
the lab to different planets. A chart shows the kinetic, potential, and 
thermal energy for each spring. 
https://phet.colorado.edu/sims/mass-spring-lab/mass-spring-lab_en.html 


Section Summary 


Simple harmonic motion is oscillatory motion for a system that can be 
described only by Hooke’s law. Such a system is also called a simple 
harmonic oscillator. 

Maximum displacement is the amplitude X. The period T’ and 
frequency f of a simple harmonic oscillator are given by 


=> 2n4/ a and f = — / a where ™ is the mass of the system. 
Displacement in simple harmonic motion as a function of time is given 
by z(t) = X cos 

y Tr: 


The velocity is given by u(t) = ~Umaxsin 7 , where 
Ven =al kim. 
The acceleration is found to be a(t) = —£* cos a 


Conceptual Questions 


Exercise: 


Problem: 


What conditions must be met to produce simple harmonic motion? 


Exercise: 


Problem: 


(a) If frequency is not constant for some oscillation, can the oscillation 
be simple harmonic motion? 


(b) Can you think of any examples of harmonic motion where the 
frequency may depend on the amplitude? 
Exercise: 
Problem: 
Give an example of a simple harmonic oscillator, specifically noting 
how its frequency is independent of amplitude. 
Exercise: 
Problem: 
Explain why you expect an object made of a stiff material to vibrate at 
a higher frequency than a similar object made of a spongy material. 
Exercise: 
Problem: 
As you pass a freight truck with a trailer on a highway, you notice that 


its trailer is bouncing up and down slowly. Is it more likely that the 
trailer is heavily loaded or nearly empty? Explain your answer. 


Exercise: 


Problem: 

Some people modify cars to be much closer to the ground than when 

manufactured. Should they install stiffer springs? Explain your answer. 
Problems & Exercises 


Exercise: 


Problem: 


A type of cuckoo clock keeps time by having a mass bouncing on a 
spring, usually something cute like a cherub in a chair. What force 
constant is needed to produce a period of 0.500 s for a 0.0150-kg 
mass? 


Solution: 


2.37 N/m 
Exercise: 
Problem: 
If the spring constant of a simple harmonic oscillator is doubled, by 


what factor will the mass of the system need to change in order for the 
frequency of the motion to remain the same? 


Exercise: 
Problem: 
A 0.500-kg mass suspended from a spring oscillates with a period of 


1.50 s. How much mass must be added to the object to change the 
period to 2.00 s? 


Solution: 


0.389 kg 
Exercise: 
Problem: 
By how much leeway (both percentage and mass) would you have in 


the selection of the mass of the object in the previous problem if you 


did not wish the new period to be greater than 2.01 s or less than 1.99 
S? 


Exercise: 


Problem: 


Suppose you attach the object with mass m to a vertical spring 
originally at rest, and let it bounce up and down. You release the object 
from rest at the spring’s original rest length. (a) Show that the spring 
exerts an upward force of 2.00 mg on the object at its lowest point. 
(b) If the spring has a force constant of 10.0 N/m and a 0.25-kg-mass 
object is set in motion as described, find the amplitude of the 
oscillations. (c) Find the maximum velocity. 


Exercise: 
Problem: 
A diver on a diving board is undergoing simple harmonic motion. Her 
mass is 55.0 kg and the period of her motion is 0.800 s. The next diver 


is amale whose period of simple harmonic oscillation is 1.05 s. What 
is his mass if the mass of the board is negligible? 


Solution: 


94.7 kg 
Exercise: 
Problem: 
Suppose a diving board with no one on it bounces up and down in a 
simple harmonic motion with a frequency of 4.00 Hz. The board has 


an effective mass of 10.0 kg. What is the frequency of the simple 
harmonic motion of a 75.0-kg diver on the board? 


Exercise: 


Problem: 


This child’s toy 
relies on springs to 
keep infants 
entertained. (credit: 
By Humboldthead, 
Flickr) 


The device pictured in [link] entertains infants while keeping them 
from wandering. The child bounces in a harness suspended from a 
door frame by a spring constant. 


(a) If the spring stretches 0.250 m while supporting an 8.0-kg child, 
what is its spring constant? 


(b) What is the time for one complete bounce of this child? (c) What is 
the child’s maximum velocity if the amplitude of her bounce is 0.200 
m? 


Exercise: 


Problem: 


A 90.0-kg skydiver hanging from a parachute bounces up and down 
with a period of 1.50 s. What is the new period of oscillation when a 
second skydiver, whose mass is 60.0 kg, hangs from the legs of the 

first, as seen in [link]. 


The oscillations of 


one skydiver are 
about to be affected 
by a second 
skydiver. (credit: 
U.S. Army, 
www.army.mil) 


Solution: 


1.945 


Glossary 


amplitude 
the maximum displacement from the equilibrium position of an object 
oscillating around the equilibrium position 


simple harmonic motion 
the oscillatory motion in a system where the net force can be described 
by Hooke’s law 


simple harmonic oscillator 
a device that implements Hooke’s law, such as a mass that is attached 
to a spring, with the other end of the spring being connected to a rigid 
support such as a wall 


The Simple Pendulum 


e Measure acceleration due to gravity. 


cn) 


sq 


A simple pendulum 
has a small-diameter 
bob and a string that 
has a very small mass 
but is strong enough 
not to stretch 
appreciably. The linear 
displacement from 
equilibrium is s, the 
length of the arc. Also 
shown are the forces 
on the bob, which 
result in a net force of 
—m_g sin toward the 
equilibrium position— 
that is, a restoring 
force. 


Pendulums are in common usage. Some have crucial uses, such as in 
clocks; some are for fun, such as a child’s swing; and some are just there, 
such as the sinker on a fishing line. For small displacements, a pendulum is 
a simple harmonic oscillator. A simple pendulum is defined to have an 


object that has a small mass, also known as the pendulum bob, which is 
suspended from a light wire or string, such as shown in [link]. Exploring the 
simple pendulum a bit further, we can discover the conditions under which 
it performs simple harmonic motion, and we can derive an interesting 
expression for its period. 


We begin by defining the displacement to be the arc length s. We see from 
[link] that the net force on the bob is tangent to the arc and equals 

—mg sin 0. (The weight mg has components mg cos @ along the string and 
mg sin 9 tangent to the arc.) Tension in the string exactly cancels the 
component mg cos @ parallel to the string. This leaves a net restoring force 
back toward the equilibrium position at 6 = 0. 


Now, if we can show that the restoring force is directly proportional to the 
displacement, then we have a simple harmonic oscillator. In trying to 
determine if we have a simple harmonic oscillator, we should note that for 
small angles (less than about 15°), sin 8 9 (sin @ and 6 differ by about 
1% or less at smaller angles). Thus, for angles less than about 15°, the 
restoring force F' is 

Equation: 


F = —mg6. 


The displacement s is directly proportional to 8. When @ is expressed in 
radians, the arc length in a circle is related to its radius (L in this instance) 


by: 
Equation: 
s= L6, 
so that 
Equation: 
oem 


For small angles, then, the expression for the restoring force is: 
Equation: 
mg 


Fr ——s 
L 


This expression is of the form: 
Equation: 


F = —kx, 


where the force constant is given by k = mg/L and the displacement is 
given by x = s. For angles less than about 15°, the restoring force is 
directly proportional to the displacement, and the simple pendulum is a 
simple harmonic oscillator. 


Using this equation, we can find the period of a pendulum for amplitudes 
less than about 15°. For the simple pendulum: 


Equation: 
mM m 
T = 2n,/ — = 2n ; 
/ k Vaal L 
Thus, 
Equation: 


cn 
P=21 = 
g 


for the period of a simple pendulum. This result is interesting because of its 
simplicity. The only things that affect the period of a simple pendulum are 
its length and the acceleration due to gravity. The period is completely 
independent of other factors, such as mass. As with simple harmonic 
oscillators, the period 7’ for a pendulum is nearly independent of amplitude, 


especially if @ is less than about 15°. Even simple pendulum clocks can be 
finely adjusted and accurate. 


Note the dependence of 7’ on g. If the length of a pendulum is precisely 
known, it can actually be used to measure the acceleration due to gravity. 
Consider the following example. 


Example: 

Measuring Acceleration due to Gravity: The Period of a Pendulum 
What is the acceleration due to gravity in a region where a simple 
pendulum having a length 75.000 cm has a period of 1.7357 s? 

Strategy 

We are asked to find g given the period T and the length L of a pendulum. 
We can solve T = 2ny/ S for g, assuming only that the angle of deflection 


is less than 15°. 
Solution 


i Square. 2 — 21047 and solve for g: 
Equation: 
L 
2 
g= An T° 


2. Substitute known values into the new equation: 
Equation: 


p22: 75000 m 
(1.7357 s)? 


3. Calculate to find g: 
Equation: 


g = 9.8281 m/s?. 


Discussion 

This method for determining g can be very accurate. This is why length 
and period are given to five digits in this example. For the precision of the 
approximation sin 8 ~ @ to be better than the precision of the pendulum 
length and period, the maximum displacement angle should be kept below 
about 0.5°. 


Note: 

Making Career Connections 

Knowing g can be important in geological exploration; for example, a map 
of g over large geographical regions aids the study of plate tectonics and 
helps in the search for oil fields and large mineral deposits. 


Note: 

Take Home Experiment: Determining g 

Use a simple pendulum to determine the acceleration due to gravity g in 
your own locale. Cut a piece of a string or dental floss so that it is about 1 
m long. Attach a small object of high density to the end of the string (for 
example, a metal nut or a car key). Starting at an angle of less than 10°, 
allow the pendulum to swing and measure the pendulum’s period for 10 
oscillations using a stopwatch. Calculate g. How accurate is this 
measurement? How might it be improved? 


Exercise: 
Check Your Understanding 


Problem: 


An engineer builds two simple pendula. Both are suspended from 
small wires secured to the ceiling of a room. Each pendulum hovers 2 
cm above the floor. Pendulum 1 has a bob with a mass of 10 kg. 
Pendulum 2 has a bob with a mass of 100 kg. Describe how the 
motion of the pendula will differ if the bobs are both displaced by 12°. 


Solution: 


The movement of the pendula will not differ at all because the mass of 
the bob has no effect on the motion of a simple pendulum. The pendula 
are only affected by the period (which is related to the pendulum’s 
length) and by the acceleration due to gravity. 


Note: 

PhET Explorations: Pendulum Lab 

Play with one or two pendulums and discover how the period of a simple 
pendulum depends on the length of the string, the mass of the pendulum 
bob, and the amplitude of the swing. It’s easy to measure the period using 
the photogate timer. You can vary friction and the strength of gravity. Use 
the pendulum to find the value of g on planet X. Notice the anharmonic 
behavior at large amplitude. 


Section Summary 


e A mass m suspended by a wire of length LZ is a simple pendulum and 
undergoes simple harmonic motion for amplitudes less than about 15°. 


The period of a simple pendulum is 
Equation: 


where L is the length of the string and g is the acceleration due to 
gravity. 


Conceptual Questions 


Exercise: 
Problem: 
Pendulum clocks are made to run at the correct rate by adjusting the 
pendulum’s length. Suppose you move from one city to another where 
the acceleration due to gravity is slightly greater, taking your 
pendulum clock with you, will you have to lengthen or shorten the 


pendulum to keep the correct time, other factors remaining constant? 
Explain your answer. 


Problems & Exercises 


As usual, the acceleration due to gravity in these problems is taken to 
be g = 9.80 m/ s”, unless otherwise specified. 
Exercise: 


Problem: 
What is the length of a pendulum that has a period of 0.500 s? 
Solution: 


6.21 cm 


Exercise: 


Problem: 


Some people think a pendulum with a period of 1.00 s can be driven 
with “mental energy” or psycho kinetically, because its period is the 
Same as an average heartbeat. True or not, what is the length of such a 
pendulum? 


Exercise: 


Problem: What is the period of a 1.00-m-long pendulum? 
Solution: 


2.015 
Exercise: 
Problem: 
How long does it take a child on a swing to complete one swing if her 
center of gravity is 4.00 m below the pivot? 
Exercise: 
Problem: 


The pendulum on a cuckoo clock is 5.00 cm long. What is its 
frequency? 


Solution: 


223 HZ 
Exercise: 
Problem: 
Two parakeets sit on a swing with their combined center of mass 10.0 
cm below the pivot. At what frequency do they swing? 


Exercise: 


Problem: 


(a) A pendulum that has a period of 3.00000 s and that is located 
where the acceleration due to gravity is 9.79 m/ s” is moved to a 
location where it the acceleration due to gravity is 9.82 m/ 5”. What is 
its new period? (b) Explain why so many digits are needed in the value 
for the period, based on the relation between the period and the 
acceleration due to gravity. 


Solution: 
(a) 2.99541 s 


(b) Since the period is related to the square root of the acceleration of 
gravity, when the acceleration changes by 1% the period changes by 
(0.01)? = 0.01% so it is necessary to have at least 4 digits after the 
decimal to see the changes. 


Exercise: 


Problem: 


A pendulum with a period of 2.00000 s in one location 

(9 = 9.80 m/ s’) is moved to a new location where the period is now 

1.99796 s. What is the acceleration due to gravity at its new location? 
Exercise: 

Problem: 


(a) What is the effect on the period of a pendulum if you double its 
length? 


(b) What is the effect on the period of a pendulum if you decrease its 
length by 5.00%? 


Solution: 


(a) Period increases by a factor of 1.41 (/ 2) 


(b) Period decreases to 97.5% of old period 
Exercise: 
Problem: 
Find the ratio of the new/old periods of a pendulum if the pendulum 
were transported from Earth to the Moon, where the acceleration due 
eee 2 
to gravity is 1.63 m/s”. 
Exercise: 


Problem: 


At what rate will a pendulum clock run on the Moon, where the 
acceleration due to gravity is 1.63 m/ s”, if it keeps time accurately on 
Earth? That is, find the time (in hours) it takes the clock’s hour hand to 
make one revolution on the Moon. 


Solution: 


Slow by a factor of 2.45 

Exercise: 
Problem: 
Suppose the length of a clock’s pendulum is changed by 1.000%, 
exactly at noon one day. What time will it read 24.00 hours later, 
assuming it the pendulum has kept perfect time before the change? 


Note that there are two answers, and perform the calculation to four- 
digit precision. 


Exercise: 


Problem: 


If a pendulum-driven clock gains 5.00 s/day, what fractional change in 
pendulum length must be made for it to keep perfect time? 


Solution: 


length must increase by 0.0116%. 


Glossary 


simple pendulum 
an object with a small mass suspended from a light wire or string 


Energy and the Simple Harmonic Oscillator 
e Determine the maximum speed of an oscillating system. 


To study the energy of a simple harmonic oscillator, we first consider all the 
forms of energy it can have We know from Hooke’s Law: Stress and Strain 
Revisited that the energy stored in the deformation of a simple harmonic 
oscillator is a form of potential energy given by: 

Equation: 


1 
PE = ke’. 


Because a simple harmonic oscillator has no dissipative forces, the other 
important form of energy is kinetic energy KE. Conservation of energy for 
these two forms is: 

Equation: 


KE + PE, = constant 


or 
Equation: 


1 1 
zm + ake = constant. 


This statement of conservation of energy is valid for all simple harmonic 
oscillators, including ones where the gravitational force plays a role 


Namely, for a simple pendulum we replace the velocity with v = Dw, the 
spring constant with k = mg/Z, and the displacement term with « = L@. 
Thus 

Equation: 


1 1 
zmL*w* + 5 melo = constant. 


In the case of undamped simple harmonic motion, the energy oscillates 
back and forth between kinetic and potential, going completely from one to 
the other as the system oscillates. So for the simple example of an object on 
a frictionless surface attached to a spring, as shown again in [link], the 
motion starts with all of the energy stored in the spring. As the object starts 
to move, the elastic potential energy is converted to kinetic energy, 
becoming entirely kinetic energy at the equilibrium position. It is then 
converted back into elastic potential energy by the spring, the velocity 
becomes zero when the kinetic energy is completely converted, and so on. 
This concept provides extra insight here and in later applications of simple 
harmonic motion, such as alternating current circuits. 


The transformation of energy in simple harmonic motion is 
illustrated for an object attached to a spring on a frictionless 
surface. 


The conservation of energy principle can be used to derive an expression 
for velocity v. If we start our simple harmonic motion with zero velocity 
and maximum displacement (2 = X), then the total energy is 


Equation: 


kx. 
2 


This total energy is constant and is shifted back and forth between kinetic 
energy and potential energy, at most times being shared by each. The 
conservation of energy for this system in equation form is thus: 
Equation: 


Solving this equation for v yields: 


Equation: 
k 
= sf # (xe — 7). 
m 


Manipulating this expression algebraically gives: 


Equation: 
k 2 
v= 44/—X4/1-— — 
m xX? 
and so 
Equation: 
2 
x 
VU = £VUmax\/ 1 — ben 
where 


Equation: 


ke 
Umax = te 
m 


From this expression, we see that the velocity is a maximuM (Vmax) at 

x = 0, as stated earlier in v(t) = —Umax sin a Notice that the maximum 
velocity depends on three factors. Maximum velocity is directly 
proportional to amplitude. As you might guess, the greater the maximum 
displacement the greater the maximum velocity. Maximum velocity is also 
greater for stiffer systems, because they exert greater force for the same 
displacement. This observation is seen in the expression for Umax; it is 
proportional to the square root of the force constant k. Finally, the 
maximum velocity is smaller for objects that have larger masses, because 
the maximum velocity is inversely proportional to the square root of m. For 
a given force, objects that have large masses accelerate more slowly. 


A similar calculation for the simple pendulum produces a similar result, 


namely: 
Equation: 
ce 
Wmax = of Laas 


Example: 

Determine the Maximum Speed of an Oscillating System: A Bumpy 
Road 

Suppose that a car is 900 kg and has a suspension system that has a force 
constant k = 6.53 x 104 N/m. The car hits a bump and bounces with an 
amplitude of 0.100 m. What is its maximum vertical velocity if you assume 
no damping occurs? 

Strategy 


We can use the expression for Umax given iN Umax = / kX to determine 


the maximum vertical velocity. The variables m and k are given in the 


problem statement, and the maximum displacement X is 0.100 m. 
Solution 


1. Identify known. 
2. Substitute known values into Upax = 4/ £X 


Equation: 


_ | 6.53 x 10* N/m (0.100 m) 
Umax = 900 ke : a0) 


3. Calculate to find v,..= 0.852 m/s. 


Discussion 

This answer seems reasonable for a bouncing car. There are other ways to 
use conservation of energy to find vmax. We could use it directly, as was 
done in the example featured in Hooke’s Law: Stress and Strain Revisited. 
The small vertical displacement y of an oscillating simple pendulum, 
starting from its equilibrium position, is given as 

Equation: 


y(t) = asin ut, 


where a is the amplitude, w is the angular velocity and ¢ is the time taken. 
Substituting w = 4, we have 
Equation: 


Thus, the displacement of pendulum is a function of time as shown above. 
Also the velocity of the pendulum is given by 


Equation: 
(t) 2aT Qt 
OH) = ——— GES | 
T ye 


so the motion of the pendulum is a function of time. 


Exercise: 
Check Your Understanding 


Problem: 


Why does it hurt more if your hand is snapped with a ruler than with a 
loose spring, even if the displacement of each system is equal? 


Solution: 


The ruler is a stiffer system, which carries greater force for the same 
amount of displacement. The ruler snaps your hand with greater force, 
which hurts more. 


Exercise: 
Check Your Understanding 


Problem: 


You are observing a simple harmonic oscillator. Identify one way you 
could decrease the maximum velocity of the system. 


Solution: 


You could increase the mass of the object that is oscillating. 


Section Summary 


e Energy in the simple harmonic oscillator is shared between elastic 
potential energy and kinetic energy, with the total being constant: 
Equation: 


il 1 
zm + ake = constant. 


e Maximum velocity depends on three factors: it is directly proportional 
to amplitude, it is greater for stiffer systems, and it is smaller for 
objects that have larger masses: 


Equation: 
ke 
Cae i/ ks 
m 


Conceptual Questions 


Exercise: 
Problem: 
Explain in terms of energy how dissipative forces such as friction 
reduce the amplitude of a harmonic oscillator. Also explain how a 


driving mechanism can compensate. (A pendulum clock is such a 
system. ) 


Problems & Exercises 


Exercise: 
Problem: 


The length of nylon rope from which a mountain climber is suspended 
has a force constant of 1.40 x 104 N/m. 


(a) What is the frequency at which he bounces, given his mass plus 
and the mass of his equipment are 90.0 kg? 


(b) How much would this rope stretch to break the climber’s fall if he 
free-falls 2.00 m before the rope runs out of slack? Hint: Use 
conservation of energy. 


(c) Repeat both parts of this problem in the situation where twice this 
length of nylon rope is used. 


Solution: 
(a) 1.99 Hz 
(b) 50.2 cm 


(c) 1.41 Hz, 0.710 m 


Exercise: 


Problem: Engineering Application 


Near the top of the Citigroup Center building in New York City, there 
is an object with mass of 4.00 x 10° kg on springs that have 
adjustable force constants. Its function is to dampen wind-driven 
oscillations of the building by oscillating at the same frequency as the 
building is being driven—the driving force is transferred to the object, 
which oscillates instead of the entire building. (a) What effective force 
constant should the springs have to make the object oscillate with a 
period of 2.00 s? (b) What energy is stored in the springs for a 2.00-m 
displacement from equilibrium? 


Solution: 
(a) 3.95 x 10° N/m 


(b) 7.90 x 10° J 


Waves 


e State the characteristics of a wave. 
e Calculate the velocity of wave propagation. 


Waves in the ocean behave 
similarly to all other types of 
waves. (credit: Steve 
Jurveston, Flickr) 


What do we mean when we say something is a wave? The most intuitive 
and easiest wave to imagine is the familiar water wave. More precisely, a 
wave is a disturbance that propagates, or moves from the place it was 
created. For water waves, the disturbance is in the surface of the water, 
perhaps created by a rock thrown into a pond or by a swimmer splashing 
the surface repeatedly. For sound waves, the disturbance is a change in air 
pressure, perhaps created by the oscillating cone inside a speaker. For 
earthquakes, there are several types of disturbances, including disturbance 
of Earth’s surface and pressure disturbances under the surface. Even radio 
waves are most easily understood using an analogy with water waves. 
Visualizing water waves is useful because there is more to it than just a 
mental image. Water waves exhibit characteristics common to all waves, 
such as amplitude, period, frequency and energy. All wave characteristics 
can be described by a small set of underlying principles. 


A wave is a disturbance that propagates, or moves from the place it was 
created. The simplest waves repeat themselves for several cycles and are 
associated with simple harmonic motion. Let us start by considering the 
simplified water wave in [link]. The wave is an up and down disturbance of 
the water surface. It causes a sea gull to move up and down in simple 
harmonic motion as the wave crests and troughs (peaks and valleys) pass 
under the bird. The time for one complete up and down motion is the 
wave’s period JT’. The wave’s frequency is f = 1/T, as usual. The wave 
itself moves to the right in the figure. This movement of the wave is 
actually the disturbance moving to the right, not the water itself (or the bird 
would move to the right). We define wave velocity v,, to be the speed at 
which the disturbance moves. Wave velocity is sometimes also called the 
propagation velocity or propagation speed, because the disturbance 
propagates from one location to another. 


Note: 

Misconception Alert 

Many people think that water waves push water from one direction to 
another. In fact, the particles of water tend to stay in one location, save for 
moving up and down due to the energy in the wave. The energy moves 
forward through the water, but the water stays in one place. If you feel 
yourself pushed in an ocean, what you feel is the energy of the wave, not a 
rush of water. 


An idealized ocean wave passes under a sea gull that 
bobs up and down in simple harmonic motion. The 
wave has a wavelength A, which is the distance 
between adjacent identical parts of the wave. The up 
and down disturbance of the surface propagates 
parallel to the surface at a speed vy. 


The water wave in the figure also has a length associated with it, called its 
wavelength \, the distance between adjacent identical parts of a wave. (A is 
the distance parallel to the direction of propagation.) The speed of 
propagation v,, is the distance the wave travels in a given time, which is one 
wavelength in the time of one period. In equation form, that is 

Equation: 


or 
Equation: 


Ve = fr. 


This fundamental relationship holds for all types of waves. For water 
waves, Uy is the speed of a surface wave; for sound, vy, is the speed of 
sound; and for visible light, vy is the speed of light, for example. 


Note: 

Take-Home Experiment: Waves in a Bowl 

Fill a large bowl or basin with water and wait for the water to settle so 
there are no ripples. Gently drop a cork into the middle of the bowl. 
Estimate the wavelength and period of oscillation of the water wave that 
propagates away from the cork. Remove the cork from the bowl and wait 


for the water to settle again. Gently drop the cork at a height that is 
different from the first drop. Does the wavelength depend upon how high 
above the water the cork is dropped? 


Example: 

Calculate the Velocity of Wave Propagation: Gull in the Ocean 
Calculate the wave velocity of the ocean wave in [link] if the distance 
between wave crests is 10.0 m and the time for a sea gull to bob up and 
down is 5.00 s. 

Strategy 

We are asked to find vy. The given information tells us that \ = 10.0 m 


and T = 5.00 s. Therefore, we can use vy = 4 to find the wave velocity. 
Solution 


1. Enter the known values into vy, = a: 
Equation: 


10.0 m 
Ve : 
5.00 s 


2. Solve for vy to find vy~= 2.00 m/s. 


Discussion 

This slow speed seems reasonable for an ocean wave. Note that the wave 
moves to the right in the figure at this speed, not the varying speed at 
which the sea gull moves up and down. 


Transverse and Longitudinal Waves 


A simple wave consists of a periodic disturbance that propagates from one 
place to another. The wave in [link] propagates in the horizontal direction 
while the surface is disturbed in the vertical direction. Such a wave is called 
a transverse wave or shear wave; in such a wave, the disturbance is 
perpendicular to the direction of propagation. In contrast, in a longitudinal 


wave or compressional wave, the disturbance is parallel to the direction of 
propagation. [link] shows an example of a longitudinal wave. The size of 
the disturbance is its amplitude X and is completely independent of the 
speed of propagation vy. 


i ae 


In this example of a transverse 
wave, the wave propagates 
horizontally, and the 
disturbance in the cord is in the 
vertical direction. 


In this example of a longitudinal wave, 
the wave propagates horizontally, and the 
disturbance in the cord is also in the 
horizontal direction. 


Waves may be transverse, longitudinal, or a combination of the two. (Water 
waves are actually a combination of transverse and longitudinal. The 
simplified water wave illustrated in [link] shows no longitudinal motion of 
the bird.) The waves on the strings of musical instruments are transverse— 
so are electromagnetic waves, such as visible light. 


Sound waves in air and water are longitudinal. Their disturbances are 
periodic variations in pressure that are transmitted in fluids. Fluids do not 
have appreciable shear strength, and thus the sound waves in them must be 
longitudinal or compressional. Sound in solids can be both longitudinal and 
transverse. 


gh 


The wave on a guitar string 
is transverse. The sound 
wave rattles a sheet of paper 
in a direction that shows the 
sound wave is longitudinal. 


Earthquake waves under Earth’s surface also have both longitudinal and 
transverse components (called compressional or P-waves and shear or S- 
waves, respectively). These components have important individual 
characteristics—they propagate at different speeds, for example. 


Earthquakes also have surface waves that are similar to surface waves on 
water. 

Exercise: 

Check Your Understanding 


Problem: 


Why is it important to differentiate between longitudinal and 
transverse waves? 


Solution: 


In the different types of waves, energy can propagate in a different 
direction relative to the motion of the wave. This is important to 
understand how different types of waves affect the materials around 
them. 


Note: 

PhET Explorations: Wave on a String 

Watch a string vibrate in slow motion. Wiggle the end of the string and 
make waves, or adjust the frequency and amplitude of an oscillator. Adjust 
the damping and tension. The end can be fixed, loose, or open. 
https://phet.colorado.edu/sims/html/wave-on-a-string/latest/wave-on-a- 


string_en.html 


Section Summary 


e A wave is a disturbance that moves from the point of creation with a 
wave velocity Uy. 

e A wave has a wavelength A, which is the distance between adjacent 
identical parts of the wave. 

e Wave velocity and wavelength are related to the wave’s frequency and 


period by vy = # Ory, = fr. 


e A transverse wave has a disturbance perpendicular to its direction of 
propagation, whereas a longitudinal wave has a disturbance parallel to 
its direction of propagation. 


Conceptual Questions 


Exercise: 
Problem: 
Give one example of a transverse wave and another of a longitudinal 


wave, being careful to note the relative directions of the disturbance 
and wave propagation in each. 


Exercise: 


Problem: 
What is the difference between propagation speed and the frequency of 
a wave? Does one or both affect wavelength? If so, how? 

Problems & Exercises 


Exercise: 


Problem: 


Storms in the South Pacific can create waves that travel all the way to 
the California coast, which are 12,000 km away. How long does it take 
them if they travel at 15.0 m/s? 


Solution: 
Equation: 


t= 9.26d 


Exercise: 


Problem: 


Waves on a Swimming pool propagate at 0.750 m/s. You splash the 
water at one end of the pool and observe the wave go to the opposite 
end, reflect, and return in 30.0 s. How far away is the other end of the 
pool? 


Exercise: 
Problem: 


Wind gusts create ripples on the ocean that have a wavelength of 5.00 
cm and propagate at 2.00 m/s. What is their frequency? 


Solution: 
Equation: 


f = 40.0 Hz 


Exercise: 
Problem: 
How many times a minute does a boat bob up and down on ocean 


waves that have a wavelength of 40.0 m and a propagation speed of 
5.00 m/s? 


Exercise: 


Problem: 


Scouts at a camp shake the rope bridge they have just crossed and 
observe the wave crests to be 8.00 m apart. If they shake it the bridge 
twice per second, what is the propagation speed of the waves? 


Solution: 
Equation: 


Uw = 16.0 m/s 


Exercise: 
Problem: 
What is the wavelength of the waves you create in a swimming pool if 


you splash your hand at a rate of 2.00 Hz and the waves propagate at 
0.800 m/s? 


Exercise: 
Problem: 


What is the wavelength of an earthquake that shakes you with a 
frequency of 10.0 Hz and gets to another city 84.0 km away in 12.0 s? 


Solution: 
Equation: 


A = 700m 


Exercise: 
Problem: 
Radio waves transmitted through space at 3.00 x 10° m /s by the 


Voyager spacecraft have a wavelength of 0.120 m. What is their 
frequency? 


Exercise: 
Problem: 
Your ear is capable of differentiating sounds that arrive at the ear just 
1.00 ms apart. What is the minimum distance between two speakers 


that produce sounds that arrive at noticeably different times on a day 
when the speed of sound is 340 m/s? 


Solution: 
Equation: 


d = 34.0 cm 


Exercise: 


Problem: 


(a) Seismographs measure the arrival times of earthquakes with a 
precision of 0.100 s. To get the distance to the epicenter of the quake, 
they compare the arrival times of S- and P-waves, which travel at 
different speeds. [link]) If S- and P-waves travel at 4.00 and 7.20 km/s, 
respectively, in the region considered, how precisely can the distance 
to the source of the earthquake be determined? (b) Seismic waves from 
underground detonations of nuclear bombs can be used to locate the 
test site and detect violations of test bans. Discuss whether your 
answer to (a) implies a serious limit to such detection. (Note also that 
the uncertainty is greater if there is an uncertainty in the propagation 
speeds of the S- and P-waves.) 


A seismograph as 
described in above 
problem.(credit: 
Oleg Alexandrov) 


Glossary 


longitudinal wave 


a wave in which the disturbance is parallel to the direction of 
propagation 


transverse wave 
a wave in which the disturbance is perpendicular to the direction of 
propagation 


wave velocity 
the speed at which the disturbance moves. Also called the propagation 
velocity or propagation speed 


wavelength 
the distance between adjacent identical parts of a wave 


Superposition and Interference 


e Explain standing waves. 
e Describe the mathematical representation of overtones and beat 
frequency. 


These waves result from the 
superposition of several waves 
from different sources, 
producing a complex pattern. 
(credit: waterborough, 
Wikimedia Commons) 


Most waves do not look very simple. They look more like the waves in 
[link] than like the simple water wave considered in Waves. (Simple waves 
may be created by a simple harmonic oscillation, and thus have a sinusoidal 
shape). Complex waves are more interesting, even beautiful, but they look 
formidable. Most waves appear complex because they result from several 
simple waves adding together. Luckily, the rules for adding waves are quite 
simple. 


When two or more waves atrive at the same point, they superimpose 
themselves on one another. More specifically, the disturbances of waves are 
superimposed when they come together—a phenomenon called 
superposition. Each disturbance corresponds to a force, and forces add. If 
the disturbances are along the same line, then the resulting wave is a simple 


addition of the disturbances of the individual waves—that is, their 
amplitudes add. [link] and [link] illustrate superposition in two special 
cases, both of which produce simple results. 


[link] shows two identical waves that arrive at the same point exactly in 
phase. The crests of the two waves are precisely aligned, as are the troughs. 
This superposition produces pure constructive interference. Because the 
disturbances add, pure constructive interference produces a wave that has 
twice the amplitude of the individual waves, but has the same wavelength. 


[link] shows two identical waves that arrive exactly out of phase—that is, 
precisely aligned crest to trough—producing pure destructive interference. 
Because the disturbances are in the opposite direction for this superposition, 
the resulting amplitude is zero for pure destructive interference—the waves 
completely cancel. 


Pure constructive interference of 

two identical waves produces one 

with twice the amplitude, but the 
same wavelength. 


Resultant 


Pure destructive interference of 
two identical waves produces zero 
amplitude, or complete 
cancellation. 


While pure constructive and pure destructive interference do occur, they 
require precisely aligned identical waves. The superposition of most waves 
produces a combination of constructive and destructive interference and can 
vary from place to place and time to time. Sound from a stereo, for 
example, can be loud in one spot and quiet in another. Varying loudness 
means the sound waves add partially constructively and partially 
destructively at different locations. A stereo has at least two speakers 
creating sound waves, and waves can reflect from walls. All these waves 
superimpose. An example of sounds that vary over time from constructive 
to destructive is found in the combined whine of airplane jets heard by a 
stationary passenger. The combined sound can fluctuate up and down in 
volume as the sound from the two engines varies in time from constructive 
to destructive. These examples are of waves that are similar. 


An example of the superposition of two dissimilar waves is shown in [link]. 
Here again, the disturbances add and subtract, producing a more 
complicated looking wave. 


Resultant \ = y, 


Superposition of 
non-identical 
waves exhibits both 
constructive and 
destructive 
interference. 


Standing Waves 


Sometimes waves do not seem to move; rather, they just vibrate in place. 
Unmoving waves can be seen on the surface of a glass of milk ina 
refrigerator, for example. Vibrations from the refrigerator motor create 
waves on the milk that oscillate up and down but do not seem to move 
across the surface. These waves are formed by the superposition of two or 
more moving waves, such as illustrated in [link] for two identical waves 
moving in opposite directions. The waves move through each other with 
their disturbances adding as they go by. If the two waves have the same 
amplitude and wavelength, then they alternate between constructive and 
destructive interference. The resultant looks like a wave standing in place 
and, thus, is called a standing wave. Waves on the glass of milk are one 
example of standing waves. There are other standing waves, such as on 
guitar strings and in organ pipes. With the glass of milk, the two waves that 
produce standing waves may come from reflections from the side of the 
glass. 


A closer look at earthquakes provides evidence for conditions appropriate 
for resonance, standing waves, and constructive and destructive 
interference. A building may be vibrated for several seconds with a driving 
frequency matching that of the natural frequency of vibration of the 
building—producing a resonance resulting in one building collapsing while 
neighboring buildings do not. Often buildings of a certain height are 
devastated while other taller buildings remain intact. The building height 
matches the condition for setting up a standing wave for that particular 
height. As the earthquake waves travel along the surface of Earth and 
reflect off denser rocks, constructive interference occurs at certain points. 
Often areas closer to the epicenter are not damaged while areas farther 
away are damaged. 


Standing wave created by the superposition of two identical waves 
moving in opposite directions. The oscillations are at fixed locations in 
space and result from alternately constructive and destructive 
interference. 


Standing waves are also found on the strings of musical instruments and are 
due to reflections of waves from the ends of the string. [link] and [link] 
show three standing waves that can be created on a string that is fixed at 
both ends. Nodes are the points where the string does not move; more 


generally, nodes are where the wave disturbance is zero in a standing wave. 
The fixed ends of strings must be nodes, too, because the string cannot 
move there. The word antinode is used to denote the location of maximum 
amplitude in standing waves. Standing waves on strings have a frequency 
that is related to the propagation speed vy of the disturbance on the string. 
The wavelength A is determined by the distance between the points where 
the string is fixed in place. 


The lowest frequency, called the fundamental frequency, is thus for the 
longest wavelength, which is seen to be Ay = 2L. Therefore, the 
fundamental frequency is fy = Uw/A1 = Uw/2L. In this case, the 
overtones or harmonics are multiples of the fundamental frequency. As 
seen in [link], the first harmonic can easily be calculated since Az = L. 
Thus, fo = vw/A2q = Vw /2L = 2f,. Similarly, f; = 3f,, and so on. All of 
these frequencies can be changed by adjusting the tension in the string. The 
greater the tension, the greater vy is and the higher the frequencies. This 
observation is familiar to anyone who has ever observed a string instrument 
being tuned. We will see in later chapters that standing waves are crucial to 
many resonance phenomena, such as in sounding boxes on string 
instruments. 


Antinode- 
Loop y Node 


Fundamental 
f, = Yw A, = 2L 
2L 


The figure shows a string oscillating 
at its fundamental frequency. 


First overtone 


fh = - =2f, A,=L 


Nodes i all 
= 


uU 


Second overtone 


i= oh. = Sf, a, = 24 


First and second harmonic frequencies 
are shown. 


Beats 


Striking two adjacent keys on a piano produces a warbling combination 
usually considered to be unpleasant. The superposition of two waves of 
similar but not identical frequencies is the culprit. Another example is often 
noticeable in jet aircraft, particularly the two-engine variety, while taxiing. 
The combined sound of the engines goes up and down in loudness. This 
varying loudness happens because the sound waves have similar but not 
identical frequencies. The discordant warbling of the piano and the 
fluctuating loudness of the jet engine noise are both due to alternately 
constructive and destructive interference as the two waves go in and out of 


phase. [link] illustrates this graphically. 


Destructive Constructive 


Time 


Beats are produced by the superposition of 
two waves of slightly different frequencies 
but identical amplitudes. The waves 
alternate in time between constructive 
interference and destructive interference, 
giving the resulting wave a time-varying 
amplitude. 


The wave resulting from the superposition of two similar-frequency waves 
has a frequency that is the average of the two. This wave fluctuates in 
amplitude, or beats, with a frequency called the beat frequency. We can 
determine the beat frequency by adding two waves together mathematically. 
Note that a wave can be represented at one point in space as 

Equation: 


an £ 
c= X cos( =" = X cos(2n ft), 


where f = 1/T is the frequency of the wave. Adding two waves that have 
different frequencies but identical amplitudes produces a resultant 
Equation: 


L= 4%. 
More specifically, 
Equation: 


xz = X cos(2n fit) + X cos(2n fot). 


Using a trigonometric identity, it can be shown that 
Equation: 


x = 2X cos(m fgt)cos(2n favet), 


where 
Equation: 


fe =| fi — fe | 


is the beat frequency, and faye is the average of f; and f. These results 
mean that the resultant wave has twice the amplitude and the average 
frequency of the two superimposed waves, but it also fluctuates in overall 
amplitude at the beat frequency fg. The first cosine term in the expression 
effectively causes the amplitude to go up and down. The second cosine term 
is the wave with frequency fave. This result is valid for all types of waves. 
However, if it is a sound wave, providing the two frequencies are similar, 
then what we hear is an average frequency that gets louder and softer (or 
warbles) at the beat frequency. 


Note: 

Making Career Connections 

Piano tuners use beats routinely in their work. When comparing a note 
with a tuning fork, they listen for beats and adjust the string until the beats 
go away (to zero frequency). For example, if the tuning fork has a 256 Hz 
frequency and two beats per second are heard, then the other frequency is 
either 254 or 258 Hz. Most keys hit multiple strings, and these strings are 
actually adjusted until they have nearly the same frequency and give a slow 
beat for richness. Twelve-string guitars and mandolins are also tuned using 
beats. 


While beats may sometimes be annoying in audible sounds, we will find 
that beats have many applications. Observing beats is a very useful way to 
compare similar frequencies. There are applications of beats as apparently 
disparate as in ultrasonic imaging and radar speed traps. 

Exercise: 

Check Your Understanding 


Problem: 


Imagine you are holding one end of a jump rope, and your friend holds 
the other. If your friend holds her end still, you can move your end up 
and down, creating a transverse wave. If your friend then begins to 
move her end up and down, generating a wave in the opposite 
direction, what resultant wave forms would you expect to see in the 
jump rope? 


Solution: 


The rope would alternate between having waves with amplitudes two 
times the original amplitude and reaching equilibrium with no 
amplitude at all. The wavelengths will result in both constructive and 
destructive interference 


Exercise: 
Check Your Understanding 


Problem: Define nodes and antinodes. 
Solution: 


Nodes are areas of wave interference where there is no motion. 
Antinodes are areas of wave interference where the motion is at its 
maximum point. 


Exercise: 
Check Your Understanding 


Problem: 


You hook up a stereo system. When you test the system, you notice 
that in one corner of the room, the sounds seem dull. In another area, 
the sounds seem excessively loud. Describe how the sound moving 
about the room could result in these effects. 


Solution: 


With multiple speakers putting out sounds into the room, and these 
sounds bouncing off walls, there is bound to be some wave 
interference. In the dull areas, the interference is probably mostly 
destructive. In the louder areas, the interference is probably mostly 
constructive. 


Note: 

PhET Explorations: Wave Interference 

Make waves with a dripping faucet, audio speaker, or laser! Add a second 
source or a pair of slits to create an interference pattern. 


Wave 
Interferenc 
e 


Section Summary 


e Superposition is the combination of two waves at the same location. 
e Constructive interference occurs when two identical waves are 
superimposed in phase. 


e Destructive interference occurs when two identical waves are 
superimposed exactly out of phase. 

e A standing wave is one in which two waves superimpose to produce a 
wave that varies in amplitude but does not propagate. 

¢ Nodes are points of no motion in standing waves. 

e An antinode is the location of maximum amplitude of a standing wave. 

e Waves on a string are resonant standing waves with a fundamental 
frequency and can occur at higher multiples of the fundamental, called 
overtones or harmonics. 

e Beats occur when waves of similar frequencies f; and f» are 
superimposed. The resulting amplitude oscillates with a beat frequency 
given by 
Equation: 


fa =| fi — fa |- 


Conceptual Questions 


Exercise: 


Problem: 


Speakers in stereo systems have two color-coded terminals to indicate 
how to hook up the wires. If the wires are reversed, the speaker moves 
in a direction opposite that of a properly connected speaker. Explain 
why it is important to have both speakers connected the same way. 


Problems & Exercises 


Exercise: 


Problem: 


A car has two horns, one emitting a frequency of 199 Hz and the other 
emitting a frequency of 203 Hz. What beat frequency do they produce? 


Solution: 


7 =]4 Hz 
Exercise: 
Problem: 
The middle-C hammer of a piano hits two strings, producing beats of 


1.50 Hz. One of the strings is tuned to 260.00 Hz. What frequencies 
could the other string have? 


Exercise: 
Problem: 
Two tuning forks having frequencies of 460 and 464 Hz are struck 


simultaneously. What average frequency will you hear, and what will 
the beat frequency be? 


Solution: 
462 Hz, 


4 Hz 
Exercise: 
Problem: 
Twin jet engines on an airplane are producing an average sound 


frequency of 4100 Hz with a beat frequency of 0.500 Hz. What are 
their individual frequencies? 


Exercise: 


Problem: 


A wave traveling on a Slinky® that is stretched to 4 m takes 2.4 s to 
travel the length of the Slinky and back again. (a) What is the speed of 
the wave? (b) Using the same Slinky stretched to the same length, a 
standing wave is created which consists of three antinodes and four 
nodes. At what frequency must the Slinky be oscillating? 


Solution: 
(a) 3.33 m/s 


(b) 1.25 Hz 
Exercise: 
Problem: 
Three adjacent keys on a piano (F, F-sharp, and G) are struck 


simultaneously, producing frequencies of 349, 370, and 392 Hz. What 
beat frequencies are produced by this discordant combination? 


Glossary 


antinode 
the location of maximum amplitude in standing waves 


beat frequency 
the frequency of the amplitude fluctuations of a wave 


constructive interference 
when two waves atrive at the same point exactly in phase; that is, the 
crests of the two waves are precisely aligned, as are the troughs 


destructive interference 
when two identical waves arrive at the same point exactly out of 
phase; that is, precisely aligned crest to trough 


fundamental frequency 
the lowest frequency of a periodic waveform 


nodes 
the points where the string does not move; more generally, nodes are 


where the wave disturbance is zero in a standing wave 


overtones 


multiples of the fundamental frequency of a sound 


superposition 
the phenomenon that occurs when two or more waves arrive at the 
same point 


Energy in Waves: Intensity 


e Calculate the intensity and the power of rays and waves. 


The destructive effect of an 
earthquake is palpable 
evidence of the energy carried 
in these waves. The Richter 
scale rating of earthquakes is 
related to both their amplitude 
and the energy they carry. 
(credit: Petty Officer 2nd 
Class Candice Villarreal, U.S. 
Navy) 


All waves carry energy. The energy of some waves can be directly 
observed. Earthquakes can shake whole cities to the ground, performing the 
work of thousands of wrecking balls. 


Loud sounds pulverize nerve cells in the inner ear, causing permanent 
hearing loss. Ultrasound is used for deep-heat treatment of muscle strains. 
A laser beam can burn away a malignancy. Water waves chew up beaches. 


The amount of energy in a wave is related to its amplitude. Large-amplitude 
earthquakes produce large ground displacements. Loud sounds have higher 
pressure amplitudes and come from larger-amplitude source vibrations than 


soft sounds. Large ocean breakers churn up the shore more than small ones. 
More quantitatively, a wave is a displacement that is resisted by a restoring 
force. The larger the displacement x, the larger the force F’ = kx needed to 
create it. Because work W is related to force multiplied by distance (Fx) 
and energy is put into the wave by the work done to create it, the energy in 
a wave is related to amplitude. In fact, a wave’s energy is directly 
proportional to its amplitude squared because 

Equation: 


W « Fx = kx’. 


The energy effects of a wave depend on time as well as amplitude. For 
example, the longer deep-heat ultrasound is applied, the more energy it 
transfers. Waves can also be concentrated or spread out. Sunlight, for 
example, can be focused to burn wood. Earthquakes spread out, so they do 
less damage the farther they get from the source. In both cases, changing 
the area the waves cover has important effects. All these pertinent factors 
are included in the definition of intensity J as power per unit area: 
Equation: 


P 
—_ 
A 


where P is the power carried by the wave through area A. The definition of 
intensity is valid for any energy in transit, including that carried by waves. 
The SI unit for intensity is watts per square meter ( W/ m’). For example, 
infrared and visible energy from the Sun impinge on Earth at an intensity of 
1300 W/ m? just above the atmosphere. There are other intensity-related 
units in use, too. The most common is the decibel. For example, a 90 
decibel sound level corresponds to an intensity of 10~? W/m”. (This 
quantity is not much power per unit area considering that 90 decibels is a 
relatively high sound level. Decibels will be discussed in some detail in a 
later chapter. 


Example: 

Calculating intensity and power: How much energy is in a ray of 
sunlight? 

The average intensity of sunlight on Earth’s surface is about 700 W / ioe 
(a) Calculate the amount of energy that falls on a solar collector having an 
area of 0.500 m? in 4.00 h. 

(b) What intensity would such sunlight have if concentrated by a 
magnifying glass onto an area 200 times smaller than its own? 

Strategy a 

Because power is energy per unit time or P = , the definition of 


: ; : E/t ; : 
intensity can be written as J = £ — ae , and this equation can be solved 
for E with the given information. 


Solution a 


1. Begin with the equation that states the definition of intensity: 


Equation: 
ua 
f= —. 
A 
2. Replace P with its equivalent E /t: 
Equation: 
E/t 
eal 
A 
3. Solve for EB: 
Equation: 
f= At. 


4. Substitute known values into the equation: 
Equation: 


= (700 W/m”) (0.500 m?) [(4.00 h)(3600 s/h)]. 


5. Calculate to find / and convert units: 


Equation: 


del ce Al, 


Discussion a 

The energy falling on the solar collector in 4 h in part is enough to be 
useful—for example, for heating a significant amount of water. 
Strategy b 

Taking a ratio of new intensity to old intensity and using primes for the 
new quantities, we will find that it depends on the ratio of the areas. All 
other quantities will cancel. 

Solution b 


de 


Take the ratio of intensities, which yields: 
Equation: 


ee el Th Ib P= P 
ii — P/A Al € powers Cancel DECAUSE = 


. Identify the knowns: 


Equation: 
A = 200A), 
Equation: 
Le 200 
ee : 
. Substitute known quantities: 
Equation: 


I= 2001 = 200 (700 W/m’). 


. Calculate to find J7: 


Equation: 


I'= 1.40 x 10° W/m’. 


Discussion b 
Decreasing the area increases the intensity considerably. The intensity of 
the concentrated sunlight could even start a fire. 


Example: 

Determine the combined intensity of two waves: Perfect constructive 
interference 

If two identical waves, each having an intensity of 1.00 W/ m’, interfere 
perfectly constructively, what is the intensity of the resulting wave? 
Strategy 

We know from Superposition and Interference that when two identical 
waves, which have equal amplitudes X, interfere perfectly constructively, 
the resulting wave has an amplitude of 2X. Because a wave’s intensity is 
proportional to amplitude squared, the intensity of the resulting wave is 
four times as great as in the individual waves. 

Solution 


1. Recall that intensity is proportional to amplitude squared. 
2. Calculate the new amplitude: 
Equation: 


It x (XN)? = (2X)* = 4X". 


3. Recall that the intensity of the old amplitude was: 
Equation: 


Teco 


4. Take the ratio of new intensity to the old intensity. This gives: 
Equation: 


5. Calculate to find J7: 
Equation: 


Ir = 4I = 4.00 W/m’. 


Discussion 

The intensity goes up by a factor of 4 when the amplitude doubles. This 
answer is a little disquieting. The two individual waves each have 
intensities of 1.00 W/ m’, yet their sum has an intensity of 4.00 W/ m?’, 
which may appear to violate conservation of energy. This violation, of 
course, cannot happen. What does happen is intriguing. The area over 
which the intensity is 4.00 W/ m’ is much less than the area covered by 
the two waves before they interfered. There are other areas where the 
intensity is zero. The addition of waves is not as simple as our first look in 
Superposition and Interference suggested. We actually get a pattern of both 
constructive interference and destructive interference whenever two waves 
are added. For example, if we have two stereo speakers putting out 

1.00 W/ m? each, there will be places in the room where the intensity is 
4.00 W/ m’, other places where the intensity is zero, and others in 
between. [link] shows what this interference might look like. We will 
pursue interference patterns elsewhere in this text. 


C7 <<: e_narefaction 
ay ely nate 
a Ms, 

- 4 ~--= 4) = Constructive 


ww = 
ral 7 


These stereo speakers produce both 
constructive interference and 
destructive interference in the 

room, a property common to the 


superposition of all types of waves. 
The shading is proportional to 
intensity. 


Exercise: 
Check Your Understanding 


Problem: 


Which measurement of a wave is most important when determining 
the wave's intensity? 


Solution: 
Amplitude, because a wave’s energy is directly proportional to its 
amplitude squared. 

Section Summary 

Intensity is defined to be the power per unit area: 


I = = and has units of W/m’. 


Conceptual Questions 


Exercise: 
Problem: 
Two identical waves undergo pure constructive interference. Is the 


resultant intensity twice that of the individual waves? Explain your 
answer. 


Exercise: 


Problem: 


Circular water waves decrease in amplitude as they move away from 
where a rock is dropped. Explain why. 


Problems & Exercises 


Exercise: 


Problem: Medical Application 


Ultrasound of intensity 1.50 x 102 W / m’ is produced by the 
rectangular head of a medical imaging device measuring 3.00 by 5.00 
cm. What is its power output? 


Solution: 


0.225 W 
Exercise: 
Problem: 
The low-frequency speaker of a stereo set has a surface area of 
0.05 m? and produces 1W of acoustical power. What is the intensity at 


the speaker? If the speaker projects sound uniformly in all directions, 
at what distance from the speaker is the intensity 0.1 W/ m?? 


Exercise: 


Problem: 


To increase intensity of a wave by a factor of 50, by what factor should 
the amplitude be increased? 


Solution: 


7.07 


Exercise: 


Problem: Engineering Application 


A device called an insolation meter is used to measure the intensity of 
sunlight has an area of 100 cm? and registers 6.50 W. What is the 
intensity in W/m?? 


Exercise: 


Problem: Astronomy Application 


Energy from the Sun arrives at the top of the Earth’s atmosphere with 
an intensity of 1.30 kW/m’. How long does it take for 1.8 x 10° J to 
arrive on an area of 1.00 m?? 


Solution: 


16.0 d 
Exercise: 


Problem: 


Suppose you have a device that extracts energy from ocean breakers in 
direct proportion to their intensity. If the device produces 10.0 kW of 
power on a day when the breakers are 1.20 m high, how much will it 
produce when they are 0.600 m high? 


Solution: 
2.50 kW 
Exercise: 
Problem: Engineering Application 


(a) A photovoltaic array of (solar cells) is 10.0% efficient in gathering 
solar energy and converting it to electricity. If the average intensity of 


sunlight on one day is 700 W/ m”, what area should your array have 
to gather energy at the rate of 100 W? (b) What is the maximum cost 
of the array if it must pay for itself in two years of operation averaging 
10.0 hours per day? Assume that it earns money at the rate of 9.00 ¢ 
per kilowatt-hour. 


Exercise: 
Problem: 
A microphone receiving a pure sound tone feeds an oscilloscope, 
producing a wave on its screen. If the sound intensity is originally 


2.00 x 10° W / m”, but is turned up until the amplitude increases by 
30.0%, what is the new intensity? 


Solution: 


3.38 x 10° W/m? 


Exercise: 


Problem: Medical Application 


(a) What is the intensity in W/ m” of a laser beam used to burn away 
cancerous tissue that, when 90.0% absorbed, puts 500 J of energy into 
a circular spot 2.00 mm in diameter in 4.00 s? (b) Discuss how this 
intensity compares to the average intensity of sunlight (about 

700 W/ m? ) and the implications that would have if the laser beam 
entered your eye. Note how your answer depends on the time duration 
of the exposure. 


Glossary 


intensity 
power per unit area 


Introduction to the Physics of Hearing 
class="introduction" 


This tree fell 
some time 
ago. When it 
fell, atoms in 
the air were 
disturbed. 
Physicists 
would call 
this 
disturbance 
sound 
whether 
someone was 
around to 
hear it or not. 
(credit: B.A. 
Bowen 
Photography 
) 


If a tree falls in the forest and no one is there to hear it, does it make a 
sound? The answer to this old philosophical question depends on how you 
define sound. If sound only exists when someone is around to perceive it, 
then there was no sound. However, if we define sound in terms of physics; 
that is, a disturbance of the atoms in matter transmitted from its origin 
outward (in other words, a wave), then there was a sound, even if nobody 
was around to hear it. 


Such a wave is the physical phenomenon we call sound. Its perception is 
hearing. Both the physical phenomenon and its perception are interesting 
and will be considered in this text. We shall explore both sound and 
hearing; they are related, but are not the same thing. We will also explore 
the many practical uses of sound waves, such as in medical imaging. 


Sound 


e Define sound and hearing. 
e Describe sound as a longitudinal wave. 


This glass has been 
shattered by a high- 
intensity sound 
wave of the same 
frequency as the 
resonant frequency 
of the glass. While 
the sound is not 
visible, the effects 
of the sound prove 
its existence. 
(credit: ||read]||, 
Flickr) 


Sound can be used as a familiar illustration of waves. Because hearing is 
one of our most important senses, it is interesting to see how the physical 
properties of sound correspond to our perceptions of it. Hearing is the 
perception of sound, just as vision is the perception of visible light. But 
sound has important applications beyond hearing. Ultrasound, for example, 
is not heard but can be employed to form medical images and is also used in 
treatment. 


The physical phenomenon of sound is defined to be a disturbance of matter 
that is transmitted from its source outward. Sound is a wave. On the atomic 
scale, it is a disturbance of atoms that is far more ordered than their thermal 
motions. In many instances, sound is a periodic wave, and the atoms 
undergo simple harmonic motion. In this text, we shall explore such 
periodic sound waves. 


A vibrating string produces a sound wave as illustrated in [link], [link], and 
[link]. As the string oscillates back and forth, it transfers energy to the air, 
mostly as thermal energy created by turbulence. But a small part of the 
string’s energy goes into compressing and expanding the surrounding air, 
creating slightly higher and lower local pressures. These compressions 
(high pressure regions) and rarefactions (low pressure regions) move out as 
longitudinal pressure waves having the same frequency as the string—they 
are the disturbance that is a sound wave. (Sound waves in air and most 
fluids are longitudinal, because fluids have almost no shear strength. In 
solids, sound waves can be both transverse and longitudinal.) [link] shows a 
graph of gauge pressure versus distance from the vibrating string. 


Compression 


A vibrating 
string moving to 
the right 
compresses the 
air in front of it 
and expands the 
air behind it. 


As the string 
moves to the 
left, it creates 
another 
compression and 
rarefaction as 
the ones on the 
right move away 
from the string. 


ae 


After many 
vibrations, there are 
a series of 
compressions and 
rarefactions 
moving out from 
the string asa 
sound wave. The 
graph shows gauge 
pressure versus 


distance from the 
source. Pressures 
vary only slightly 
from atmospheric 
for ordinary 
sounds. 


The amplitude of a sound wave decreases with distance from its source, 
because the energy of the wave is spread over a larger and larger area. But it 
is also absorbed by objects, such as the eardrum in [link], and converted to 
thermal energy by the viscosity of air. In addition, during each compression 
a little heat transfers to the air and during each rarefaction even less heat 
transfers from the air, so that the heat transfer reduces the organized 
disturbance into random thermal motions. (These processes can be viewed 
as a manifestation of the second law of thermodynamics presented in 
Introduction to the Second Law of Thermodynamics: Heat Engines and 
Their Efficiency.) Whether the heat transfer from compression to 
rarefaction is significant depends on how far apart they are—that is, it 
depends on wavelength. Wavelength, frequency, amplitude, and speed of 
propagation are important for sound, as they are for all waves. 


Compression 


Sound wave 
compressions and 
rarefactions travel 

up the ear canal and 


force the eardrum 
to vibrate. There is 
a net force on the 
eardrum, since the 
sound wave 
pressures differ 
from the 
atmospheric 
pressure found 
behind the 
eardrum. A 
complicated 
mechanism 
converts the 
vibrations to nerve 
impulses, which are 
perceived by the 
person. 


Note: 


PhET Explorations: Wave Interference 

WMake waves with a dripping faucet, audio speaker, or laser! Add a 
second source or a pair of slits to create an interference pattern. 
https://archive.cnx.org/specials/2fe7ad15-b00e-4402-b068- 


£f503985a18f/wave-interference/ 


Section Summary 


e Sound is a disturbance of matter that is transmitted from its source 
outward. 
e Sound is one type of wave. 


e Hearing is the perception of sound. 


Glossary 


sound 
a disturbance of matter that is transmitted from its source outward 


hearing 
the perception of sound 


Speed of Sound, Frequency, and Wavelength 


e Define pitch. 

e Describe the relationship between the speed of sound, its frequency, 
and its wavelength. 

e Describe the effects on the speed of sound as it travels through various 
media. 

e Describe the effects of temperature on the speed of sound. 


When a firework 
explodes, the light 
energy is perceived 

before the sound 

energy. Sound 
travels more slowly 
than light does. 

(credit: Dominic 

Alves, Flickr) 


Sound, like all waves, travels at a certain speed and has the properties of 
frequency and wavelength. You can observe direct evidence of the speed of 
sound while watching a fireworks display. The flash of an explosion is seen 
well before its sound is heard, implying both that sound travels at a finite 
speed and that it is much slower than light. You can also directly sense the 
frequency of a sound. Perception of frequency is called pitch. The 
wavelength of sound is not directly sensed, but indirect evidence is found in 
the correlation of the size of musical instruments with their pitch. Small 


instruments, such as a piccolo, typically make high-pitch sounds, while 
large instruments, such as a tuba, typically make low-pitch sounds. High 
pitch means small wavelength, and the size of a musical instrument is 
directly related to the wavelengths of sound it produces. So a small 
instrument creates short-wavelength sounds. Similar arguments hold that a 
large instrument creates long-wavelength sounds. 


The relationship of the speed of sound, its frequency, and wavelength is the 
same as for all waves: 
Equation: 


Uw = fr, 


where Uy is the speed of sound, f is its frequency, and J is its wavelength. 
The wavelength of a sound is the distance between adjacent identical parts 
of a wave—for example, between adjacent compressions as illustrated in 
[link]. The frequency is the same as that of the source and is the number of 
waves that pass a point per unit time. 


A sound wave emanates from a source 
vibrating at a frequency f, propagates 
at v,,, and has a wavelength A. 


[link] makes it apparent that the speed of sound varies greatly in different 
media. The speed of sound in a medium is determined by a combination of 
the medium’s rigidity (or compressibility in gases) and its density. The 


more rigid (or less compressible) the medium, the faster the speed of sound. 
This observation is analogous to the fact that the frequency of a simple 
harmonic motion is directly proportional to the stiffness of the oscillating 
object. The greater the density of a medium, the slower the speed of sound. 
This observation is analogous to the fact that the frequency of a simple 
harmonic motion is inversely proportional to the mass of the oscillating 
object. The speed of sound in air is low, because air is compressible. 
Because liquids and solids are relatively rigid and very difficult to 
compress, the speed of sound in such media is generally greater than in 
gases. 


Medium Vy(m/s) 
Gases at 0°C' 

Air 331 
Carbon dioxide 259 
Oxygen 316 
Helium 965 
Hydrogen 1290 
Liquids at 20°C’ 

Ethanol 1160 
Mercury 1450 


Water, fresh 1480 


Medium Vy(m/s) 
Sea water 1540 
Human tissue 1540 


Solids (longitudinal or bulk) 


Vulcanized rubber 54 
Polyethylene 920 
Marble 3810 
Glass, Pyrex 5640 
Lead 1960 
Aluminum 5120 
Steel 5960 


Speed of Sound in Various Media 


Earthquakes, essentially sound waves in Earth’s crust, are an interesting 
example of how the speed of sound depends on the rigidity of the medium. 
Earthquakes have both longitudinal and transverse components, and these 
travel at different speeds. The bulk modulus of granite is greater than its 
shear modulus. For that reason, the speed of longitudinal or pressure waves 
(P-waves) in earthquakes in granite is significantly higher than the speed of 
transverse or shear waves (S-waves). Both components of earthquakes 
travel slower in less rigid material, such as sediments. P-waves have speeds 
of 4 to 7 km/s, and S-waves correspondingly range in speed from 2 to 5 
km/s, both being faster in more rigid material. The P-wave gets 
progressively farther ahead of the S-wave as they travel through Earth’s 
crust. The time between the P- and S-waves is routinely used to determine 
the distance to their source, the epicenter of the earthquake. 


The speed of sound is affected by temperature in a given medium. For air at 
sea level, the speed of sound is given by 
Equation: 


T 


where the temperature (denoted as 7’) is in units of kelvin. The speed of 
sound in gases is related to the average speed of particles in the gas, Urms, 
and that 

Equation: 


where k is the Boltzmann constant (1.38 x 10°73 J /K) and m is the mass 
of each (identical) particle in the gas. So, it is reasonable that the speed of 
sound in air and other gases should depend on the square root of 
temperature. While not negligible, this is not a strong dependence. At 0°C, 
the speed of sound is 331 m/s, whereas at 20.0°C it is 343 m/s, less than a 
4% increase. [link] shows a use of the speed of sound by a bat to sense 
distances. Echoes are also used in medical imaging. 


(4 


)}) 


A bat uses sound echoes 
to find its way about and 
to catch prey. The time 
for the echo to return is 
directly proportional to 
the distance. 


One of the more important properties of sound is that its speed is nearly 
independent of frequency. This independence is certainly true in open air 
for sounds in the audible range of 20 to 20,000 Hz. If this independence 
were not true, you would certainly notice it for music played by a marching 
band in a football stadium, for example. Suppose that high-frequency 
sounds traveled faster—then the farther you were from the band, the more 
the sound from the low-pitch instruments would lag that from the high-pitch 
ones. But the music from all instruments arrives in cadence independent of 
distance, and so all frequencies must travel at nearly the same speed. Recall 
that 

Equation: 


i= 7X: 


In a given medium under fixed conditions, v,, is constant, so that there is a 
relationship between f and A; the higher the frequency, the smaller the 
wavelength. See [link] and consider the following example. 


High f, small 2 


Small f, large 2 


Because they travel 
at the same speed 
in a given medium, 
low-frequency 
sounds must have a 
greater wavelength 
than high- 
frequency sounds. 


Here, the lower- 
frequency sounds 
are emitted by the 

large speaker, 
called a woofer, 
while the higher- 
frequency sounds 
are emitted by the 
small speaker, 
called a tweeter. 


Example: 

Calculating Wavelengths: What Are the Wavelengths of Audible 
Sounds? 

Calculate the wavelengths of sounds at the extremes of the audible range, 
20 and 20,000 Hz, in 30.0°C air. (Assume that the frequency values are 
accurate to two significant figures.) 

Strategy 

To find wavelength from frequency, we can use vy = fA. 

Solution 


1. Identify knowns. The value for v,,, is given by 
Equation: 


Ue — (331 m/s} 73K" 


2. Convert the temperature into kelvin and then enter the temperature 
into the equation 
Equation: 


303 K 


Uy = (Bat m/s) 273 K 


= 348.7 m/s. 


3. Solve the relationship between speed and wavelength for A: 


Equation: 
Uw 
A=—. 
vi 
4. Enter the speed and the minimum frequency to give the maximum 
wavelength: 
Equation: 
348.7 
sah 
20 Hz 
5. Enter the speed and the maximum frequency to give the minimum 
wavelength: 
Equation: 
348.7 
ee ee BENE vane 
20,000 Hz 
Discussion 


Because the product of f multiplied by A equals a constant, the smaller f 
is, the larger A must be, and vice versa. 


The speed of sound can change when sound travels from one medium to 
another. However, the frequency usually remains the same because it is like 
a driven oscillation and has the frequency of the original source. If vy 
changes and f remains the same, then the wavelength A must change. That 
is, because vy, = fA, the higher the speed of a sound, the greater its 
wavelength for a given frequency. 


Note: 
Making Connections: Take-Home Investigation—Voice as a Sound Wave 


Suspend a sheet of paper so that the top edge of the paper is fixed and the 
bottom edge is free to move. You could tape the top edge of the paper to 
the edge of a table. Gently blow near the edge of the bottom of the sheet 
and note how the sheet moves. Speak softly and then louder such that the 
sounds hit the edge of the bottom of the paper, and note how the sheet 
moves. Explain the effects. 


Exercise: 
Check Your Understanding 


Problem: 


Imagine you observe two fireworks explode. You hear the explosion of 
one as soon as you see it. However, you see the other firework for 
several milliseconds before you hear the explosion. Explain why this is 
so. 


Solution: 


Sound and light both travel at definite speeds. The speed of sound is 
slower than the speed of light. The first firework is probably very close 
by, so the speed difference is not noticeable. The second firework is 
farther away, so the light arrives at your eyes noticeably sooner than 
the sound wave atrives at your ears. 


Exercise: 
Check Your Understanding 


Problem: 
You observe two musical instruments that you cannot identify. One 


plays high-pitch sounds and the other plays low-pitch sounds. How 
could you determine which is which without hearing either of them 


play? 


Solution: 


Compare their sizes. High-pitch instruments are generally smaller than 
low-pitch instruments because they generate a smaller wavelength. 


Section Summary 


The relationship of the speed of sound v,,, its frequency f, and its 
wavelength J is given by 
Equation: 


U_ = fA; 


which is the same relationship given for all waves. 


In air, the speed of sound is related to air temperature 7" by 
Equation: 


T 
Uw = (331 m/s) soo 


Uy is the same for all frequencies and wavelengths. 


Conceptual Questions 


Exercise: 


Problem: 


How do sound vibrations of atoms differ from thermal motion? 
Exercise: 

Problem: 

When sound passes from one medium to another where its propagation 


speed is different, does its frequency or wavelength change? Explain 
your answer briefly. 


Problems & Exercises 


Exercise: 
Problem: 


When poked by a spear, an operatic soprano lets out a 1200-Hz shriek. 
What is its wavelength if the speed of sound is 345 m/s? 


Solution: 


0.288 m 
Exercise: 
Problem: 
What frequency sound has a 0.10-m wavelength when the speed of 
sound is 340 m/s? 
Exercise: 
Problem: 


Calculate the speed of sound on a day when a 1500 Hz frequency has a 
wavelength of 0.221 m. 


Solution: 


332 m/s 
Exercise: 
Problem: 
(a) What is the speed of sound in a medium where a 100-kHz 


frequency produces a 5.96-cm wavelength? (b) Which substance in 
[link] is this likely to be? 


Exercise: 


Problem: 


Show that the speed of sound in 20.0°C air is 343 m/s, as claimed in 
the text. 


Solution: 
Equation: 
Vy = (331 m/s)4/ aox = (331 m/s) / 28K 
= 343 m/s 
Exercise: 
Problem: 


Air temperature in the Sahara Desert can reach 56.0°C (about 134°F). 
What is the speed of sound in air at that temperature? 


Exercise: 
Problem: 
Dolphins make sounds in air and water. What is the ratio of the 


wavelength of a sound in air to its wavelength in seawater? Assume air 
temperature is 20.0°C. 


Solution: 


0.223 
Exercise: 
Problem: 
A sonar echo returns to a submarine 1.20 s after being emitted. What is 


the distance to the object creating the echo? (Assume that the 
submarine is in the ocean, not in fresh water.) 


Exercise: 


Problem: 


(a) If a submarine’s sonar can measure echo times with a precision of 
0.0100 s, what is the smallest difference in distances it can detect? 
(Assume that the submarine is in the ocean, not in fresh water.) 


(b) Discuss the limits this time resolution imposes on the ability of the 
sonar system to detect the size and shape of the object creating the 
echo. 


Solution: 
(a) 7.70 m 


(b) This means that sonar is good for spotting and locating large 
objects, but it isn’t able to resolve smaller objects, or detect the 
detailed shapes of objects. Objects like ships or large pieces of 
airplanes can be found by sonar, while smaller pieces must be found by 
other means. 


Exercise: 


Problem: 


A physicist at a fireworks display times the lag between seeing an 
explosion and hearing its sound, and finds it to be 0.400 s. (a) How far 
away is the explosion if air temperature is 24.0°C and if you neglect 
the time taken for light to reach the physicist? (b) Calculate the 
distance to the explosion taking the speed of light into account. Note 
that this distance is negligibly greater. 


Exercise: 


Problem: 


Suppose a bat uses sound echoes to locate its insect prey, 3.00 m away. 
(See [link].) (a) Calculate the echo times for temperatures of 5.00°C 
and 35.0°C. (b) What percent uncertainty does this cause for the bat in 
locating the insect? (c) Discuss the significance of this uncertainty and 
whether it could cause difficulties for the bat. (In practice, the bat 
continues to use sound as it closes in, eliminating most of any 
difficulties imposed by this and other effects, such as motion of the 


prey.) 

Solution: 

(a) 18.0 ms, 17.1 ms 
(b) 5.00% 


(c) This uncertainty could definitely cause difficulties for the bat, if it 
didn’t continue to use sound as it closed in on its prey. A 5% 
uncertainty could be the difference between catching the prey around 
the neck or around the chest, which means that it could miss grabbing 
its prey. 


Glossary 


pitch 
the perception of the frequency of a sound 


Sound Intensity and Sound Level 


¢ Define intensity, sound intensity, and sound pressure level. 
e Calculate sound intensity levels in decibels (dB). 


Noise on crowded 
roadways like this one in 
Delhi makes it hard to 
hear others unless they 
shout. (credit: Lingaraj G 
J, Flickr) 


In a quiet forest, you can sometimes hear a single leaf fall to the ground. 
After settling into bed, you may hear your blood pulsing through your ears. 
But when a passing motorist has his stereo turned up, you cannot even hear 
what the person next to you in your car is saying. We are all very familiar 
with the loudness of sounds and aware that they are related to how 
energetically the source is vibrating. In cartoons depicting a screaming 
person (or an animal making a loud noise), the cartoonist often shows an 
open mouth with a vibrating uvula, the hanging tissue at the back of the 
mouth, to suggest a loud sound coming from the throat [link]. High noise 
exposure is hazardous to hearing, and it is common for musicians to have 
hearing losses that are sufficiently severe that they interfere with the 
musicians’ abilities to perform. The relevant physical quantity is sound 
intensity, a concept that is valid for all sounds whether or not they are in the 
audible range. 


Intensity is defined to be the power per unit area carried by a wave. Power 
is the rate at which energy is transferred by the wave. In equation form, 
intensity J is 

Equation: 


where P is the power through an area A. The SI unit for J is W/ m’. The 
intensity of a sound wave is related to its amplitude squared by the 
following relationship: 

Equation: 


(Ap) 
2pvy 


es 


Here Ap is the pressure variation or pressure amplitude (half the difference 
between the maximum and minimum pressure in the sound wave) in units 
of pascals (Pa) or N/ m?. (We are using a lower case p for pressure to 
distinguish it from power, denoted by P above.) The energy (as kinetic 
energy mae ) of an oscillating element of air due to a traveling sound wave 
is proportional to its amplitude squared. In this equation, p is the density of 
the material in which the sound wave travels, in units of kg/ m’°, and v,, is 
the speed of sound in the medium, in units of m/s. The pressure variation is 
proportional to the amplitude of the oscillation, and so I varies as (Ap)* 
({link]). This relationship is consistent with the fact that the sound wave is 
produced by some vibration; the greater its pressure amplitude, the more the 
air is compressed in the sound it creates. 


Graphs of the 
gauge pressures in 
two sound waves of 
different intensities. 
The more intense 
sound is produced 
by a source that has 
larger-amplitude 
oscillations and has 
greater pressure 
maxima and 
minima. Because 
pressures are higher 
in the greater- 
intensity sound, it 
can exert larger 
forces on the 
objects it 
encounters. 


Sound intensity levels are quoted in decibels (dB) much more often than 
sound intensities in watts per meter squared. Decibels are the unit of choice 
in the scientific literature as well as in the popular media. The reasons for 
this choice of units are related to how we perceive sounds. How our ears 
perceive sound can be more accurately described by the logarithm of the 


intensity rather than directly to the intensity. The sound intensity level ( in 
decibels of a sound having an intensity J in watts per meter squared is 
defined to be 

Equation: 


8 (dB) = 10 lotwo( 7) 


where Jy = 10 '? W is m? is a reference intensity. In particular, [o is the 
lowest or threshold intensity of sound a person with normal hearing can 
perceive at a frequency of 1000 Hz. Sound intensity level is not the same as 
intensity. Because ( is defined in terms of a ratio, it is a unitless quantity 
telling you the level of the sound relative to a fixed standard (i0r* W/ m?, 
in this case). The units of decibels (dB) are used to indicate this ratio is 
multiplied by 10 in its definition. The bel, upon which the decibel is based, 
is named for Alexander Graham Bell, the inventor of the telephone. 


Sound 

intensity 

level B Intensity 

(dB) I(W/m2) Example/effect 

0 ie ak Oa Threshold of hearing at 1000 Hz 
10 i<i0 Rustle of leaves 

20 Leis ° Whisper at 1 m distance 


30 1x 10° Quiet home 


Sound 
intensity 
level B 
(dB) 

40 

50 

60 

70 


80 


90 


100 


110 


120 


140 


160 


Intensity 
I(W/m) 


1x 10° 
1x 10° 
1x 10° 
1x 10° 


1x 10% 


1x 102 


1 x 102 


1 x 104 


Example/effect 

Average home 

Average office, soft music 

Normal conversation 

Noisy office, busy traffic 

Loud radio, classroom lecture 

Inside a heavy truck; damage from 
prolonged exposure| footnote | 

Several government agencies and 
health-related professional associations 
recommend that 85 dB not be exceeded 


for 8-hour daily exposures in the 
absence of hearing protection. 


Noisy factory, siren at 30 m; damage 
from 8 h per day exposure 


Damage from 30 min per day exposure 


Loud rock concert, pneumatic chipper at 
2 m; threshold of pain 


Jet airplane at 30 m; severe pain, 
damage in seconds 


Bursting of eardrums 


Sannd Intensitv T.evels and Intensities 


acai a el 


The decibel level of a sound having the threshold intensity of 10” W/ m? 
is B = 0 dB, because log,,1 = 0. That is, the threshold of hearing is 0 
decibels. [link] gives levels in decibels and intensities in watts per meter 
squared for some familiar sounds. 


One of the more striking things about the intensities in [link] is that the 
intensity in watts per meter squared is quite small for most sounds. The ear 
is sensitive to as little as a trillionth of a watt per meter squared—even more 
impressive when you realize that the area of the eardrum is only about 

1 cm?, so that only 10°! W falls on it at the threshold of hearing! Air 
molecules in a sound wave of this intensity vibrate over a distance of less 
oe one molecular diameter, and the gauge pressures involved are less than 
10° atm. 


Another impressive feature of the sounds in [link] is their numerical range. 
Sound intensity varies by a factor of 10'? from threshold to a sound that 
causes damage in seconds. You are unaware of this tremendous range in 
sound intensity because how your ears respond can be described 
approximately as the logarithm of intensity. Thus, sound intensity levels in 
decibels fit your experience better than intensities in watts per meter 
squared. The decibel scale is also easier to relate to because most people are 
more accustomed to dealing with numbers such as 0, 53, or 120 than 
numbers such as 1.00 x 10°". 


One more observation readily verified by examining [link] or using 


2 

= 3,,__ is that each factor of 10 in intensity corresponds to 10 dB. For 
example, a 90 dB sound compared with a 60 dB sound is 30 dB greater, or 
three factors of 10 (that is, 10° times) as intense. Another example is that if 
one sound is 10’ as intense as another, it is 70 dB higher. See [link]. 


I/Ty Bo- Bi 


2.0 3.0 dB 
5.0 7.0 dB 
10.0 10.0 dB 


Ratios of Intensities and Corresponding Differences in Sound Intensity 
Levels 


Example: 

Calculating Sound Intensity Levels: Sound Waves 

Calculate the sound intensity level in decibels for a sound wave traveling 
in air at O°C and having a pressure amplitude of 0.656 Pa. 

Strategy 

We are given Ap, so we can calculate J using the equation 

I = (Ap)?/(2pv,,)”. Using I, we can calculate A straight from its 
definition in G (dB) = 10 log, (Z/Ip). 

Solution 

(1) Identify knowns: 

Sound travels at 331 m/s in air at 0°C. 

Air has a density of 1.29 kg/ m’ at atmospheric pressure and 0°C. 

(2) Enter these values and the pressure amplitude into I = (Ap)”/ (2pv,): 
Equation: 


Ap)” 0.656 Pa)? 
r-! P) a0 SOE) Espacio Wie 


2PUy 9 (1.29 kg/m’) (331 m/s) 


(3) Enter the value for J and the known value for J into 

6 (dB) = 10 log, )(I/Jo). Calculate to find the sound intensity level in 
decibels: 

Equation: 


10 log,9(5.04 x 10°) = 10 (8.70) dB = 87 dB. 


Discussion 

This 87 dB sound has an intensity five times as great as an 80 dB sound. 
So a factor of five in intensity corresponds to a difference of 7 dB in sound 
intensity level. This value is true for any intensities differing by a factor of 
five. 


Example: 

Change Intensity Levels of a Sound: What Happens to the Decibel 
Level? 

Show that if one sound is twice as intense as another, it has a sound level 
about 3 dB higher. 

Strategy 

You are given that the ratio of two intensities is 2 to 1, and are then asked 
to find the difference in their sound levels in decibels. You can solve this 
problem using of the properties of logarithms. 


Solution 
(1) Identify knowns: 
The ratio of the two intensities is 2 to 1, or: 
Equation: 
2 _ 900. 
i 


We wish to show that the difference in sound levels is about 3 dB. That is, 
we want to show: 
Equation: 


Bo — 6, = 3 dB. 


Note that: 
Equation: 


b 
logigb — log, 9a = logio (=) 


(2) Use the definition of @ to get: 
Equation: 


I. 

Bo — 81 = 10 lotio( = 10 log;,2.00 = 10 (0.301) dB. 
1 

Thus, 

Equation: 


Bz — Pi = 3.01 dB. 


Discussion 

This means that the two sound intensity levels differ by 3.01 dB, or about 3 
dB, as advertised. Note that because only the ratio [2/I; is given (and not 
the actual intensities), this result is true for any intensities that differ by a 
factor of two. For example, a 56.0 dB sound is twice as intense as a 53.0 
dB sound, a 97.0 dB sound is half as intense as a 100 dB sound, and so on. 


It should be noted at this point that there is another decibel scale in use, 
called the sound pressure level, based on the ratio of the pressure 
amplitude to a reference pressure. This scale is used particularly in 
applications where sound travels in water. It is beyond the scope of most 
introductory texts to treat this scale because it is not commonly used for 
sounds in air, but it is important to note that very different decibel levels 
may be encountered when sound pressure levels are quoted. For example, 
ocean noise pollution produced by ships may be as great as 200 dB 
expressed in the sound pressure level, where the more familiar sound 
intensity level we use here would be something under 140 dB for the same 
sound. 


Note: 
Take-Home Investigation: Feeling Sound 


Find a CD player and a CD that has rock music. Place the player on a light 
table, insert the CD into the player, and start playing the CD. Place your 
hand gently on the table next to the speakers. Increase the volume and note 
the level when the table just begins to vibrate as the rock music plays. 
Increase the reading on the volume control until it doubles. What has 
happened to the vibrations? 


Exercise: 
Check Your Understanding 


Problem: 


Describe how amplitude is related to the loudness of a sound. 


Solution: 


Amplitude is directly proportional to the experience of loudness. As 
amplitude increases, loudness increases. 


Exercise: 
Check Your Understanding 


Problem: 

Identify common sounds at the levels of 10 dB, 50 dB, and 100 dB. 
Solution: 

10 dB: Running fingers through your hair. 

50 dB: Inside a quiet home with no television or radio. 


100 dB: Take-off of a jet plane. 


Section Summary 


e Intensity is the same for a sound wave as was defined for all waves; it 
is 
Equation: 


where P is the power crossing area A. The SI unit for I is watts per 
meter squared. The intensity of a sound wave is also related to the 
pressure amplitude Ap 

Equation: 


(Ap)’ 
20Vy, 


) 


where p is the density of the medium in which the sound wave travels 
and vy is the speed of sound in the medium. 


e Sound intensity level in units of decibels (dB) is 
Equation: 


8 (dB) = 10 logu( =), 


where I) = 10°!” W/m’ is the threshold intensity of hearing. 


Conceptual Questions 


Exercise: 


Problem: 


Six members of a synchronized swim team wear earplugs to protect 
themselves against water pressure at depths, but they can still hear the 
music and perform the combinations in the water perfectly. One day, 
they were asked to leave the pool so the dive team could practice a few 
dives, and they tried to practice on a mat, but seemed to have a lot 
more difficulty. Why might this be? 


Exercise: 
Problem: 
A community is concerned about a plan to bring train service to their 
downtown from the town’s outskirts. The current sound intensity level, 
even though the rail yard is blocks away, is 70 dB downtown. The 
mayor assures the public that there will be a difference of only 30 dB 


in sound in the downtown area. Should the townspeople be concerned? 
Why? 


Problems & Exercises 


Exercise: 


Problem: 


What is the intensity in watts per meter squared of 85.0-dB sound? 


Solution: 
Equation: 


3.16 x 10 * W/m? 


Exercise: 


Problem: 


The warning tag on a lawn mower states that it produces noise at a 

level of 91.0 dB. What is this in watts per meter squared? 
Exercise: 

Problem: 


A sound wave traveling in 20°C air has a pressure amplitude of 0.5 Pa. 
What is the intensity of the wave? 


Solution: 
Equation: 


3.04 x 104 W/m’ 


Exercise: 


Problem: 


What intensity level does the sound in the preceding problem 
correspond to? 


Exercise: 


Problem: 


What sound intensity level in dB is produced by earphones that create 
an intensity of 4.00 x 10-2 W/m”? 


Solution: 


106 dB 
Exercise: 


Problem: 


Show that an intensity of 10°! W/m? is the same as 10°!® W/cm’. 


Exercise: 
Problem: 
(a) What is the decibel level of a sound that is twice as intense as a 


90.0-dB sound? (b) What is the decibel level of a sound that is one- 
fifth as intense as a 90.0-dB sound? 


Solution: 
(a) 93 dB 


(b) 83 dB 
Exercise: 


Problem: 


(a) What is the intensity of a sound that has a level 7.00 dB lower than 
a 4.00 x 10° W / m” sound? (b) What is the intensity of a sound that 
is 3.00 dB higher than a 4.00 x 10° W/m’ sound? 


Exercise: 


Problem: 


(a) How much more intense is a sound that has a level 17.0 dB higher 
than another? (b) If one sound has a level 23.0 dB less than another, 
what is the ratio of their intensities? 


Solution: 
(a) 50.1 


—3 1 
(b) 5.01 x 10° or 200 


Exercise: 


Problem: 


People with good hearing can perceive sounds as low in level as 
—8.00 dB at a frequency of 3000 Hz. What is the intensity of this 
sound in watts per meter squared? 


Exercise: 
Problem: 
If a large housefly 3.0 m away from you makes a noise of 40.0 dB, 


what is the noise level of 1000 flies at that distance, assuming 
interference has a negligible effect? 


Solution: 


70.0 dB 
Exercise: 
Problem: 
Ten cars in a circle at a boom box competition produce a 120-dB 
sound intensity level at the center of the circle. What is the average 


sound intensity level produced there by each stereo, assuming 
interference effects can be neglected? 


Exercise: 
Problem: 
The amplitude of a sound wave is measured in terms of its maximum 


gauge pressure. By what factor does the amplitude of a sound wave 
increase if the sound intensity level goes up by 40.0 dB? 


Solution: 


100 


Exercise: 


Problem: 


If a sound intensity level of 0 dB at 1000 Hz corresponds to a 
maximum gauge pressure (sound amplitude) of 10°? atm, what is the 
maximum gauge pressure in a 60-dB sound? What is the maximum 
gauge pressure in a 120-dB sound? 


Exercise: 


Problem: 


An 8-hour exposure to a sound intensity level of 90.0 dB may cause 
hearing damage. What energy in joules falls on a 0.800-cm-diameter 
eardrum so exposed? 


Solution: 
Equation: 


1.45 x 10° J 


Exercise: 


Problem: 


(a) Ear trumpets were never very common, but they did aid people 
with hearing losses by gathering sound over a large area and 
concentrating it on the smaller area of the eardrum. What decibel 
increase does an ear trumpet produce if its sound gathering area is 

900 cm? and the area of the eardrum is 0.500 cm2, but the trumpet 
only has an efficiency of 5.00% in transmitting the sound to the 
eardrum? (b) Comment on the usefulness of the decibel increase found 
in part (a). 


Exercise: 


Problem: 


Sound is more effectively transmitted into a stethoscope by direct 
contact than through the air, and it is further intensified by being 
concentrated on the smaller area of the eardrum. It is reasonable to 
assume that sound is transmitted into a stethoscope 100 times as 
effectively compared with transmission though the air. What, then, is 
the gain in decibels produced by a stethoscope that has a sound 
gathering area of 15.0 cm’, and concentrates the sound onto two 
eardrums with a total area of 0.900 cm? with an efficiency of 40.0%? 


Solution: 


28.2 dB 
Exercise: 


Problem: 


Loudspeakers can produce intense sounds with surprisingly small 
energy input in spite of their low efficiencies. Calculate the power 
input needed to produce a 90.0-dB sound intensity level for a 12.0-cm- 
diameter speaker that has an efficiency of 1.00%. (This value is the 
sound intensity level right at the speaker.) 


Glossary 


intensity 
the power per unit area carried by a wave 


sound intensity level 
a unitless quantity telling you the level of the sound relative to a fixed 
standard 


sound pressure level 
the ratio of the pressure amplitude to a reference pressure 


Doppler Effect and Sonic Booms 


¢ Define Doppler effect, Doppler shift, and sonic boom. 

¢ Calculate the frequency of a sound heard by someone observing 
Doppler shift. 

e Describe the sounds produced by objects moving faster than the speed 
of sound. 


The characteristic sound of a motorcycle buzzing by is an example of the 
Doppler effect. The high-pitch scream shifts dramatically to a lower-pitch 
roar as the motorcycle passes by a stationary observer. The closer the 
motorcycle brushes by, the more abrupt the shift. The faster the motorcycle 
moves, the greater the shift. We also hear this characteristic shift in 
frequency for passing race cars, airplanes, and trains. It is so familiar that it 
is used to imply motion and children often mimic it in play. 


The Doppler effect is an alteration in the observed frequency of a sound due 
to motion of either the source or the observer. Although less familiar, this 
effect is easily noticed for a stationary source and moving observer. For 
example, if you ride a train past a stationary warning bell, you will hear the 
bell’s frequency shift from high to low as you pass by. The actual change in 
frequency due to relative motion of source and observer is called a Doppler 
shift. The Doppler effect and Doppler shift are named for the Austrian 
physicist and mathematician Christian Johann Doppler (1803-1853), who 
did experiments with both moving sources and moving observers. Doppler, 
for example, had musicians play on a moving open train car and also play 
standing next to the train tracks as a train passed by. Their music was 
observed both on and off the train, and changes in frequency were 
measured. 


What causes the Doppler shift? [link], [link], and [link] compare sound 
waves emitted by stationary and moving sources in a stationary air mass. 
Each disturbance spreads out spherically from the point where the sound 
was emitted. If the source is stationary, then all of the spheres representing 
the air compressions in the sound wave centered on the same point, and the 
stationary observers on either side see the same wavelength and frequency 
as emitted by the source, as in [link]. If the source is moving, as in [link], 
then the situation is different. Each compression of the air moves out in a 


sphere from the point where it was emitted, but the point of emission 
moves. This moving emission point causes the air compressions to be closer 
together on one side and farther apart on the other. Thus, the wavelength is 
shorter in the direction the source is moving (on the right in [link]), and 
longer in the opposite direction (on the left in [link]). Finally, if the 
observers move, as in [link], the frequency at which they receive the 
compressions changes. The observer moving toward the source receives 
them at a higher frequency, and the person moving away from the source 
receives them at a lower frequency. 


Sounds emitted by a 
source spread out in 
spherical waves. Because 
the source, observers, and 
air are stationary, the 
wavelength and 
frequency are the same in 
all directions and to all 
observers. 


12345 


Sounds emitted by a 


source moving to the 
right spread out from the 
points at which they were 
emitted. The wavelength 
is reduced and, 
consequently, the 
frequency is increased in 
the direction of motion, 
so that the observer on 
the right hears a higher- 
pitch sound. The opposite 
is true for the observer on 
the left, where the 
wavelength is increased 
and the frequency is 
reduced. 


The same effect is 
produced when the 
observers move relative 
to the source. Motion 
toward the source 
increases frequency as the 
observer on the right 
passes through more 
wave crests than she 
would if stationary. 
Motion away from the 


source decreases 
frequency as the observer 
on the left passes through 
fewer wave crests than he 
would if stationary. 


We know that wavelength and frequency are related by vw = fA, where vw 
is the fixed speed of sound. The sound moves in a medium and has the 
same speed Uy in that medium whether the source is moving or not. Thus f 
multiplied by A is a constant. Because the observer on the right in [link] 
receives a shorter wavelength, the frequency she receives must be higher. 
Similarly, the observer on the left receives a longer wavelength, and hence 
he hears a lower frequency. The same thing happens in [link]. A higher 
frequency is received by the observer moving toward the source, and a 
lower frequency is received by an observer moving away from the source. 
In general, then, relative motion of source and observer toward one another 
increases the received frequency. Relative motion apart decreases 
frequency. The greater the relative speed is, the greater the effect. 


Note: 

The Doppler Effect 

The Doppler effect occurs not only for sound but for any wave when there 
is relative motion between the observer and the source. There are Doppler 
shifts in the frequency of sound, light, and water waves, for example. 
Doppler shifts can be used to determine velocity, such as when ultrasound 
is reflected from blood in a medical diagnostic. The recession of galaxies is 
determined by the shift in the frequencies of light received from them and 
has implied much about the origins of the universe. Modern physics has 
been profoundly affected by observations of Doppler shifts. 


For a stationary observer and a moving source, the frequency fobs received 
by the observer can be shown to be 


Equation: 


Uw 
fobs = 4( =), 


where f, is the frequency of the source, v, is the speed of the source along a 
line joining the source and observer, and v,, is the speed of sound. The 
minus sign is used for motion toward the observer and the plus sign for 
motion away from the observer, producing the appropriate shifts up and 
down in frequency. Note that the greater the speed of the source, the greater 
the effect. Similarly, for a stationary source and moving observer, the 
frequency received by the observer fops is given by 

Equation: 


Vag E Vob 
fobs = p( =="), 


w 


where Uops is the speed of the observer along a line joining the source and 
observer. Here the plus sign is for motion toward the source, and the minus 
is for motion away from the source. 


Example: 

Calculate Doppler Shift: A Train Horn 

Suppose a train that has a 150-Hz horn is moving at 35.0 m/s in still air on 
a day when the speed of sound is 340 m/s. 

(a) What frequencies are observed by a stationary person at the side of the 
tracks as the train approaches and after it passes? 

(b) What frequency is observed by the train’s engineer traveling on the 
train? 

Strategy 


Vv. 


To find the observed frequency in (a), fobs = fs (<2) , must be used 


(UesiaVe 


because the source is moving. The minus sign is used for the approaching 


train, and the plus sign for the receding train. In (b), there are two Doppler 
shifts—one for a moving source and the other for a moving observer. 
Solution for (a) 


(1) Enter known values into fops = fs (=) ; 
Equation: 


7 Ue 7 340 m/s 
fos = f(— = -| = (150 Hz) (nee) 


(2) Calculate the frequency observed by a stationary person as the train 
approaches. 
Equation: 


fops = (150 Hz)(1.11) = 167 Hz 


(3) Use the same equation with the plus sign to find the frequency heard by 
a stationary person as the train recedes. 
Equation: 


7 Uy 7 340 m/s 
fan= (5 F,) = C5089 (sae some) 


(4) Calculate the second frequency. 
Equation: 


fobs = (150 Hz)(0.907) = 136 Hz 


Discussion on (a) 

The numbers calculated are valid when the train is far enough away that 
the motion is nearly along the line joining train and observer. In both cases, 
the shift is significant and easily noticed. Note that the shift is 17.0 Hz for 
motion toward and 14.0 Hz for motion away. The shifts are not symmetric. 
Solution for (b) 

(1) Identify knowns: 


e It seems reasonable that the engineer would receive the same 
frequency as emitted by the hom, because the relative velocity 


between them is zero. 
e Relative to the medium (air), the speeds are vs = Vobs = 35.0 m/s. 
e The first Doppler shift is for the moving observer; the second is for 
the moving source. 


(2) Use the following equation: 


Equation: 
Var ais: Oaks (be 
itn = ( =) ). 
(bee (Diels 


The quantity in the square brackets is the Doppler-shifted frequency due to 
a moving observer. The factor on the right is the effect of the moving 
source. 

(3) Because the train engineer is moving in the direction toward the horn, 
we must use the plus sign for vpps; however, because the horn is also 
moving in the direction away from the engineer, we also use the plus sign 
for vs. But the train is carrying both the engineer and the horn at the same 
velocity, SO Us = Vobs. AS a result, everything but f, cancels, yielding 
Equation: 


foos=fs- 


Discussion for (b) 

We may expect that there is no change in frequency when source and 
observer move together because it fits your experience. For example, there 
is no Doppler shift in the frequency of conversations between driver and 
passenger on a motorcycle. People talking when a wind moves the air 
between them also observe no Doppler shift in their conversation. The 
crucial point is that source and observer are not moving relative to each 
other. 


Sonic Booms to Bow Wakes 


What happens to the sound produced by a moving source, such as a jet 
airplane, that approaches or even exceeds the speed of sound? The answer 


to this question applies not only to sound but to all other waves as well. 


Suppose a jet airplane is coming nearly straight at you, emitting a sound of 
frequency f;. The greater the plane’s speed vg, the greater the Doppler shift 
and the greater the value observed for fops. Now, aS vs approaches the speed 
of sound, fops approaches infinity, because the denominator in 


fobs = fs( —*~ ) approaches zero. At the speed of sound, this result 


Us =e Us 
means that in front of the source, each successive wave is superimposed on 
the previous one because the source moves forward at the speed of sound. 
The observer gets them all at the same instant, and so the frequency is 
infinite. (Before airplanes exceeded the speed of sound, some people argued 
it would be impossible because such constructive superposition would 
produce pressures great enough to destroy the airplane.) If the source 
exceeds the speed of sound, no sound is received by the observer until the 
source has passed, so that the sounds from the approaching source are 
mixed with those from it when receding. This mixing appears messy, but 
something interesting happens—a sonic boom is created. (See [link].) 


Sound waves from 


a source that moves 
faster than the 
speed of sound 

spread spherically 
from the point 
where they are 
emitted, but the 
source moves 
ahead of each. 


Constructive 
interference along 
the lines shown 
(actually a cone in 
three dimensions) 
creates a shock 
wave Called a sonic 
boom. The faster 
the speed of the 
source, the smaller 
the angle 0. 


There is constructive interference along the lines shown (a cone in three 
dimensions) from similar sound waves arriving there simultaneously. This 
superposition forms a disturbance called a sonic boom, a constructive 
interference of sound created by an object moving faster than sound. Inside 
the cone, the interference is mostly destructive, and so the sound intensity 
there is much less than on the shock wave. An aircraft creates two sonic 
booms, one from its nose and one from its tail. (See [link].) During 
television coverage of space shuttle landings, two distinct booms could 
often be heard. These were separated by exactly the time it would take the 
shuttle to pass by a point. Observers on the ground often do not see the 
aircraft creating the sonic boom, because it has passed by before the shock 
wave reaches them, as seen in [link]. If the aircraft flies close by at low 
altitude, pressures in the sonic boom can be destructive and break windows 
as well as rattle nerves. Because of how destructive sonic booms can be, 
supersonic flights are banned over populated areas of the United States. 


Two sonic booms, 
created by the nose 
and tail of an 
aircraft, are 
observed on the 
ground after the 
plane has passed 
by. 


Sonic booms are one example of a broader phenomenon called bow wakes. 
A bow wake, such as the one in [link], is created when the wave source 
moves faster than the wave propagation speed. Water waves spread out in 
circles from the point where created, and the bow wake is the familiar V- 
shaped wake trailing the source. A more exotic bow wake is created when a 
subatomic particle travels through a medium faster than the speed of light 
travels in that medium. (In a vacuum, the maximum speed of light will be 

c = 3.00 x 10° m /s; in the medium of water, the speed of light is closer to 
0.75c. If the particle creates light in its passage, that light spreads on a cone 
with an angle indicative of the speed of the particle, as illustrated in [link]. 
Such a bow wake is called Cerenkov radiation and is commonly observed 
in particle physics. 


Bow wake created 
by a duck. 
Constructive 
interference 
produces the rather 
structured wake, 
while there is 
relatively little 
wave action inside 
the wake, where 
interference is 
mostly destructive. 
(credit: Horia 
Varlan, Flickr) 


The blue glow in 
this research 
reactor pool is 
Cerenkov radiation 
caused by 
subatomic particles 
traveling faster than 
the speed of light in 
water. (credit: U.S. 
Nuclear Regulatory 
Commission) 


Doppler shifts and sonic booms are interesting sound phenomena that occur 
in all types of waves. They can be of considerable use. For example, the 
Doppler shift in ultrasound can be used to measure blood velocity, while 
police use the Doppler shift in radar (a microwave) to measure car 
velocities. In meteorology, the Doppler shift is used to track the motion of 
storm clouds; such “Doppler Radar” can give velocity and direction and 
rain or snow potential of imposing weather fronts. In astronomy, we can 
examine the light emitted from distant galaxies and determine their speed 
relative to ours. As galaxies move away from us, their light is shifted to a 
lower frequency, and so to a longer wavelength—the so-called red shift. 
Such information from galaxies far, far away has allowed us to estimate the 
age of the universe (from the Big Bang) as about 14 billion years. 
Exercise: 

Check Your Understanding 


Problem: 


Why did scientist Christian Doppler observe musicians both on a 
moving train and also from a stationary point not on the train? 


Solution: 


Doppler needed to compare the perception of sound when the observer 
is stationary and the sound source moves, as well as when the sound 


source and the observer are both in motion. 


Exercise: 
Check Your Understanding 


Problem: 


Describe a situation in your life when you might rely on the Doppler 
shift to help you either while driving a car or walking near traffic. 


Solution: 


If I am driving and I hear Doppler shift in an ambulance siren, I would 
be able to tell when it was getting closer and also if it has passed by. 
This would help me to know whether I needed to pull over and let the 
ambulance through. 


Section Summary 


The Doppler effect is an alteration in the observed frequency of a 
sound due to motion of either the source or the observer. 

The actual change in frequency is called the Doppler shift. 

A sonic boom is constructive interference of sound created by an 
object moving faster than sound. 

A sonic boom is a type of bow wake created when any wave source 
moves faster than the wave propagation speed. 

For a stationary observer and a moving source, the observed frequency 
fobs is: 

Equation: 


Uw 
Tabs =~ #(——), 


where f, is the frequency of the source, v, is the speed of the source, 
and vy is the speed of sound. The minus sign is used for motion 
toward the observer and the plus sign for motion away. 

For a stationary source and moving observer, the observed frequency 
is: 


Equation: 


Vee = Vobs 
Fabs = p( =="), 


Uy 
where Uops is the speed of the observer. 


Conceptual Questions 


Exercise: 


Problem: Is the Doppler shift real or just a sensory illusion? 


Exercise: 


Problem: 


Due to efficiency considerations related to its bow wake, the 
supersonic transport aircraft must maintain a cruising speed that is a 
constant ratio to the speed of sound (a constant Mach number). If the 
aircraft flies from warm air into colder air, should it increase or 


decrease its speed? Explain your answer. 
Exercise: 


Problem: 


When you hear a sonic boom, you often cannot see the plane that made 


it. Why is that? 


Problems & Exercises 


Exercise: 


Problem: 


(a) What frequency is received by a person watching an oncoming 
ambulance moving at 110 km/h and emitting a steady 800-Hz sound 
from its siren? The speed of sound on this day is 345 m/s. (b) What 
frequency does she receive after the ambulance has passed? 


Solution: 
(a) 878 Hz 


(b) 735 Hz 

Exercise: 
Problem: 
(a) At an air show a jet flies directly toward the stands at a speed of 
1200 km/h, emitting a frequency of 3500 Hz, on a day when the speed 
of sound is 342 m/s. What frequency is received by the observers? (b) 


What frequency do they receive as the plane flies directly away from 
them? 


Exercise: 


Problem: 

What frequency is received by a mouse just before being dispatched by 
a hawk flying at it at 25.0 m/s and emitting a screech of frequency 
3500 Hz? Take the speed of sound to be 331 m/s. 


Solution: 
Equation: 


3.79 x 10? Hz 


Exercise: 


Problem: 


A spectator at a parade receives an 888-Hz tone from an oncoming 
trumpeter who is playing an 880-Hz note. At what speed is the 
musician approaching if the speed of sound is 338 m/s? 


Exercise: 
Problem: 
A commuter train blows its 200-Hz horn as it approaches a crossing. 
The speed of sound is 335 m/s. (a) An observer waiting at the crossing 


receives a frequency of 208 Hz. What is the speed of the train? (b) 
What frequency does the observer receive as the train moves away? 


Solution: 
(a) 12.9 m/s 


(b) 193 Hz 
Exercise: 
Problem: 
Can you perceive the shift in frequency produced when you pull a 
tuning fork toward you at 10.0 m/s on a day when the speed of sound 


is 344 m/s? To answer this question, calculate the factor by which the 
frequency shifts and see if it is greater than 0.300%. 


Exercise: 
Problem: 
Two eagles fly directly toward one another, the first at 15.0 m/s and the 
second at 20.0 m/s. Both screech, the first one emitting a frequency of 


3200 Hz and the second one emitting a frequency of 3800 Hz. What 
frequencies do they receive if the speed of sound is 330 m/s? 


Solution: 


First eagle hears 4.23 x 10° Hz 


Second eagle hears 3.56 x 10° Hz 
Exercise: 


Problem: 


What is the minimum speed at which a source must travel toward you 
for you to be able to hear that its frequency is Doppler shifted? That is, 
what speed produces a shift of 0.300% on a day when the speed of 
sound is 331 m/s? 


Glossary 


Doppler effect 
an alteration in the observed frequency of a sound due to motion of 
either the source or the observer 


Doppler shift 
the actual change in frequency due to relative motion of source and 
observer 


sonic boom 
a constructive interference of sound created by an object moving faster 
than sound 


bow wake 
V-shaped disturbance created when the wave source moves faster than 
the wave propagation speed 


Sound Interference and Resonance: Standing Waves in Air Columns 


e Define antinode, node, fundamental, overtones, and harmonics. 

e Identify instances of sound interference in everyday situations. 

e Describe how sound interference occurring inside open and closed 
tubes changes the characteristics of the sound, and how this applies to 
sounds produced by musical instruments. 

e Calculate the length of a tube using sound wave measurements. 


Some types 
of 
headphones 
use the 
phenomena 
of 


constructiv 
e and 
destructive 
interference 
to cancel 
out outside 
noises. 
(credit: 
JVC 
America, 
Flickr) 


Interference is the hallmark of waves, all of which exhibit constructive and 
destructive interference exactly analogous to that seen for water waves. In 
fact, one way to prove something “is a wave” is to observe interference 
effects. So, sound being a wave, we expect it to exhibit interference; we 
have already mentioned a few such effects, such as the beats from two 
similar notes played simultaneously. 


[link] shows a clever use of sound interference to cancel noise. Larger-scale 
applications of active noise reduction by destructive interference are 
contemplated for entire passenger compartments in commercial aircraft. To 
obtain destructive interference, a fast electronic analysis is performed, and a 
second sound is introduced with its maxima and minima exactly reversed 
from the incoming noise. Sound waves in fluids are pressure waves and 
consistent with Pascal’s principle; pressures from two different sources add 
and subtract like simple numbers; that is, positive and negative gauge 
pressures add to a much smaller pressure, producing a lower-intensity 
sound. Although completely destructive interference is possible only under 
the simplest conditions, it is possible to reduce noise levels by 30 dB or 
more using this technique. 


: Boom and cable 
Noise sensor 


Noise 
Cancellation | Driver 
System 

Pressure servo 


Cushions 


Boom mic 
with low distortion housing 


Headphones designed to cancel noise 
with destructive interference create a 
sound wave exactly opposite to the 
incoming sound. These headphones 
can be more effective than the simple 
passive attenuation used in most ear 
protection. Such headphones were 


used on the record-setting, around the 
world nonstop flight of the Voyager 
aircraft to protect the pilots’ hearing 
from engine noise. 


Where else can we observe sound interference? All sound resonances, such 
as in musical instruments, are due to constructive and destructive 
interference. Only the resonant frequencies interfere constructively to form 
standing waves, while others interfere destructively and are absent. From 
the toot made by blowing over a bottle, to the characteristic flavor of a 
violin’s sounding box, to the recognizability of a great singer’s voice, 
resonance and standing waves play a vital role. 


Note: 

Interference 

Interference is such a fundamental aspect of waves that observing 
interference is proof that something is a wave. The wave nature of light 
was established by experiments showing interference. Similarly, when 
electrons scattered from crystals exhibited interference, their wave nature 
was confirmed to be exactly as predicted by symmetry with certain wave 
characteristics of light. 


Suppose we hold a tuning fork near the end of a tube that is closed at the 
other end, as shown in [Link], [link], [link], and [link]. If the tuning fork has 
just the right frequency, the air column in the tube resonates loudly, but at 
most frequencies it vibrates very little. This observation just means that the 
air column has only certain natural frequencies. The figures show how a 
resonance at the lowest of these natural frequencies is formed. A 
disturbance travels down the tube at the speed of sound and bounces off the 
closed end. If the tube is just the right length, the reflected sound arrives 
back at the tuning fork exactly half a cycle later, and it interferes 


constructively with the continuing sound produced by the tuning fork. The 
incoming and reflected sounds form a standing wave in the tube as shown. 


my 


Resonance of air in a tube 
closed at one end, caused 
by a tuning fork. A 
disturbance moves down 
the tube. 


Resonance of air in a tube 
closed at one end, caused 
by a tuning fork. The 
disturbance reflects from 
the closed end of the tube. 


Resonance of air in a tube 
closed at one end, caused 
by a tuning fork. If the 
length of the tube L is 
just right, the disturbance 
gets back to the tuning 
fork half a cycle later and 
interferes constructively 
with the continuing sound 
from the tuning fork. This 
interference forms a 
standing wave, and the air 
column resonates. 


Resonance of air in a tube 
closed at one end, caused 
by a tuning fork. A graph 
of air displacement along 
the length of the tube 
shows none at the closed 


end, where the motion is 
constrained, and a 
maximum at the open 
end. This standing wave 
has one-fourth of its 
wavelength in the tube, so 
that A = 4L. 


The standing wave formed in the tube has its maximum air displacement 
(an antinode) at the open end, where motion is unconstrained, and no 
displacement (a node) at the closed end, where air movement is halted. The 
distance from a node to an antinode is one-fourth of a wavelength, and this 
equals the length of the tube; thus, A = 4. This same resonance can be 
produced by a vibration introduced at or near the closed end of the tube, as 
shown in [link]. It is best to consider this a natural vibration of the air 
column independently of how it is induced. 


The same standing wave is created in 
the tube by a vibration introduced near 
its closed end. 


Given that maximum air displacements are possible at the open end and 
none at the closed end, there are other, shorter wavelengths that can 
resonate in the tube, such as the one shown in [link]. Here the standing 
wave has three-fourths of its wavelength in the tube, or L = (3/4), so 
that \/= 40/3. Continuing this process reveals a whole series of shorter- 
wavelength and higher-frequency sounds that resonate in the tube. We use 
specific terms for the resonances in any system. The lowest resonant 
frequency is called the fundamental, while all higher resonant frequencies 
are called overtones. All resonant frequencies are integral multiples of the 
fundamental, and they are collectively called harmonics. The fundamental 
is the first harmonic, the first overtone is the second harmonic, and so on. 
[link] shows the fundamental and the first three overtones (the first four 
harmonics) in a tube closed at one end. 


Another resonance 
for a tube closed at 
one end. This has 
maximum air 
displacements at 
the open end, and 
none at the closed 
end. The 
wavelength is 
shorter, with three- 
fourths A/ equaling 
the length of the 
tube, so that 
Al= 41/3. This 
higher-frequency 
vibration is the first 
overtone. 
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The fundamental and three lowest overtones for a tube closed 
at one end. All have maximum air displacements at the open 
end and none at the closed end. 


The fundamental and overtones can be present simultaneously in a variety 
of combinations. For example, middle C on a trumpet has a sound 
distinctively different from middle C on a clarinet, both instruments being 
modified versions of a tube closed at one end. The fundamental frequency 
is the same (and usually the most intense), but the overtones and their mix 
of intensities are different and subject to shading by the musician. This mix 
is what gives various musical instruments (and human voices) their 
distinctive characteristics, whether they have air columns, strings, sounding 
boxes, or drumheads. In fact, much of our speech is determined by shaping 
the cavity formed by the throat and mouth and positioning the tongue to 
adjust the fundamental and combination of overtones. Simple resonant 
cavities can be made to resonate with the sound of the vowels, for example. 
(See [link].) In boys, at puberty, the larynx grows and the shape of the 
resonant cavity changes giving rise to the difference in predominant 
frequencies in speech between men and women. 


The throat and mouth form an air column closed at 
one end that resonates in response to vibrations in 
the voice box. The spectrum of overtones and their 
intensities vary with mouth shaping and tongue 
position to form different sounds. The voice box 
can be replaced with a mechanical vibrator, and 
understandable speech is still possible. Variations 
in basic shapes make different voices 
recognizable. 


Now let us look for a pattern in the resonant frequencies for a simple tube 
that is closed at one end. The fundamental has A = 40, and frequency is 
related to wavelength and the speed of sound as given by: 

Equation: 


t= TA; 


Solving for f in this equation gives 
Equation: 


Uw Uw 


an eae 


where Uy is the speed of sound in air. Similarly, the first overtone has 
Al= 41/3 (see [link]), so that 
Equation: 


Uw 


Because f/= 3f, we call the first overtone the third harmonic. Continuing 
this process, we see a pattern that can be generalized in a single expression. 
The resonant frequencies of a tube closed at one end are 

Equation: 


where fy; is the fundamental, f3 is the first overtone, and so on. It is 
interesting that the resonant frequencies depend on the speed of sound and, 
hence, on temperature. This dependence poses a noticeable problem for 
organs in old unheated cathedrals, and it is also the reason why musicians 
commonly bring their wind instruments to room temperature before playing 
them. 


Example: 

Find the Length of a Tube with a 128 Hz Fundamental 

(a) What length should a tube closed at one end have on a day when the air 
temperature, is 22.0°C, if its fundamental frequency is to be 128 Hz (C 
below middle C)? 

(b) What is the frequency of its fourth overtone? 

Strategy 

The length LZ can be found from the relationship in f, = n 7, but we will 
first need to find the speed of sound vy. 

Solution for (a) 

(1) Identify knowns: 


e the fundamental frequency is 128 Hz 
e the air temperature is 22.0°C 


(2) Use f, = ea to find the fundamental frequency (n = 1). 
Equation: 


ee 
t= aR 
(3) Solve this equation for length. 
Equation: 
Uw 


= 
4 fi 


(4) Find the speed of sound using vy = (331 m/s)4/ a3, - 
Equation: 


295 K 
Uw = (331 m/s)y/ 25% = 344 m/s 


(5) Enter the values of the speed of sound and frequency into the 
expression for L. 
Equation: 


44 
ee TA _344 m/s ~ 0.672m 
4f,  4(128 Hz) 


Discussion on (a) 

Many wind instruments are modified tubes that have finger holes, valves, 
and other devices for changing the length of the resonating air column and 
hence, the frequency of the note played. Horns producing very low 
frequencies, such as tubas, require tubes so long that they are coiled into 
loops. 

Solution for (b) 

(1) Identify knowns: 


e the first overtone has n = 3 
e the second overtone has n = 5 
e the third overtone has n = 7 
e the fourth overtone has n = 9 


(2) Enter the value for the fourth overtone into f, = nq. 
Equation: 


fo Ys Of, = 1.15 kHz 


Discussion on (b) 
Whether this overtone occurs in a simple tube or a musical instrument 
depends on how it is stimulated to vibrate and the details of its shape. The 


trombone, for example, does not produce its fundamental frequency and 
only makes overtones. 


Another type of tube is one that is open at both ends. Examples are some 
organ pipes, flutes, and oboes. The resonances of tubes open at both ends 
can be analyzed in a very similar fashion to those for tubes closed at one 
end. The air columns in tubes open at both ends have maximum air 
displacements at both ends, as illustrated in [link]. Standing waves form as 
shown. 
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The resonant frequencies of a tube open at both ends are shown, 
including the fundamental and the first three overtones. In all cases 
the maximum air displacements occur at both ends of the tube, giving 
it different natural frequencies than a tube closed at one end. 


Based on the fact that a tube open at both ends has maximum air 
displacements at both ends, and using [link] as a guide, we can see that the 
resonant frequencies of a tube open at both ends are: 

Equation: 


where fy; is the fundamental, fg is the first overtone, f3 is the second 
overtone, and so on. Note that a tube open at both ends has a fundamental 
frequency twice what it would have if closed at one end. It also has a 
different spectrum of overtones than a tube closed at one end. So if you had 


two tubes with the same fundamental frequency but one was open at both 
ends and the other was closed at one end, they would sound different when 
played because they have different overtones. Middle C, for example, 
would sound richer played on an open tube, because it has even multiples of 
the fundamental as well as odd. A closed tube has only odd multiples. 


Note: 

Real-World Applications: Resonance in Everyday Systems 

Resonance occurs in many different systems, including strings, air 
columns, and atoms. Resonance is the driven or forced oscillation of a 
system at its natural frequency. At resonance, energy is transferred rapidly 
to the oscillating system, and the amplitude of its oscillations grows until 
the system can no longer be described by Hooke’s law. An example of this 
is the distorted sound intentionally produced in certain types of rock music. 


Wind instruments use resonance in air columns to amplify tones made by 
lips or vibrating reeds. Other instruments also use air resonance in clever 
ways to amplify sound. [link] shows a violin and a guitar, both of which 
have sounding boxes but with different shapes, resulting in different 
overtone structures. The vibrating string creates a sound that resonates in 
the sounding box, greatly amplifying the sound and creating overtones that 
give the instrument its characteristic flavor. The more complex the shape of 
the sounding box, the greater its ability to resonate over a wide range of 
frequencies. The marimba, like the one shown in [link] uses pots or gourds 
below the wooden slats to amplify their tones. The resonance of the pot can 
be adjusted by adding water. 


String instruments such 
as violins and guitars use 
resonance in their 
sounding boxes to 
amplify and enrich the 
sound created by their 
vibrating strings. The 
bridge and supports 
couple the string 
vibrations to the sounding 
boxes and air within. 
(credits: guitar, Feliciano 
Guimares, Fotopedia; 
violin, Steve Snodgrass, 
Flickr) 


Resonance has been used in 
musical instruments since 
prehistoric times. This marimba 
uses gourds as resonance 
chambers to amplify its sound. 
(credit: APC Events, Flickr) 


We have emphasized sound applications in our discussions of resonance 
and standing waves, but these ideas apply to any system that has wave 
characteristics. Vibrating strings, for example, are actually resonating and 
have fundamentals and overtones similar to those for air columns. More 
subtle are the resonances in atoms due to the wave character of their 
electrons. Their orbitals can be viewed as standing waves, which have a 
fundamental (ground state) and overtones (excited states). It is fascinating 
that wave characteristics apply to such a wide range of physical systems. 
Exercise: 

Check Your Understanding 


Problem: 


Describe how noise-canceling headphones differ from standard 
headphones used to block outside sounds. 


Solution: 


Regular headphones only block sound waves with a physical barrier. 
Noise-canceling headphones use destructive interference to reduce the 
loudness of outside sounds. 


Exercise: 
Check Your Understanding 


Problem: 


How is it possible to use a standing wave's node and antinode to 
determine the length of a closed-end tube? 


Solution: 


When the tube resonates at its natural frequency, the wave's node is 
located at the closed end of the tube, and the antinode is located at the 
open end. The length of the tube is equal to one-fourth of the 
wavelength of this wave. Thus, if we know the wavelength of the 
wave, we can determine the length of the tube. 


Note: 

PhET Explorations: Sound 

This simulation lets you see sound waves. Adjust the frequency or volume 
and you can see and hear how the wave changes. Move the listener around 
and hear what she hears. 
https://archive.cnx.org/specials/c4d3b96e-41f3-11e5-ab7b- 
47e22dffc18e/sound/#sim-single-source 


Section Summary 


e Sound interference and resonance have the same properties as defined 
for all waves. 

e In air columns, the lowest-frequency resonance is called the 
fundamental, whereas all higher resonant frequencies are called 
overtones. Collectively, they are called harmonics. 


e The resonant frequencies of a tube closed at one end are: 
Equation: 
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f; is the fundamental and L is the length of the tube. 
e The resonant frequencies of a tube open at both ends are: 
Equation: 


Conceptual Questions 


Exercise: 
Problem: 
How does an unamplified guitar produce sounds so much more intense 
than those of a plucked string held taut by a simple stick? 
Exercise: 
Problem: 
You are given two wind instruments of identical length. One is open at 


both ends, whereas the other is closed at one end. Which is able to 
produce the lowest frequency? 


Exercise: 


Problem: 


What is the difference between an overtone and a harmonic? Are all 
harmonics overtones? Are all overtones harmonics? 


Problems & Exercises 


Exercise: 
Problem: 
A “showy” custom-built car has two brass horns that are supposed to 


produce the same frequency but actually emit 263.8 and 264.5 Hz. 
What beat frequency is produced? 


Solution: 


0.7 Hz 
Exercise: 
Problem: 
What beat frequencies will be present: (a) If the musical notes A and C 
are played together (frequencies of 220 and 264 Hz)? (b) If D and F 


are played together (frequencies of 297 and 352 Hz)? (c) If all four are 
played together? 


Exercise: 
Problem: 


What beat frequencies result if a piano hammer hits three strings that 
emit frequencies of 127.8, 128.1, and 128.3 Hz? 


Solution: 


0.3 Hz, 0.2 Hz, 0.5 Hz 
Exercise: 
Problem: 
A piano tuner hears a beat every 2.00 s when listening to a 264.0-Hz 


tuning fork and a single piano string. What are the two possible 
frequencies of the string? 


Exercise: 


Problem: 


(a) What is the fundamental frequency of a 0.672-m-long tube, open at 
both ends, on a day when the speed of sound is 344 m/s? (b) What is 
the frequency of its second harmonic? 


Solution: 
(a) 256 Hz 


(b) 512 Hz 
Exercise: 
Problem: 
If a wind instrument, such as a tuba, has a fundamental frequency of 
32.0 Hz, what are its first three overtones? It is closed at one end. (The 


overtones of a real tuba are more complex than this example, because 
it is a tapered tube.) 


Exercise: 
Problem: 
What are the first three overtones of a bassoon that has a fundamental 
frequency of 90.0 Hz? It is open at both ends. (The overtones of a real 


bassoon are more complex than this example, because its double reed 
makes it act more like a tube closed at one end.) 


Solution: 


180 Hz, 270 Hz, 360 Hz 


Exercise: 


Problem: 


How long must a flute be in order to have a fundamental frequency of 
262 Hz (this frequency corresponds to middle C on the evenly 
tempered chromatic scale) on a day when air temperature is 20.0°C? It 
is open at both ends. 


Exercise: 
Problem: 
What length should an oboe have to produce a fundamental frequency 


of 110 Hz on a day when the speed of sound is 343 m/s? It is open at 
both ends. 


Solution: 


1.56 m 
Exercise: 


Problem: 


What is the length of a tube that has a fundamental frequency of 176 
Hz and a first overtone of 352 Hz if the speed of sound is 343 m/s? 


Exercise: 
Problem: 
(a) Find the length of an organ pipe closed at one end that produces a 
fundamental frequency of 256 Hz when air temperature is 18.0°C. (b) 
What is its fundamental frequency at 25.0°C? 
Solution: 


(a) 0.334 m 


(b) 259 Hz 


Exercise: 


Problem: 


By what fraction will the frequencies produced by a wind instrument 
change when air temperature goes from 10.0°C to 30.0°C? That is, 
find the ratio of the frequencies at those temperatures. 


Exercise: 


Problem: 


The ear canal resonates like a tube closed at one end. (See [link].) If 
ear canals range in length from 1.80 to 2.60 cm in an average 
population, what is the range of fundamental resonant frequencies? 
Take air temperature to be 37.0°C, which is the same as body 
temperature. How does this result correlate with the intensity versus 
frequency graph ([link] of the human ear? 


Solution: 


3.39 to 4.90 kHz 
Exercise: 


Problem: 


Calculate the first overtone in an ear canal, which resonates like a 
2.40-cm-long tube closed at one end, by taking air temperature to be 
37.0°C. Is the ear particularly sensitive to such a frequency? (The 
resonances of the ear canal are complicated by its nonuniform shape, 
which we shall ignore.) 


Exercise: 


Problem: 


A crude approximation of voice production is to consider the breathing 
passages and mouth to be a resonating tube closed at one end. (See 
[link].) (a) What is the fundamental frequency if the tube is 0.240-m 
long, by taking air temperature to be 37.0°C? (b) What would this 
frequency become if the person replaced the air with helium? Assume 
the same temperature dependence for helium as for air. 


Solution: 
(a) 367 Hz 


(b) 1.07 kHz 

Exercise: 
Problem: 
(a) Students in a physics lab are asked to find the length of an air 
column in a tube closed at one end that has a fundamental frequency of 
256 Hz. They hold the tube vertically and fill it with water to the top, 
then lower the water while a 256-Hz tuning fork is rung and listen for 
the first resonance. What is the air temperature if the resonance occurs 


for a length of 0.336 m? (b) At what length will they observe the 
second resonance (first overtone)? 


Exercise: 
Problem: 
What frequencies will a 1.80-m-long tube produce in the audible range 
at 20.0°C if: (a) The tube is closed at one end? (b) It is open at both 
ends? 
Solution: 


(ai = 147.6 Az) wad, 3. bi 419 


(b) f, = n(95.3 Hz), n = 1, 2, 3,..., 210 


Glossary 


antinode 
point of maximum displacement 


node 


point of zero displacement 


fundamental 
the lowest-frequency resonance 


overtones 
all resonant frequencies higher than the fundamental 


harmonics 
the term used to refer collectively to the fundamental and its overtones 


Hearing 


e Define hearing, pitch, loudness, timbre, note, tone, phon, ultrasound, 
and infrasound. 

e Compare loudness to frequency and intensity of a sound. 

e Identify structures of the inner ear and explain how they relate to 
sound perception. 


Hearing allows this 
vocalist, his band, and his 
fans to enjoy music. 
(credit: West Point Public 
Affairs, Flickr) 


The human ear has a tremendous range and sensitivity. It can give us a 
wealth of simple information—such as pitch, loudness, and direction. And 
from its input we can detect musical quality and nuances of voiced emotion. 
How is our hearing related to the physical qualities of sound, and how does 
the hearing mechanism work? 


Hearing is the perception of sound. (Perception is commonly defined to be 
awareness through the senses, a typically circular definition of higher-level 
processes in living organisms.) Normal human hearing encompasses 
frequencies from 20 to 20,000 Hz, an impressive range. Sounds below 20 
Hz are called infrasound, whereas those above 20,000 Hz are ultrasound. 
Neither is perceived by the ear, although infrasound can sometimes be felt 
as vibrations. When we do hear low-frequency vibrations, such as the 


sounds of a diving board, we hear the individual vibrations only because 
there are higher-frequency sounds in each. Other animals have hearing 
ranges different from that of humans. Dogs can hear sounds as high as 
30,000 Hz, whereas bats and dolphins can hear up to 100,000-Hz sounds. 
You may have noticed that dogs respond to the sound of a dog whistle 
which produces sound out of the range of human hearing. Elephants are 
known to respond to frequencies below 20 Hz. 


The perception of frequency is called pitch. Most of us have excellent 
relative pitch, which means that we can tell whether one sound has a 
different frequency from another. Typically, we can discriminate between 
two sounds if their frequencies differ by 0.3% or more. For example, 500.0 
and 501.5 Hz are noticeably different. Pitch perception is directly related to 
frequency and is not greatly affected by other physical quantities such as 
intensity. Musical notes are particular sounds that can be produced by most 
instruments and in Western music have particular names. Combinations of 
notes constitute music. Some people can identify musical notes, such as A- 
sharp, C, or E-flat, just by listening to them. This uncommon ability is 
called perfect pitch. 


The ear is remarkably sensitive to low-intensity sounds. The lowest audible 
intensity or threshold is about 1071? W/ m/ or 0 dB. Sounds as much as 
10/2 more intense can be briefly tolerated. Very few measuring devices are 
capable of observations over a range of a trillion. The perception of 
intensity is called loudness. At a given frequency, it is possible to discern 
differences of about 1 dB, and a change of 3 dB is easily noticed. But 
loudness is not related to intensity alone. Frequency has a major effect on 
how loud a sound seems. The ear has its maximum sensitivity to 
frequencies in the range of 2000 to 5000 Hz, so that sounds in this range are 
perceived as being louder than, say, those at 500 or 10,000 Hz, even when 
they all have the same intensity. Sounds near the high- and low-frequency 
extremes of the hearing range seem even less loud, because the ear is even 
less sensitive at those frequencies. [link] gives the dependence of certain 
human hearing perceptions on physical quantities. 


Perception Physical quantity 
Pitch Frequency 
Loudness Intensity and Frequency 


Number and relative intensity of multiple 
frequencies. 


Timbre ; 
Subtle craftsmanship leads to non-linear effects and 
more detail. 

Note Basic unit of music with specific names, combined to 
generate tunes 
Number and relative intensity of multiple 

Tone 


frequencies. 


Sound Perceptions 


When a violin plays middle C, there is no mistaking it for a piano playing 
the same note. The reason is that each instrument produces a distinctive set 
of frequencies and intensities. We call our perception of these combinations 
of frequencies and intensities tone quality, or more commonly the timbre 
of the sound. It is more difficult to correlate timbre perception to physical 
quantities than it is for loudness or pitch perception. Timbre is more 
subjective. Terms such as dull, brilliant, warm, cold, pure, and rich are 
employed to describe the timbre of a sound. So the consideration of timbre 
takes us into the realm of perceptual psychology, where higher-level 
processes in the brain are dominant. This is true for other perceptions of 
sound, such as music and noise. We shall not delve further into them; rather, 
we will concentrate on the question of loudness perception. 


A unit called a phon is used to express loudness numerically. Phons differ 
from decibels because the phon is a unit of loudness perception, whereas 
the decibel is a unit of physical intensity. [link] shows the relationship of 
loudness to intensity (or intensity level) and frequency for persons with 
normal hearing. The curved lines are equal-loudness curves. Each curve is 


labeled with its loudness in phons. Any sound along a given curve will be 
perceived as equally loud by the average person. The curves were 
determined by having large numbers of people compare the loudness of 
sounds at different frequencies and sound intensity levels. At a frequency of 
1000 Hz, phons are taken to be numerically equal to decibels. The 
following Ses i aoe how to use the graph: 
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The relationship of loudness in 
phons to intensity level (in 
decibels) and intensity (in watts 
per meter squared) for persons 
with normal hearing. The 
curved lines are equal-loudness 
curves—all sounds on a given 
curve are perceived as equally 
loud. Phons and decibels are 
defined to be the same at 1000 
Hz: 


Example: 

Measuring Loudness: Loudness Versus Intensity Level and Frequency 
(a) What is the loudness in phons of a 100-Hz sound that has an intensity 
level of 80 dB? (b) What is the intensity level in decibels of a 4000-Hz 


sound having a loudness of 70 phons? (c) At what intensity level will an 
8000-Hz sound have the same loudness as a 200-Hz sound at 60 dB? 
Strategy for (a) 

The graph in [link] should be referenced in order to solve this example. To 
find the loudness of a given sound, you must know its frequency and 
intensity level and locate that point on the square grid, then interpolate 
between loudness curves to get the loudness in phons. 

Solution for (a) 

(1) Identify knowns: 


e The square grid of the graph relating phons and decibels is a plot of 
intensity level versus frequency—both physical quantities. 

e 100 Hz at 80 dB lies halfway between the curves marked 70 and 80 
phons. 


(2) Find the loudness: 75 phons. 

Strategy for (b) 

The graph in [link] should be referenced in order to solve this example. To 
find the intensity level of a sound, you must have its frequency and 
loudness. Once that point is located, the intensity level can be determined 
from the vertical axis. 

Solution for (b) 

(1) Identify knowns: 


e Values are given to be 4000 Hz at 70 phons. 


(2) Follow the 70-phon curve until it reaches 4000 Hz. At that point, it is 
below the 70 dB line at about 67 dB. 

(3) Find the intensity level: 

67 dB 

Strategy for (c) 

The graph in [link] should be referenced in order to solve this example. 
Solution for (c) 

(1) Locate the point for a 200 Hz and 60 dB sound. 

(2) Find the loudness: This point lies just slightly above the 50-phon curve, 
and so its loudness is 51 phons. 

(3) Look for the 51-phon level is at 8000 Hz: 63 dB. 

Discussion 


These answers, like all information extracted from [link], have 
uncertainties of several phons or several decibels, partly due to difficulties 
in interpolation, but mostly related to uncertainties in the equal-loudness 
curves. 


Further examination of the graph in [link] reveals some interesting facts 
about human hearing. First, sounds below the 0-phon curve are not 
perceived by most people. So, for example, a 60 Hz sound at 40 dB is 
inaudible. The 0-phon curve represents the threshold of normal hearing. We 
can hear some sounds at intensity levels below 0 dB. For example, a 3-dB, 
5000-Hz sound is audible, because it lies above the 0-phon curve. The 
loudness curves all have dips in them between about 2000 and 5000 Hz. 
These dips mean the ear is most sensitive to frequencies in that range. For 
example, a 15-dB sound at 4000 Hz has a loudness of 20 phons, the same as 
a 20-dB sound at 1000 Hz. The curves rise at both extremes of the 
frequency range, indicating that a greater-intensity level sound is needed at 
those frequencies to be perceived to be as loud as at middle frequencies. For 
example, a sound at 10,000 Hz must have an intensity level of 30 dB to 
seem as loud as a 20 dB sound at 1000 Hz. Sounds above 120 phons are 
painful as well as damaging. 


We do not often utilize our full range of hearing. This is particularly true for 
frequencies above 8000 Hz, which are rare in the environment and are 
unnecessary for understanding conversation or appreciating music. In fact, 
people who have lost the ability to hear such high frequencies are usually 
unaware of their loss until tested. The shaded region in [link] is the 
frequency and intensity region where most conversational sounds fall. The 
curved lines indicate what effect hearing losses of 40 and 60 phons will 
have. A 40-phon hearing loss at all frequencies still allows a person to 
understand conversation, although it will seem very quiet. A person with a 
60-phon loss at all frequencies will hear only the lowest frequencies and 
will not be able to understand speech unless it is much louder than normal. 
Even so, speech may seem indistinct, because higher frequencies are not as 
well perceived. The conversational speech region also has a gender 
component, in that female voices are usually characterized by higher 


frequencies. So the person with a 60-phon hearing impediment might have 
difficulty understanding the normal conversation of a woman. 
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The shaded region represents 
frequencies and intensity levels 
found in normal conversational 

speech. The 0-phon line 
represents the normal hearing 
threshold, while those at 40 and 
60 represent thresholds for 
people with 40- and 60-phon 
hearing losses, respectively. 


Hearing tests are performed over a range of frequencies, usually from 250 
to 8000 Hz, and can be displayed graphically in an audiogram like that in 
[link]. The hearing threshold is measured in dB relative to the normal 
threshold, so that normal hearing registers as 0 dB at all frequencies. 
Hearing loss caused by noise typically shows a dip near the 4000 Hz 
frequency, irrespective of the frequency that caused the loss and often 
affects both ears. The most common form of hearing loss comes with age 
and is called presbycusis—literally elder ear. Such loss is increasingly 


severe at higher frequencies, and interferes with music appreciation and 
speech recognition. 


| 
| ;+{ 1} 1 
40 | @ = rightear . 
_@ =leftear | 


Hearing threshold level (dB) 


es SH 
2egl 1000 2000 4000 8000 
Frequency f (Hz) 


Hearing threshold level (dB) 


ang 1000 2000 4000 8000 
00 Frequency f (Hz) 


il 


e= right ear 
40 -~ © = left ear 
[ ] = bone conduction 


t t y t 
260 1000 2000 4000 8000 
500 


Frequency f (Hz) 


Hearing threshold level (dB) 


Audiograms showing the 
threshold in intensity level 
versus frequency for three 

different individuals. Intensity 
level is measured relative to the 
normal threshold. The top left 
graph is that of a person with 
normal hearing. The graph to its 
right has a dip at 4000 Hz and is 
that of a child who suffered 
hearing loss due to a cap gun. 
The third graph is typical of 
presbycusis, the progressive 
loss of higher frequency hearing 
with age. Tests performed by 
bone conduction (brackets) can 
distinguish nerve damage from 
middle ear damage. 


Note: 

The Hearing Mechanism 

The hearing mechanism involves some interesting physics. The sound 
wave that impinges upon our ear is a pressure wave. The ear is a transducer 
that converts sound waves into electrical nerve impulses in a manner much 
more sophisticated than, but analogous to, a microphone. [link] shows the 
gross anatomy of the ear with its division into three parts: the outer ear or 
ear canal; the middle ear, which runs from the eardrum to the cochlea; and 
the inner ear, which is the cochlea itself. The body part normally referred 
to as the ear is technically called the pinna. 
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The illustration shows the gross 
anatomy of the human ear. 


The outer ear, or ear canal, carries sound to the recessed protected eardrum. 
The air column in the ear canal resonates and is partially responsible for the 
sensitivity of the ear to sounds in the 2000 to 5000 Hz range. The middle 
ear converts sound into mechanical vibrations and applies these vibrations 
to the cochlea. The lever system of the middle ear takes the force exerted on 
the eardrum by sound pressure variations, amplifies it and transmits it to the 


inner ear via the oval window, creating pressure waves in the cochlea 
approximately 40 times greater than those impinging on the eardrum. (See 
[link].) Two muscles in the middle ear (not shown) protect the inner ear 
from very intense sounds. They react to intense sound in a few milliseconds 
and reduce the force transmitted to the cochlea. This protective reaction can 
also be triggered by your own voice, so that humming while shooting a gun, 
for example, can reduce noise damage. 
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This schematic shows the 
middle ear’s system for 
converting sound pressure into 
force, increasing that force 
through a lever system, and 
applying the increased force to 
a small area of the cochlea, 
thereby creating a pressure 
about 40 times that in the 
original sound wave. A 
protective muscle reaction to 
intense sounds greatly reduces 
the mechanical advantage of the 
lever system. 


[link] shows the middle and inner ear in greater detail. Pressure waves 
moving through the cochlea cause the tectorial membrane to vibrate, 
rubbing cilia (called hair cells), which stimulate nerves that send electrical 
signals to the brain. The membrane resonates at different positions for 
different frequencies, with high frequencies stimulating nerves at the near 
end and low frequencies at the far end. The complete operation of the 
cochlea is still not understood, but several mechanisms for sending 
information to the brain are known to be involved. For sounds below about 
1000 Hz, the nerves send signals at the same frequency as the sound. For 
frequencies greater than about 1000 Hz, the nerves signal frequency by 
position. There is a structure to the cilia, and there are connections between 
nerve cells that perform signal processing before information is sent to the 
brain. Intensity information is partly indicated by the number of nerve 
signals and by volleys of signals. The brain processes the cochlear nerve 
signals to provide additional information such as source direction (based on 
time and intensity comparisons of sounds from both ears). Higher-level 


processing produces many nuances, such as music appreciation. 
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The inner ear, or cochlea, is a 
coiled tube about 3 mm in 
diameter and 3 cm in length if 
uncoiled. When the oval 
window is forced inward, as 
shown, a pressure wave travels 
through the perilymph in the 
direction of the arrows, 
stimulating nerves at the base of 
cilia in the organ of Corti. 


Hearing losses can occur because of problems in the middle or inner ear. 
Conductive losses in the middle ear can be partially overcome by sending 
sound vibrations to the cochlea through the skull. Hearing aids for this 
purpose usually press against the bone behind the ear, rather than simply 
amplifying the sound sent into the ear canal as many hearing aids do. 
Damage to the nerves in the cochlea is not repairable, but amplification can 
partially compensate. There is a risk that amplification will produce further 
damage. Another common failure in the cochlea is damage or loss of the 
cilia but with nerves remaining functional. Cochlear implants that stimulate 
the nerves directly are now available and widely accepted. Over 100,000 
implants are in use, in about equal numbers of adults and children. 


The cochlear implant was pioneered in Melbourne, Australia, by Graeme 
Clark in the 1970s for his deaf father. The implant consists of three external 
components and two internal components. The external components are a 
microphone for picking up sound and converting it into an electrical signal, 
a speech processor to select certain frequencies and a transmitter to transfer 
the signal to the internal components through electromagnetic induction. 
The internal components consist of a receiver/transmitter secured in the 
bone beneath the skin, which converts the signals into electric impulses and 
sends them through an internal cable to the cochlea and an array of about 24 
electrodes wound through the cochlea. These electrodes in turn send the 
impulses directly into the brain. The electrodes basically emulate the cilia. 
Exercise: 

Check Your Understanding 


Problem: 


Are ultrasound and infrasound imperceptible to all hearing organisms? 
Explain your answer. 


Solution: 


No, the range of perceptible sound is based in the range of human 
hearing. Many other organisms perceive either infrasound or 
ultrasound. 


Section Summary 


e The range of audible frequencies is 20 to 20,000 Hz. 

e Those sounds above 20,000 Hz are ultrasound, whereas those below 
20 Hz are infrasound. 

e The perception of frequency is pitch. 

e The perception of intensity is loudness. 

e Loudness has units of phons. 


Conceptual Questions 


Exercise: 
Problem: 
Why can a hearing test show that your threshold of hearing is 0 dB at 


250 Hz, when [link] implies that no one can hear such a frequency at 
less than 20 dB? 


Problems & Exercises 


Exercise: 
Problem: 
The factor of 10~!? in the range of intensities to which the ear can 
respond, from threshold to that causing damage after brief exposure, is 
truly remarkable. If you could measure distances over the same range 


with a single instrument and the smallest distance you could measure 
was 1 mm, what would the largest be? 


Solution: 
Equation: 


1x 10°km 


Exercise: 


Problem: 


The frequencies to which the ear responds vary by a factor of 10°. 
Suppose the speedometer on your car measured speeds differing by the 
same factor of 10°, and the greatest speed it reads is 90.0 mi/h. What 
would be the slowest nonzero speed it could read? 


Exercise: 
Problem: 
What are the closest frequencies to 500 Hz that an average person can 


clearly distinguish as being different in frequency from 500 Hz? The 
sounds are not present simultaneously. 


Solution: 


498.5 or 501.5 Hz 
Exercise: 
Problem: 
Can the average person tell that a 2002-Hz sound has a different 


frequency than a 1999-Hz sound without playing them 
simultaneously? 


Exercise: 
Problem: 
If your radio is producing an average sound intensity level of 85 dB, 


what is the next lowest sound intensity level that is clearly less 
intense? 


Solution: 


82 dB 


Exercise: 


Problem: 
Can you tell that your roommate turned up the sound on the TV if its 
average sound intensity level goes from 70 to 73 dB? 

Exercise: 
Problem: 
Based on the graph in [link], what is the threshold of hearing in 
decibels for frequencies of 60, 400, 1000, 4000, and 15,000 Hz? Note 
that many AC electrical appliances produce 60 Hz, music is commonly 


A400 Hz, a reference frequency is 1000 Hz, your maximum sensitivity 
is near 4000 Hz, and many older TVs produce a 15,750 Hz whine. 


Solution: 


approximately 48, 9, 0, —7, and 20 dB, respectively 
Exercise: 
Problem: 
What sound intensity levels must sounds of frequencies 60, 3000, and 


8000 Hz have in order to have the same loudness as a 40-dB sound of 
frequency 1000 Hz (that is, to have a loudness of 40 phons)? 


Exercise: 


Problem: 


What is the approximate sound intensity level in decibels of a 600-Hz 
tone if it has a loudness of 20 phons? If it has a loudness of 70 phons? 


Solution: 
(a) 23 dB 


(b) 70 dB 


Exercise: 


Problem: 


(a) What are the loudnesses in phons of sounds having frequencies of 
200, 1000, 5000, and 10,000 Hz, if they are all at the same 60.0-dB 
sound intensity level? (b) If they are all at 110 dB? (c) If they are all at 
20.0 dB? 


Exercise: 
Problem: 
Suppose a person has a 50-dB hearing loss at all frequencies. By how 
many factors of 10 will low-intensity sounds need to be amplified to 


seem normal to this person? Note that smaller amplification is 
appropriate for more intense sounds to avoid further hearing damage. 


Solution: 


Five factors of 10 

Exercise: 
Problem: 
If a woman needs an amplification of 5.0 x 10!” times the threshold 
intensity to enable her to hear at all frequencies, what is her overall 
hearing loss in dB? Note that smaller amplification is appropriate for 


more intense sounds to avoid further damage to her hearing from 
levels above 90 dB. 


Exercise: 
Problem: 
(a) What is the intensity in watts per meter squared of a just barely 
audible 200-Hz sound? (b) What is the intensity in watts per meter 
squared of a barely audible 4000-Hz sound? 


Solution: 


(a) 2 x 10°12? W/m’ 


(b) 2 x 10-8 W/m? 
Exercise: 
Problem: 
(a) Find the intensity in watts per meter squared of a 60.0-Hz sound 


having a loudness of 60 phons. (b) Find the intensity in watts per meter 
squared of a 10,000-Hz sound having a loudness of 60 phons. 


Exercise: 
Problem: 
A person has a hearing threshold 10 dB above normal at 100 Hz and 
50 dB above normal at 4000 Hz. How much more intense must a 100- 


Hz tone be than a 4000-Hz tone if they are both barely audible to this 
person? 


Solution: 


25 
Exercise: 
Problem: 
A child has a hearing loss of 60 dB near 5000 Hz, due to noise 
exposure, and normal hearing elsewhere. How much more intense is a 


5000-Hz tone than a 400-Hz tone if they are both barely audible to the 
child? 


Exercise: 


Problem: 


What is the ratio of intensities of two sounds of identical frequency if 
the first is just barely discernible as louder to a person than the second? 


Solution: 


1.26 


Glossary 


loudness 
the perception of sound intensity 


timbre 
number and relative intensity of multiple sound frequencies 


note 
basic unit of music with specific names, combined to generate tunes 


tone 
number and relative intensity of multiple sound frequencies 


phon 
the numerical unit of loudness 


ultrasound 
sounds above 20,000 Hz 


infrasound 
sounds below 20 Hz 


Ultrasound 


¢ Define acoustic impedance and intensity reflection coefficient. 

¢ Describe medical and other uses of ultrasound technology. 

¢ Calculate acoustic impedance using density values and the speed of ultrasound. 
¢ Calculate the velocity of a moving object using Doppler-shifted ultrasound. 


Ultrasound is used in medicine 
to painlessly and noninvasively 
monitor patient health and 
diagnose a wide range of 
disorders. (credit: 
abbybatchelder, Flickr) 


Any sound with a frequency above 20,000 Hz (or 20 kHz)—that is, above the highest 
audible frequency—is defined to be ultrasound. In practice, it is possible to create 
ultrasound frequencies up to more than a gigahertz. (Higher frequencies are difficult to 
create; furthermore, they propagate poorly because they are very strongly absorbed.) 
Ultrasound has a tremendous number of applications, which range from burglar alarms 
to use in cleaning delicate objects to the guidance systems of bats. We begin our 
discussion of ultrasound with some of its applications in medicine, in which it is used 
extensively both for diagnosis and for therapy. 


Note: 

Characteristics of Ultrasound 

The characteristics of ultrasound, such as frequency and intensity, are wave properties 
common to all types of waves. Ultrasound also has a wavelength that limits the 
fineness of detail it can detect. This characteristic is true of all waves. We can never 
observe details significantly smaller than the wavelength of our probe; for example, 


we will never see individual atoms with visible light, because the atoms are so small 
compared with the wavelength of light. 


Ultrasound in Medical Therapy 


Ultrasound, like any wave, carries energy that can be absorbed by the medium 
carrying it, producing effects that vary with intensity. When focused to intensities of 
10° to 10° W/ m’, ultrasound can be used to shatter gallstones or pulverize cancerous 
tissue in surgical procedures. (See [link].) Intensities this great can damage individual 
cells, variously causing their protoplasm to stream inside them, altering their 
permeability, or rupturing their walls through cavitation. Cavitation is the creation of 
vapor cavities in a fluid—the longitudinal vibrations in ultrasound alternatively 
compress and expand the medium, and at sufficient amplitudes the expansion 
separates molecules. Most cavitation damage is done when the cavities collapse, 
producing even greater shock pressures. 


The tip of this 
small probe 
oscillates at 23 kHz 
with such a large 
amplitude that it 
pulverizes tissue on 
contact. The debris 
is then aspirated. 
The speed of the tip 
may exceed the 
speed of sound in 
tissue, thus creating 
shock waves and 
cavitation, rather 
than a smooth 


simple harmonic 
oscillator—type 
wave. 


Most of the energy carried by high-intensity ultrasound in tissue is converted to 
thermal energy. In fact, intensities of 10° to 104 W/ m’ are commonly used for deep- 
heat treatments called ultrasound diathermy. Frequencies of 0.8 to 1 MHz are typical. 
In both athletics and physical therapy, ultrasound diathermy is most often applied to 
injured or overworked muscles to relieve pain and improve flexibility. Skill is needed 
by the therapist to avoid “bone burns” and other tissue damage caused by overheating 
and cavitation, sometimes made worse by reflection and focusing of the ultrasound by 
joint and bone tissue. 


In some instances, you may encounter a different decibel scale, called the sound 
pressure level, when ultrasound travels in water or in human and other biological 
tissues. We shall not use the scale here, but it is notable that numbers for sound 
pressure levels range 60 to 70 dB higher than you would quote for {, the sound 
intensity level used in this text. Should you encounter a sound pressure level of 220 
decibels, then, it is not an astronomically high intensity, but equivalent to about 155 
dB—high enough to destroy tissue, but not as unreasonably high as it might seem at 
first. 


Ultrasound in Medical Diagnostics 


When used for imaging, ultrasonic waves are emitted from a transducer, a crystal 
exhibiting the piezoelectric effect (the expansion and contraction of a substance when 
a voltage is applied across it, causing a vibration of the crystal). These high-frequency 
vibrations are transmitted into any tissue in contact with the transducer. Similarly, if a 
pressure is applied to the crystal (in the form of a wave reflected off tissue layers), a 
voltage is produced which can be recorded. The crystal therefore acts as both a 
transmitter and a receiver of sound. Ultrasound is also partially absorbed by tissue on 
its path, both on its journey away from the transducer and on its return journey. From 
the time between when the original signal is sent and when the reflections from 
various boundaries between media are received, (as well as a measure of the intensity 
loss of the signal), the nature and position of each boundary between tissues and 
organs may be deduced. 


Reflections at boundaries between two different media occur because of differences in 
a characteristic known as the acoustic impedance Z of each substance. Impedance is 
defined as 


Equation: 


Z = pv, 


where p is the density of the medium (in kg/ m’°) and v is the speed of sound through 
the medium (in m/s). The units for Z are therefore kg /(m? - s). 


[link] shows the density and speed of sound through various media (including various 
soft tissues) and the associated acoustic impedances. Note that the acoustic 
impedances for soft tissue do not vary much but that there is a big difference between 
the acoustic impedance of soft tissue and air and also between soft tissue and bone. 


Medium 
Air 
Water 
Blood 
Fat 


Muscle (average) 


Bone (varies) 


Barium titanate 
(transducer 
material) 


Density 
(kg/m?) 


1.3 
1000 
1060 
925 
1075 
1400- 


1900 


5600 


Speed of 
Ultrasound 
(m/s) 

330 

1500 

1570 

1450 


1590 


4080 


5500 


Acoustic 
Impedance 


(kg/(m? -s)) 
429 

1.5 x 10° 
1.66 x 10° 
1.34 x 10° 
1.70 x 10° 
5.7 x 10° to 


7.8 x 10° 


30.8 x 10° 


The Ultrasound Properties of Various Media, Including Soft Tissue Found in the Body 


At the boundary between media of different acoustic impedances, some of the wave 
energy is reflected and some is transmitted. The greater the difference in acoustic 
impedance between the two media, the greater the reflection and the smaller the 
transmission. 


The intensity reflection coefficient a is defined as the ratio of the intensity of the 
reflected wave relative to the incident (transmitted) wave. This statement can be 
written mathematically as 

Equation: 


_ (2-4) 
(21 + Z2)° 


where Z; and Z» are the acoustic impedances of the two media making up the 
boundary. A reflection coefficient of zero (corresponding to total transmission and no 
reflection) occurs when the acoustic impedances of the two media are the same. An 
impedance “match” (no reflection) provides an efficient coupling of sound energy 
from one medium to another. The image formed in an ultrasound is made by tracking 
reflections (as shown in [link]) and mapping the intensity of the reflected sound waves 
in a two-dimensional plane. 


Example: 

Calculate Acoustic Impedance and Intensity Reflection Coefficient: Ultrasound 
and Fat Tissue 

(a) Using the values for density and the speed of ultrasound given in [link], show that 
the acoustic impedance of fat tissue is indeed 1.34 x 10° kg/(m?-s). 

(b) Calculate the intensity reflection coefficient of ultrasound when going from fat to 
muscle tissue. 

Strategy for (a) 

The acoustic impedance can be calculated using Z = pv and the values for p and v 
found in [link]. 

Solution for (a) 

(1) Substitute known values from [link] into Z = pv. 

Equation: 


Lai = (925 kg/m*) (1450 m/s) 


(2) Calculate to find the acoustic impedance of fat tissue. 
Equation: 


1.34 x 10° kg /(m?-s) 


This value is the same as the value given for the acoustic impedance of fat tissue. 


Strategy for (b) 
The intensity reflection coefficient for any boundary between two media is given by 
2 
= eZ and the acoustic impedance of muscle is given in [link]. 
(Z1 + Z2) 
Solution for (b) 
2 
Substitute known values into a = ena to find the intensity reflection 
1 2 
coefficient: 
Equation: 
2 
2 
 (-th)? (1.34 x 10° kg/(m?- s) — 1.70 x 10° kg/(m -s)) 


= 0.014 
2 2 
(21 + 22) (1.70 x 10° kg/(m?- s) + 1.34 x 10° kg/(m?. s)) 


Discussion 
This result means that only 1.4% of the incident intensity is reflected, with the 


remaining being transmitted. 


The applications of ultrasound in medical diagnostics have produced untold benefits 
with no known risks. Diagnostic intensities are too low (about 10°? W / m7’) to cause 
thermal damage. More significantly, ultrasound has been in use for several decades 
and detailed follow-up studies do not show evidence of ill effects, quite unlike the case 
for x-rays. 


Transducer 
Speaker 
and 
= microphone 
=” 
(a) 
t 
(b) 


(a) An 
ultrasound 
speaker 
doubles as a 
microphone. 
Brief bleeps 
are broadcast, 
and echoes 
are recorded 
from various 
depths. (b) 
Graph of echo 
intensity 
versus time. 
The time for 
echoes to 
return is 
directly 
proportional 
to the 
distance of 
the reflector, 
yielding this 
information 
noninvasively 


The most common ultrasound applications produce an image like that shown in [link]. 
The speaker-microphone broadcasts a directional beam, sweeping the beam across the 
area of interest. This is accomplished by having multiple ultrasound sources in the 
probe’s head, which are phased to interfere constructively in a given, adjustable 
direction. Echoes are measured as a function of position as well as depth. A computer 
constructs an image that reveals the shape and density of internal structures. 


~.<sPosition direction and 
: time information 


j to computer 
Ultrasonic 


beam 


(b) 


(a) An ultrasonic image is 
produced by sweeping the 
ultrasonic beam across the area 
of interest, in this case the 
woman’s abdomen. Data are 
recorded and analyzed in a 
computer, providing a two- 
dimensional image. (b) 
Ultrasound image of 12-week- 
old fetus. (credit: Margaret W. 
Carruthers, Flickr) 


How much detail can ultrasound reveal? The image in [link] is typical of low-cost 
systems, but that in [link] shows the remarkable detail possible with more advanced 
systems, including 3D imaging. Ultrasound today is commonly used in prenatal care. 
Such imaging can be used to see if the fetus is developing at a normal rate, and help in 
the determination of serious problems early in the pregnancy. Ultrasound is also in 
wide use to image the chambers of the heart and the flow of blood within the beating 
heart, using the Doppler effect (echocardiology). 


Whenever a wave is used as a probe, it is very difficult to detect details smaller than 
its wavelength A. Indeed, current technology cannot do quite this well. Abdominal 


scans may use a 7-MHz frequency, and the speed of sound in tissue is about 1540 m/s 


nA ; _ vy _ 1540 m/s 
—-so the wavelength limit to detail would be A = * = 7xi0°Hs 


practice, 1-mm detail is attainable, which is sufficient for many purposes. Higher- 
frequency ultrasound would allow greater detail, but it does not penetrate as well as 
lower frequencies do. The accepted rule of thumb is that you can effectively scan to a 
depth of about 500A into tissue. For 7 MHz, this penetration limit is 500 x 0.22 mm, 
which is 0.11 m. Higher frequencies may be employed in smaller organs, such as the 
eye, but are not practical for looking deep into the body. 


= 0.22 mm. In 


A 3D ultrasound image of a 
fetus. As well as for the 
detection of any abnormalities, 
such scans have also been 
shown to be useful for 
strengthening the emotional 
bonding between parents and 
their unborn child. (credit: 
Jennie Cu, Wikimedia 
Commons) 


In addition to shape information, ultrasonic scans can produce density information 
superior to that found in X-rays, because the intensity of a reflected sound is related to 
changes in density. Sound is most strongly reflected at places where density changes 
are greatest. 


Another major use of ultrasound in medical diagnostics is to detect motion and 
determine velocity through the Doppler shift of an echo, known as Doppler-shifted 
ultrasound. This technique is used to monitor fetal heartbeat, measure blood velocity, 
and detect occlusions in blood vessels, for example. (See [link].) The magnitude of the 
Doppler shift in an echo is directly proportional to the velocity of whatever reflects the 
sound. Because an echo is involved, there is actually a double shift. The first occurs 
because the reflector (say a fetal heart) is a moving observer and receives a Doppler- 
shifted frequency. The reflector then acts as a moving source, producing a second 
Doppler shift. 


4:18:42 pm 
3 #50 


6.0MHz = 35mm 
KAI 


This Doppler-shifted ultrasonic 
image of a partially occluded 
artery uses color to indicate 
velocity. The highest velocities 
are in red, while the lowest are 
blue. The blood must move 
faster through the constriction 
to carry the same flow. (credit: 
Arning C, Grzyska U, 
Wikimedia Commons) 


A clever technique is used to measure the Doppler shift in an echo. The frequency of 
the echoed sound is superimposed on the broadcast frequency, producing beats. The 
beat frequency is Fg =| f1 — fo |, and so it is directly proportional to the Doppler 
shift (f1 — f2) and hence, the reflector’s velocity. The advantage in this technique is 
that the Doppler shift is small (because the reflector’s velocity is small), so that great 
accuracy would be needed to measure the shift directly. But measuring the beat 
frequency is easy, and it is not affected if the broadcast frequency varies somewhat. 
Furthermore, the beat frequency is in the audible range and can be amplified for audio 
feedback to the medical observer. 


Note: 

Uses for Doppler-Shifted Radar 

Doppler-shifted radar echoes are used to measure wind velocities in storms as well as 
aircraft and automobile speeds. The principle is the same as for Doppler-shifted 
ultrasound. There is evidence that bats and dolphins may also sense the velocity of an 
object (such as prey) reflecting their ultrasound signals by observing its Doppler shift. 


Example: 

Calculate Velocity of Blood: Doppler-Shifted Ultrasound 

Ultrasound that has a frequency of 2.50 MHz is sent toward blood in an artery that is 
moving toward the source at 20.0 cm/s, as illustrated in [link]. Use the speed of sound 
in human tissue as 1540 m/s. (Assume that the frequency of 2.50 MHz is accurate to 
seven significant figures.) 


a. What frequency does the blood receive? 

b. What frequency returns to the source? 

c. What beat frequency is produced if the source and returning frequencies are 
mixed? 


Speaker— 
microphone 


Ultrasound is 
partly 
reflected by 
blood cells 
and plasma 
back toward 
the speaker- 
microphone. 
Because the 
cells are 
moving, two 
Doppler 
shifts are 
produced— 
one for blood 
as a moving 
observer, and 
the other for 
the reflected 
sound 
coming from 
a moving 
source. The 
magnitude of 
the shift is 
directly 
proportional 
to blood 
velocity. 


Strategy 


The first two questions can be answered using fobs = fall no ) and 


Uw = Us 


Uw. ac Vobs 


ee (222s ) for the Doppler shift. The last question asks for beat frequency, 


which is the difference between the original and returning frequencies. 
Solution for (a) 


(1) Identify knowns: 
e The blood is a moving observer, and so the frequency it receives is given by 
Equation: 
Cae wan 
i obs — f s (=) . 


¢ Up is the blood velocity (v ops here) and the plus sign is chosen because the 
motion is toward the source. 


(2) Enter the given values into the equation. 
Equation: 


1540 0.2 
fobs = (2,500,000 Hz) (See) 


1540 m/s 


(3) Calculate to find the frequency: 2,500,325 Hz. 
Solution for (b) 
(1) Identify knowns: 


e The blood acts as a moving source. 

e The microphone acts as a stationary observer. 

e The frequency leaving the blood is 2,500,325 Hz, but it is shifted upward as 
given by 
Equation: 


fobs is the frequency received by the speaker-microphone. 
The source velocity is vp. 
The minus sign is used because the motion is toward the observer. 


The minus sign is used because the motion is toward the observer. 
(2) Enter the given values into the equation: 
Equation: 


1540 m/s 
fobs = (2,500,325 Hz) 


1540 m/s — 0.200 m/s 


(3) Calculate to find the frequency returning to the source: 2,500,649 Hz. 
Solution for (c) 
(1) Identify knowns: 


¢ The beat frequency is simply the absolute value of the difference between f, and 
fobs, aS Stated in: 
Equation: 


fs =| fobs a fs |. 


(2) Substitute known values: 
Equation: 


| 2,500,649 Hz — 2,500,000 Hz | 


(3) Calculate to find the beat frequency: 649 Hz. 

Discussion 

The Doppler shifts are quite small compared with the original frequency of 2.50 
MHz. It is far easier to measure the beat frequency than it is to measure the echo 
frequency with an accuracy great enough to see shifts of a few hundred hertz out of a 
couple of megahertz. Furthermore, variations in the source frequency do not greatly 
affect the beat frequency, because both f, and f,p,would increase or decrease. Those 
changes subtract out in fg =| fobs — fs |- 


Note: 

Industrial and Other Applications of Ultrasound 

Industrial, retail, and research applications of ultrasound are common. A few are 
discussed here. Ultrasonic cleaners have many uses. Jewelry, machined parts, and 
other objects that have odd shapes and crevices are immersed in a cleaning fluid that 
is agitated with ultrasound typically about 40 kHz in frequency. The intensity is great 
enough to cause cavitation, which is responsible for most of the cleansing action. 
Because cavitation-produced shock pressures are large and well transmitted in a fluid, 


they reach into small crevices where even a low-surface-tension cleaning fluid might 
not penetrate. 

Sonar is a familiar application of ultrasound. Sonar typically employs ultrasonic 
frequencies in the range from 30.0 to 100 kHz. Bats, dolphins, submarines, and even 
some birds use ultrasonic sonar. Echoes are analyzed to give distance and size 
information both for guidance and finding prey. In most sonar applications, the sound 
reflects quite well because the objects of interest have significantly different density 
than the medium in which they travel. When the Doppler shift is observed, velocity 
information can also be obtained. Submarine sonar can be used to obtain such 
information, and there is evidence that some bats also sense velocity from their 
echoes. 

Similarly, there are a range of relatively inexpensive devices that measure distance by 
timing ultrasonic echoes. Many cameras, for example, use such information to focus 
automatically. Some doors open when their ultrasonic ranging devices detect a nearby 
object, and certain home security lights turn on when their ultrasonic rangers observe 
motion. Ultrasonic “measuring tapes” also exist to measure such things as room 
dimensions. Sinks in public restrooms are sometimes automated with ultrasound 
devices to turn faucets on and off when people wash their hands. These devices 
reduce the spread of germs and can conserve water. 

Ultrasound is used for nondestructive testing in industry and by the military. Because 
ultrasound reflects well from any large change in density, it can reveal cracks and 
voids in solids, such as aircraft wings, that are too small to be seen with x-rays. For 
similar reasons, ultrasound is also good for measuring the thickness of coatings, 
particularly where there are several layers involved. 

Basic research in solid state physics employs ultrasound. Its attenuation is related to a 
number of physical characteristics, making it a useful probe. Among these 
characteristics are structural changes such as those found in liquid crystals, the 
transition of a material to a superconducting phase, as well as density and other 
properties. 

These examples of the uses of ultrasound are meant to whet the appetites of the 
curious, as well as to illustrate the underlying physics of ultrasound. There are many 
more applications, as you can easily discover for yourself. 


Exercise: 
Check Your Understanding 


Problem: 


Why is it possible to use ultrasound both to observe a fetus in the womb and also 
to destroy cancerous tumors in the body? 


Solution: 


Ultrasound can be used medically at different intensities. Lower intensities do not 
cause damage and are used for medical imaging. Higher intensities can pulverize 
and destroy targeted substances in the body, such as tumors. 


Section Summary 


e The acoustic impedance is defined as: 
Equation: 


Z = pv, 


p is the density of a medium through which the sound travels and v is the speed 
of sound through that medium. 

e The intensity reflection coefficient a, a measure of the ratio of the intensity of the 
wave reflected off a boundary between two media relative to the intensity of the 
incident wave, is given by 
Equation: 


(Z — Z1)’ 


a 
(21 + Ze) 


e The intensity reflection coefficient is a unitless quantity. 


Conceptual Questions 


Exercise: 
Problem: 
If audible sound follows a rule of thumb similar to that for ultrasound, in terms of 
its absorption, would you expect the high or low frequencies from your 


neighbor’s stereo to penetrate into your house? How does this expectation 
compare with your experience? 


Exercise: 
Problem: 
Elephants and whales are known to use infrasound to communicate over very 


large distances. What are the advantages of infrasound for long distance 
communication? 


Exercise: 
Problem: 
It is more difficult to obtain a high-resolution ultrasound image in the abdominal 


region of someone who is overweight than for someone who has a slight build. 
Explain why this statement is accurate. 


Exercise: 
Problem: 
Suppose you read that 210-dB ultrasound is being used to pulverize cancerous 


tumors. You calculate the intensity in watts per centimeter squared and find it is 
unreasonably high (10° W rl cm’). What is a possible explanation? 


Problems & Exercises 


Unless otherwise indicated, for problems in this section, assume that the speed of 
sound through human tissues is 1540 m/s. 
Exercise: 


Problem: 


What is the sound intensity level in decibels of ultrasound of intensity 
10° W / m”, used to pulverize tissue during surgery? 


Solution: 


170 dB 
Exercise: 
Problem: 
Is 155-dB ultrasound in the range of intensities used for deep heating? Calculate 


the intensity of this ultrasound and compare this intensity with values quoted in 
the text. 


Exercise: 
Problem: 


Find the sound intensity level in decibels of 2.00 x 10°? W/ m’” ultrasound used 
in medical diagnostics. 


Solution: 


103 dB 
Exercise: 
Problem: 
The time delay between transmission and the arrival of the reflected wave of a 


signal using ultrasound traveling through a piece of fat tissue was 0.13 ms. At 
what depth did this reflection occur? 


Exercise: 
Problem: 
In the clinical use of ultrasound, transducers are always coupled to the skin by a 
thin layer of gel or oil, replacing the air that would otherwise exist between the 
transducer and the skin. (a) Using the values of acoustic impedance given in 
[link] calculate the intensity reflection coefficient between transducer material 
and air. (b) Calculate the intensity reflection coefficient between transducer 
material and gel (assuming for this problem that its acoustic impedance is 


identical to that of water). (c) Based on the results of your calculations, explain 
why the gel is used. 


Solution: 
(a) 1.00 
(b) 0.823 
(c) Gel is used to facilitate the transmission of the ultrasound between the 
transducer and the patient’s body. 
Exercise: 
Problem: 
(a) Calculate the minimum frequency of ultrasound that will allow you to see 


details as small as 0.250 mm in human tissue. (b) What is the effective depth to 
which this sound is effective as a diagnostic probe? 


Exercise: 


Problem: 


(a) Find the size of the smallest detail observable in human tissue with 20.0-MHz 
ultrasound. (b) Is its effective penetration depth great enough to examine the 
entire eye (about 3.00 cm is needed)? (c) What is the wavelength of such 
ultrasound in 0°C air? 


Solution: 
(a) 77.0 pm 
(b) Effective penetration depth = 3.85 cm, which is enough to examine the eye. 


(c) 16.6 pm 

Exercise: 
Problem: 
(a) Echo times are measured by diagnostic ultrasound scanners to determine 
distances to reflecting surfaces in a patient. What is the difference in echo times 
for tissues that are 3.50 and 3.60 cm beneath the surface? (This difference is the 
minimum resolving time for the scanner to see details as small as 0.100 cm, or 
1.00 mm. Discrimination of smaller time differences is needed to see smaller 
details.) (b) Discuss whether the period T of this ultrasound must be smaller than 


the minimum time resolution. If so, what is the minimum frequency of the 
ultrasound and is that out of the normal range for diagnostic ultrasound? 


Exercise: 


Problem: 


(a) How far apart are two layers of tissue that produce echoes having round-trip 
times (used to measure distances) that differ by 0.750 ps ? (b) What minimum 
frequency must the ultrasound have to see detail this small? 


Solution: 
(a) 5.78 x 104m 


(b) 2.67 x 10° Hz 


Exercise: 


Problem: 


(a) A bat uses ultrasound to find its way among trees. If this bat can detect echoes 
1.00 ms apart, what minimum distance between objects can it detect? (b) Could 
this distance explain the difficulty that bats have finding an open door when they 
accidentally get into a house? 


Exercise: 


Problem: 


A dolphin is able to tell in the dark that the ultrasound echoes received from two 
sharks come from two different objects only if the sharks are separated by 3.50 
m, one being that much farther away than the other. (a) If the ultrasound has a 
frequency of 100 kHz, show this ability is not limited by its wavelength. (b) If 
this ability is due to the dolphin’s ability to detect the arrival times of echoes, 
what is the minimum time difference the dolphin can perceive? 


Solution: 


(a) vy = 1540 m/s = fA => A= SRE = 0.0154 m < 3.50 m. Because 


the wavelength is much shorter than the distance in question, the wavelength is 
not the limiting factor. 
(b) 4.55 ms 

Exercise: 
Problem: 
A diagnostic ultrasound echo is reflected from moving blood and returns with a 
frequency 500 Hz higher than its original 2.00 MHz. What is the velocity of the 


blood? (Assume that the frequency of 2.00 MHz is accurate to seven significant 
figures and 500 Hz is accurate to three significant figures. ) 


Exercise: 
Problem: 
Ultrasound reflected from an oncoming bloodstream that is moving at 30.0 cm/s 
is mixed with the original frequency of 2.50 MHz to produce beats. What is the 


beat frequency? (Assume that the frequency of 2.50 MHz is accurate to seven 
significant figures.) 


Solution: 


974 Hz 


(Note: extra digits were retained in order to show the difference.) 


Glossary 


acoustic impedance 
property of medium that makes the propagation of sound waves more difficult 


intensity reflection coefficient 
a measure of the ratio of the intensity of the wave reflected off a boundary 
between two media relative to the intensity of the incident wave 


Doppler-shifted ultrasound 
a medical technique to detect motion and determine velocity through the Doppler 
shift of an echo 


Introduction to Geometric Optics 
class="introduction" 


Geometric Optics 


Light from this page or screen is formed into an image by the lens of your 
eye, much as the lens of the camera that made this photograph. Mirrors, like 
lenses, can also form images that in turn are captured by your eye. 


Image 
seen aS a 
result of 
reflectio 
n of light 

ona 
plane 
smooth 
surface. 

(credit: 

NASA 
Goddard 

Photo 

and 

Video, 

via 

Flickr) 


Our lives are filled with light. Through vision, the most valued of our 
senses, light can evoke spiritual emotions, such as when we view a 
magnificent sunset or glimpse a rainbow breaking through the clouds. Light 
can also simply amuse us in a theater, or warn us to stop at an intersection. 
It has innumerable uses beyond vision. Light can carry telephone signals 
through glass fibers or cook a meal in a solar oven. Life itself could not 
exist without light’s energy. From photosynthesis in plants to the sun 
warming a cold-blooded animal, its supply of energy is vital. 


Double Rainbow over the bay 


of Pocitos in Montevideo, 
Uruguay. (credit: Madrax, 
Wikimedia Commons) 


We already know that visible light is the type of electromagnetic waves to 
which our eyes respond. That knowledge still leaves many questions 
regarding the nature of light and vision. What is color, and how do our eyes 
detect it? Why do diamonds sparkle? How does light travel? How do lenses 
and mirrors form images? These are but a few of the questions that are 
answered by the study of optics. Optics is the branch of physics that deals 
with the behavior of visible light and other electromagnetic waves. In 
particular, optics is concerned with the generation and propagation of light 
and its interaction with matter. What we have already learned about the 
generation of light in our study of heat transfer by radiation will be 
expanded upon in later topics, especially those on atomic physics. Now, we 
will concentrate on the propagation of light and its interaction with matter. 


It is convenient to divide optics into two major parts based on the size of 
objects that light encounters. When light interacts with an object that is 
several times as large as the light’s wavelength, its observable behavior is 
like that of a ray; it does not prominently display its wave characteristics. 
We call this part of optics “geometric optics.” This chapter will concentrate 
on such situations. When light interacts with smaller objects, it has very 
prominent wave characteristics, such as constructive and destructive 
interference. Wave Optics will concentrate on such situations. 


The Ray Aspect of Light 
e List the ways by which light travels from a source to another location. 


There are three ways in which light can travel from a source to another 
location. (See [link].) It can come directly from the source through empty 
space, such as from the Sun to Earth. Or light can travel through various 
media, such as air and glass, to the person. Light can also arrive after being 
reflected, such as by a mirror. In all of these cases, light is modeled as 
traveling in straight lines called rays. Light may change direction when it 
encounters objects (such as a mirror) or in passing from one material to 
another (such as in passing from air to glass), but it then continues in a 
straight line or as a ray. The word ray comes from mathematics and here 
means a straight line that originates at some point. It is acceptable to 
visualize light rays as laser rays (or even science fiction depictions of ray 
guns). 


Note: 

Ray 

The word “ray” comes from mathematics and here means a straight line 
that originates at some point. 


Three methods for light to 
travel from a source to 
another location. (a) Light 
reaches the upper 
atmosphere of Earth 
traveling through empty 
space directly from the 
source. (b) Light can 
reach a person in one of 
two ways. It can travel 
through media like air 
and glass. It can also 
reflect from an object like 
a mirror. In the situations 
shown here, light 
interacts with objects 
large enough that it 
travels in straight lines, 
like a ray. 


Experiments, as well as our own experiences, show that when light interacts 
with objects several times as large as its wavelength, it travels in straight 
lines and acts like a ray. Its wave characteristics are not pronounced in such 
situations. Since the wavelength of light is less than a micron (a thousandth 
of a millimeter), it acts like a ray in the many common situations in which it 
encounters objects larger than a micron. For example, when light 
encounters anything we can observe with unaided eyes, such as a mirror, it 
acts like a ray, with only subtle wave characteristics. We will concentrate on 
the ray characteristics in this chapter. 


Since light moves in straight lines, changing directions when it interacts 
with materials, it is described by geometry and simple trigonometry. This 
part of optics, where the ray aspect of light dominates, is therefore called 
geometric optics. There are two laws that govern how light changes 
direction when it interacts with matter. These are the law of reflection, for 


situations in which light bounces off matter, and the law of refraction, for 
situations in which light passes through matter. 


Note: 

Geometric Optics 

The part of optics dealing with the ray aspect of light is called geometric 
optics. 


Section Summary 


e A straight line that originates at some point is called a ray. 

e The part of optics dealing with the ray aspect of light is called 
geometric optics. 

e Light can travel in three ways from a source to another location: (1) 
directly from the source through empty space; (2) through various 
media; (3) after being reflected from a mirror. 


Problems & Exercises 


Exercise: 


Problem: 


Suppose a man stands in front of a mirror as shown in [link]. His eyes 
are 1.65 m above the floor, and the top of his head is 0.13 m higher. 
Find the height above the floor of the top and bottom of the smallest 
mirror in which he can see both the top of his head and his feet. How is 
this distance related to the man’s height? 


A full-length 
mirror is one 


in which you 

can see all of 
yourself. It 
need not be 

as big as you, 

and its size is 
independent 

of your 
distance from 
it. 


Solution: 
Top 1.715 m from floor, bottom 0.825 m from floor. Height of mirror 
is 0.890 m, or precisely one-half the height of the person. 

Glossary 


ray 
straight line that originates at some point 


geometric optics 
part of optics dealing with the ray aspect of light 


The Law of Reflection 
e Explain reflection of light from polished and rough surfaces. 


Whenever we look into a mirror, or squint at sunlight glinting from a lake, 
we are seeing a reflection. When you look at this page, too, you are seeing 
light reflected from it. Large telescopes use reflection to form an image of 
stars and other astronomical objects. 


The law of reflection is illustrated in [link], which also shows how the 
angles are measured relative to the perpendicular to the surface at the point 
where the light ray strikes. We expect to see reflections from smooth 
surfaces, but [link] illustrates how a rough surface reflects light. Since the 
light strikes different parts of the surface at different angles, it is reflected in 
many different directions, or diffused. Diffused light is what allows us to 
see a Sheet of paper from any angle, as illustrated in [link]. Many objects, 
such as people, clothing, leaves, and walls, have rough surfaces and can be 
seen from all sides. A mirror, on the other hand, has a smooth surface 
(compared with the wavelength of light) and reflects light at specific angles, 
as illustrated in [link]. When the moon reflects from a lake, as shown in 


[link], a combination of these effects takes place. 
Perpendicular 
to surface 
Reflected ray 


wi 


Incident ray 


The law of reflection 


states that the angle of 
reflection equals the 
angle of incidence— 

0, = 6;. The angles are 
measured relative to 
the perpendicular to 


the surface at the point 
where the ray strikes 
the surface. 


Light is diffused when it 
reflects from a rough 
surface. Here many 
parallel rays are incident, 
but they are reflected at 
many different angles 
since the surface is rough. 


When a sheet of paper is 
illuminated with many 
parallel incident rays, it 

can be seen at many 
different angles, because 


its surface is rough and 
diffuses the light. 


A mirror illuminated by 
many parallel rays 
reflects them in only one 
direction, since its surface 
is very smooth. Only the 
observer at a particular 
angle will see the 
reflected light. 


Moonlight is spread out 
when it is reflected by the 
lake, since the surface is 
shiny but uneven. (credit: 


Diego Torres Silvestre, 
Flickr) 


The law of reflection is very simple: The angle of reflection equals the 
angle of incidence. 


Note: 
The Law of Reflection 
The angle of reflection equals the angle of incidence. 


When we see ourselves in a mirror, it appears that our image is actually 
behind the mirror. This is illustrated in [link]. We see the light coming from 
a direction determined by the law of reflection. The angles are such that our 
image is exactly the same distance behind the mirror as we stand away from 
the mirror. If the mirror is on the wall of a room, the images in it are all 
behind the mirror, which can make the room seem bigger. Although these 
mirror images make objects appear to be where they cannot be (like behind 
a solid wall), the images are not figments of our imagination. Mirror images 
can be photographed and videotaped by instruments and look just as they 
do with our eyes (optical instruments themselves). The precise manner in 
which images are formed by mirrors and lenses will be treated in later 
sections of this chapter. 


Our image in a mirror is 
behind the mirror. The two 
rays shown are those that 
strike the mirror at just the 
correct angles to be reflected 
into the eyes of the person. 
The image appears to be in 
the direction the rays are 
coming from when they 
enter the eyes. 


Note: 

Take-Home Experiment: Law of Reflection 

Take a piece of paper and shine a flashlight at an angle at the paper, as 
shown in [link]. Now shine the flashlight at a mirror at an angle. Do your 
observations confirm the predictions in [link] and [link]? Shine the 
flashlight on various surfaces and determine whether the reflected light is 
diffuse or not. You can choose a shiny metallic lid of a pot or your skin. 
Using the mirror and flashlight, can you confirm the law of reflection? You 
will need to draw lines on a piece of paper showing the incident and 
reflected rays. (This part works even better if you use a laser pencil.) 


Section Summary 


e The angle of reflection equals the angle of incidence. 

e A mirror has a smooth surface and reflects light at specific angles. 
Light is diffused when it reflects from a rough surface. 

e Mirror images can be photographed and videotaped by instruments. 


Conceptual Questions 


Exercise: 


Problem: 


Using the law of reflection, explain how powder takes the shine off of 
a person’s nose. What is the name of the optical effect? 


Problems & Exercises 


Exercise: 


Problem: 


Show that when light reflects from two mirrors that meet each other at 


aright angle, the outgoing ray is parallel to the incoming ray, as 
illustrated in the following figure. 


A corner reflector 
sends the reflected 
ray back ina 
direction parallel to 
the incident ray, 
independent of 
incoming direction. 


Exercise: 
Problem: 
Light shows staged with lasers use moving mirrors to swing beams and 


create colorful effects. Show that a light ray reflected from a mirror 
changes direction by 20 when the mirror is rotated by an angle 0. 


Exercise: 


Problem: 


A flat mirror is neither converging nor diverging. To prove this, 
consider two rays originating from the same point and diverging at an 
angle @. Show that after striking a plane mirror, the angle between their 
directions remains 0. 


A flat mirror neither 
converges nor diverges 
light rays. Two rays 
continue to diverge at the 
same angle after 
reflection. 


Glossary 


mirror 


smooth surface that reflects light at specific angles, forming an image 
of the person or object in front of it 


law of reflection 
angle of reflection equals the angle of incidence 


The Law of Refraction 


e Determine the index of refraction, given the speed of light in a 
medium. 


It is easy to notice some odd things when looking into a fish tank. For 
example, you may see the same fish appearing to be in two different places. 
(See [link].) This is because light coming from the fish to us changes 
direction when it leaves the tank, and in this case, it can travel two different 
paths to get to our eyes. The changing of a light ray’s direction (loosely 
called bending) when it passes through variations in matter is called 
refraction. Refraction is responsible for a tremendous range of optical 
phenomena, from the action of lenses to voice transmission through optical 
fibers. 


Note: 

Refraction 

The changing of a light ray’s direction (loosely called bending) when it 
passes through variations in matter is called refraction. 


Note: 

Speed of Light 

The speed of light c not only affects refraction, it is one of the central 
concepts of Einstein’s theory of relativity. As the accuracy of the 
measurements of the speed of light were improved, c was found not to 
depend on the velocity of the source or the observer. However, the speed of 
light does vary in a precise manner with the material it traverses. These 
facts have far-reaching implications, as we will see in Special Relativity. It 
makes connections between space and time and alters our expectations that 
all observers measure the same time for the same event, for example. The 
speed of light is so important that its value in a vacuum is one of the most 
fundamental constants in nature as well as being one of the four 
fundamental SI units. 


Looking at the fish 
tank as shown, we 
can see the same 
fish in two different 
locations, because 
light changes 
directions when it 
passes from water 
to air. In this case, 
the light can reach 
the observer by two 
different paths, and 
so the fish seems to 
be in two different 
places. This 
bending of light is 
called refraction 
and is responsible 
for many optical 
phenomena. 


Why does light change direction when passing from one material (medium) 
to another? It is because light changes speed when going from one material 


to another. So before we study the law of refraction, it is useful to discuss 
the speed of light and how it varies in different media. 


The Speed of Light 


Early attempts to measure the speed of light, such as those made by Galileo, 
determined that light moved extremely fast, perhaps instantaneously. The 
first real evidence that light traveled at a finite speed came from the Danish 
astronomer Ole Roemer in the late 17th century. Roemer had noted that the 
average orbital period of one of Jupiter’s moons, as measured from Earth, 
varied depending on whether Earth was moving toward or away from 
Jupiter. He correctly concluded that the apparent change in period was due 
to the change in distance between Earth and Jupiter and the time it took 
light to travel this distance. From his 1676 data, a value of the speed of light 
was calculated to be 2.26 x 10° m/s (only 25% different from today’s 
accepted value). In more recent times, physicists have measured the speed 
of light in numerous ways and with increasing accuracy. One particularly 
direct method, used in 1887 by the American physicist Albert Michelson 
(1852-1931), is illustrated in [link]. Light reflected from a rotating set of 
mirrors was reflected from a stationary mirror 35 km away and returned to 
the rotating mirrors. The time for the light to travel can be determined by 
how fast the mirrors must rotate for the light to be returned to the observer’s 
eye. 


Observer 
x 


mm 
Eight-sided i 
; : Stationary 
rotating mirror mirror 
Light (e) 
source 
Stage 1 Stage 2 


xe 


| 35 km —— 


Stage 3 


A schematic of early apparatus used by 
Michelson and others to determine the 
speed of light. As the mirrors rotate, the 
reflected ray is only briefly directed at 
the stationary mirror. The returning ray 
will be reflected into the observer's eye 
only if the next mirror has rotated into 
the correct position just as the ray 
returns. By measuring the correct rotation 
rate, the time for the round trip can be 
measured and the speed of light 
calculated. Michelson’s calculated value 
of the speed of light was only 0.04% 
different from the value used today. 


The speed of light is now known to great precision. In fact, the speed of 
light in a vacuum c is so important that it is accepted as one of the basic 
physical quantities and has the fixed value 

Equation: 


c = 2.99792458 x 10° m/s = 3.00 x 10° m/s, 


where the approximate value of 3.00 x 10° m /s is used whenever three- 
digit accuracy is sufficient. The speed of light through matter is less than it 
is in a vacuum, because light interacts with atoms in a material. The speed 
of light depends strongly on the type of material, since its interaction with 
different atoms, crystal lattices, and other substructures varies. We define 
the index of refraction n of a material to be 

Equation: 


where v is the observed speed of light in the material. Since the speed of 
light is always less than c in matter and equals c only in a vacuum, the 
index of refraction is always greater than or equal to one. 


Note: 
Value of the Speed of Light 
Equation: 
c = 2.99792458 x 10° m/s = 3.00 x 10° m/s 
Note: 


Index of Refraction 
Equation: 


Se 


That is, n > 1. [link] gives the indices of refraction for some representative 
substances. The values are listed for a particular wavelength of light, 
because they vary slightly with wavelength. (This can have important 
effects, such as colors produced by a prism.) Note that for gases, n is close 
to 1.0. This seems reasonable, since atoms in gases are widely separated 
and light travels at c in the vacuum between atoms. It is common to take 

n = 1 for gases unless great precision is needed. Although the speed of 
light v in a medium varies considerably from its value c in a vacuum, it is 
still a large speed. 


Medium n 


Gases at 0°C, 1 atm 


Air 1.000293 
Carbon dioxide 1.00045 
Hydrogen 1.000139 
Oxygen 1.000271 
Liquids at 20°C 

Benzene 1.501 


Carbon disulfide 1.628 


Medium 

Carbon tetrachloride 
Ethanol 

Glycerine 

Water, fresh 
Solids at 20°C 
Diamond 

Fluorite 

Glass, crown 
Glass, flint 

Ice at 20°C 
Polystyrene 
Plexiglas 

Quartz, crystalline 
Quartz, fused 
Sodium chloride 


Zircon 


Index of Refraction in Various Media 


Example: 

Speed of Light in Matter 

Calculate the speed of light in zircon, a material used in jewelry to imitate 
diamond. 

Strategy 

The speed of light in a material, v, can be calculated from the index of 
refraction n of the material using the equation n = c/v. 

Solution 

The equation for index of refraction states that n = c/v. Rearranging this 
to determine v gives 

Equation: 


Cc 
os 

n 

The index of refraction for zircon is given as 1.923 in [link], and c is given 


in the equation for speed of light. Entering these values in the last 
expression gives 


Equation: 
= 3.00 x 10° m/s 
a 1.923 
= 1.56 x 10° m/s. 
Discussion 


This speed is slightly larger than half the speed of light in a vacuum and is 
still high compared with speeds we normally experience. The only 
substance listed in [link] that has a greater index of refraction than zircon is 
diamond. We shall see later that the large index of refraction for zircon 
makes it sparkle more than glass, but less than diamond. 


Law of Refraction 


[link] shows how a ray of light changes direction when it passes from one 
medium to another. As before, the angles are measured relative to a 
perpendicular to the surface at the point where the light ray crosses it. 


(Some of the incident light will be reflected from the surface, but for now 
we will concentrate on the light that is transmitted.) The change in direction 
of the light ray depends on how the speed of light changes. The change in 
the speed of light is related to the indices of refraction of the media 
involved. In the situations shown in [link], medium 2 has a greater index of 
refraction than medium 1. This means that the speed of light is less in 
medium 2 than in medium 1. Note that as shown in [link](a), the direction 
of the ray moves closer to the perpendicular when it slows down. 
Conversely, as shown in [link](b), the direction of the ray moves away from 
the perpendicular when it speeds up. The path is exactly reversible. In both 
cases, you can imagine what happens by thinking about pushing a lawn 
mower from a footpath onto grass, and vice versa. Going from the footpath 
to grass, the front wheels are slowed and pulled to the side as shown. This is 
the same change in direction as for light when it goes from a fast medium to 
a slow one. When going from the grass to the footpath, the front wheels can 
move faster and the mower changes direction as shown. This, too, is the 
same change in direction as for light going from slow to fast. 


Sidewalk (n,) 


Q\ : Sidewalk (n;) 
Incident NS. 


(a) (b) 


The change in direction of a light ray depends on how 
the speed of light changes when it crosses from one 
medium to another. The speed of light is greater in 
medium 1 than in medium 2 in the situations shown here. 
(a) A ray of light moves closer to the perpendicular when 
it slows down. This is analogous to what happens when a 
lawn mower goes from a footpath to grass. (b) A ray of 


light moves away from the perpendicular when it speeds 
up. This is analogous to what happens when a lawn 
mower goes from grass to footpath. The paths are exactly 
reversible. 


The amount that a light ray changes its direction depends both on the 
incident angle and the amount that the speed changes. For a ray at a given 
incident angle, a large change in speed causes a large change in direction, 
and thus a large change in angle. The exact mathematical relationship is the 
law of refraction, or “Snell’s Law,” which is stated in equation form as 
Equation: 


n, sin 0; = ny sin Oo. 


Here nj and nz are the indices of refraction for medium 1 and 2, and 0; and 
9, are the angles between the rays and the perpendicular in medium 1 and 2, 
as shown in [link]. The incoming ray is called the incident ray and the 
outgoing ray the refracted ray, and the associated angles the incident angle 
and the refracted angle. The law of refraction is also called Snell’s law after 
the Dutch mathematician Willebrord Snell (1591-1626), who discovered it 
in 1621. Snell’s experiments showed that the law of refraction was obeyed 
and that a characteristic index of refraction n could be assigned to a given 
medium. Snell was not aware that the speed of light varied in different 
media, but through experiments he was able to determine indices of 
refraction from the way light rays changed direction. 


Note: 
The Law of Refraction 
Equation: 


nN, sin 6; = ny sin 4, 


Note: 

Take-Home Experiment: A Broken Pencil 

A classic observation of refraction occurs when a pencil is placed in a glass 
half filled with water. Do this and observe the shape of the pencil when 
you look at the pencil sideways, that is, through air, glass, water. Explain 
your observations. Draw ray diagrams for the situation. 


Example: 

Determine the Index of Refraction from Refraction Data 

Find the index of refraction for medium 2 in [link](a), assuming medium 1 
is air and given the incident angle is 30.0° and the angle of refraction is 
Oe 

Strategy 

The index of refraction for air is taken to be 1 in most cases (and up to four 
significant figures, it is 1.000). Thus n; = 1.00 here. From the given 
information, 0; = 30.0° and 62 = 22.0°. With this information, the only 
unknown in Snell’s law is m9, so that it can be used to find this unknown. 
Solution 

Snell’s law is 

Equation: 


mn, sin 6; = ng sin Oo. 


Rearranging to isolate ny gives 
Equation: 


Entering known values, 
Equation: 


sin 22.0° 0.375 
1.33. 


1 00 sin 30.0° 0.500 


ng = 


Discussion 

This is the index of refraction for water, and Snell could have determined it 
by measuring the angles and performing this calculation. He would then 
have found 1.33 to be the appropriate index of refraction for water in all 
other situations, such as when a ray passes from water to glass. Today we 
can verify that the index of refraction is related to the speed of light in a 
medium by measuring that speed directly. 


Example: 

A Larger Change in Direction 

Suppose that in a situation like that in [link], light goes from air to 
diamond and that the incident angle is 30.0°. Calculate the angle of 
refraction 92 in the diamond. 

Strategy 

Again the index of refraction for air is taken to be n, = 1.00, and we are 
given 8; = 30.0°. We can look up the index of refraction for diamond in 
[link], finding nz = 2.419. The only unknown in Snell’s law is 62, which 
we wish to determine. 

Solution 

Solving Snell’s law for sin 02 yields 

Equation: 


: Lear 
sin 65 = —sin 9}. 
n2 


Entering known values, 
Equation: 


sin 05 = 


0 
5 sin 30.0°= (0.413 (0.500) = 0.207. 


The angle is thus 
Equation: 


6, = sin 10.207 = 11.9°. 


Discussion 

For the same 30° angle of incidence, the angle of refraction in diamond is 
significantly smaller than in water (11.9° rather than 22°—see the 
preceding example). This means there is a larger change in direction in 
diamond. The cause of a large change in direction is a large change in the 
index of refraction (or speed). In general, the larger the change in speed, 
the greater the effect on the direction of the ray. 


Section Summary 


e The changing of a light ray’s direction when it passes through 
variations in matter is called refraction. 

e The speed of light in vacuum 
c = 2.99792458 x 10° m/s = 3.00 x 10° m/s. 

e Index of refraction n = ~, where v is the speed of light in the 


material, c is the speed of light in vacuum, and n is the index of 
refraction. 

e Snell’s law, the law of refraction, is stated in equation form as 
ny, sin 0; = nz sin Oo. 


Conceptual Questions 


Exercise: 
Problem: 
Diffusion by reflection from a rough surface is described in this 


chapter. Light can also be diffused by refraction. Describe how this 
occurs in a specific situation, such as light interacting with crushed ice. 


Exercise: 


Problem: 


Why is the index of refraction always greater than or equal to 1? 


Exercise: 


Problem: 


Does the fact that the light flash from lightning reaches you before its 
sound prove that the speed of light is extremely large or simply that it 
is greater than the speed of sound? Discuss how you could use this 
effect to get an estimate of the speed of light. 


Exercise: 
Problem: 
Will light change direction toward or away from the perpendicular 
when it goes from air to water? Water to glass? Glass to air? 
Exercise: 
Problem: 
Explain why an object in water always appears to be at a depth 


shallower than it actually is? Why do people sometimes sustain neck 
and spinal injuries when diving into unfamiliar ponds or waters? 


Exercise: 
Problem: 
Explain why a person’s legs appear very short when wading in a pool. 


Justify your explanation with a ray diagram showing the path of rays 
from the feet to the eye of an observer who is out of the water. 


Exercise: 


Problem: Why is the front surface of a thermometer curved as shown? 


The curved surface 
of the thermometer 
serves a purpose. 


Exercise: 


Problem: 


Suppose light were incident from air onto a material that had a 
negative index of refraction, say —1.3; where does the refracted light 
ray go? 


Problems & Exercises 


Exercise: 


Problem: What is the speed of light in water? In glycerine? 
Solution: 

2.25 x 108 m/s in water 

2.04 x 10° m/s in glycerine 


Exercise: 


Problem: What is the speed of light in air? In crown glass? 
Exercise: 

Problem: 

Calculate the index of refraction for a medium in which the speed of 


light is 2.012 x 10° m /s, and identify the most likely substance based 
on [link]. 


Solution: 


1.490, polystyrene 
Exercise: 


Problem: 


In what substance in [link] is the speed of light 2.290 x 10° m /s? 
Exercise: 

Problem: 

There was a major collision of an asteroid with the Moon in medieval 

times. It was described by monks at Canterbury Cathedral in England 

as a red glow on and around the Moon. How long after the asteroid hit 


the Moon, which is 3.84 x 10° km away, would the light first arrive 
on Earth? 


Solution: 


1.288 
Exercise: 


Problem: 


A scuba diver training in a pool looks at his instructor as shown in 
[link]. What angle does the ray from the instructor’s face make with 
the perpendicular to the water at the point where the ray enters? The 
angle between the ray in the water and the perpendicular to the water is 
Zoi". 


A scuba diver in a pool and his 
trainer look at each other. 


Exercise: 
Problem: 
Components of some computers communicate with each other through 
optical fibers having an index of refraction n = 1.55. What time in 


nanoseconds is required for a signal to travel 0.200 m through such a 
fiber? 


Solution: 


1.03 ns 


Exercise: 


Problem: 


(a) Given that the angle between the ray in the water and the 
perpendicular to the water is 25.0°, and using information in [link], 
find the height of the instructor’s head above the water, noting that you 
will first have to calculate the angle of incidence. (b) Find the apparent 
depth of the diver’s head below water as seen by the instructor. 


Exercise: 


Problem: 


Suppose you have an unknown clear substance immersed in water, and 
you wish to identify it by finding its index of refraction. You arrange to 
have a beam of light enter it at an angle of 45.0°, and you observe the 
angle of refraction to be 40.3°. What is the index of refraction of the 
substance and its likely identity? 


Solution: 


n = 1.46, fused quartz 
Exercise: 


Problem: 


On the Moon’s surface, lunar astronauts placed a corner reflector, off 
which a laser beam is periodically reflected. The distance to the Moon 
is calculated from the round-trip time. What percent correction is 
needed to account for the delay in time due to the slowing of light in 
Earth’s atmosphere? Assume the distance to the Moon is precisely 
3.84 x 10° m, and Earth’s atmosphere (which varies in density with 
altitude) is equivalent to a layer 30.0 km thick with a constant index of 
refraction n = 1.000293. 


Exercise: 


Problem: 


Suppose [link] represents a ray of light going from air through crown 
glass into water, such as going into a fish tank. Calculate the amount 

the ray is displaced by the glass (Ax), given that the incident angle is 
40.0° and the glass is 1.00 cm thick. 


Exercise: 


Problem: 


[link] shows a ray of light passing from one medium into a second and 
then a third. Show that 03 is the same as it would be if the second 
medium were not present (provided total internal reflection does not 
occur). 


A ray of light passes from 
one medium to a third by 
traveling through a 
second. The final 
direction is the same as if 
the second medium were 
not present, but the ray is 
displaced by Az (shown 
exaggerated). 


Exercise: 


Problem: Unreasonable Results 


Suppose light travels from water to another substance, with an angle of 
incidence of 10.0° and an angle of refraction of 14.9°. (a) What is the 
index of refraction of the other substance? (b) What is unreasonable 
about this result? (c) Which assumptions are unreasonable or 
inconsistent? 


Solution: 
(a) 0.898 
(b) Can’t have n < 1.00 since this would imply a speed greater than c. 


(c) Refracted angle is too big relative to the angle of incidence. 


Exercise: 


Problem: Construct Your Own Problem 


Consider sunlight entering the Earth’s atmosphere at sunrise and sunset 
—that is, at a 90° incident angle. Taking the boundary between nearly 
empty space and the atmosphere to be sudden, calculate the angle of 
refraction for sunlight. This lengthens the time the Sun appears to be 
above the horizon, both at sunrise and sunset. Now construct a 
problem in which you determine the angle of refraction for different 
models of the atmosphere, such as various layers of varying density. 
Your instructor may wish to guide you on the level of complexity to 
consider and on how the index of refraction varies with air density. 


Exercise: 
Problem: Unreasonable Results 


Light traveling from water to a gemstone strikes the surface at an angle 
of 80.0° and has an angle of refraction of 15.2°. (a) What is the speed 


of light in the gemstone? (b) What is unreasonable about this result? 
(c) Which assumptions are unreasonable or inconsistent? 


Solution: 
(a)*eaa 


(b) Speed of light too slow, since index is much greater than that of 
diamond. 


(c) Angle of refraction is unreasonable relative to the angle of 
incidence. 


Glossary 


refraction 
changing of a light ray’s direction when it passes through variations in 
matter 


index of refraction 
for a material, the ratio of the speed of light in vacuum to that in the 
material 


Total Internal Reflection 


e Explain the phenomenon of total internal reflection. 
¢ Describe the workings and uses of fiber optics. 
e Analyze the reason for the sparkle of diamonds. 


A good-quality mirror may reflect more than 90% of the light that falls on 
it, absorbing the rest. But it would be useful to have a mirror that reflects all 
of the light that falls on it. Interestingly, we can produce total reflection 
using an aspect of refraction. 


Consider what happens when a ray of light strikes the surface between two 
materials, such as is shown in [link](a). Part of the light crosses the 
boundary and is refracted; the rest is reflected. If, as shown in the figure, the 
index of refraction for the second medium is less than for the first, the ray 
bends away from the perpendicular. (Since n; > ng, the angle of refraction 
is greater than the angle of incidence—that is, 2 > 61.) Now imagine what 
happens as the incident angle is increased. This causes 02 to increase also. 
The largest the angle of refraction @2can be is 90°, as shown in [link](b).The 
critical angle@, for a combination of materials is defined to be the incident 
angle 0; that produces an angle of refraction of 90°. That is, @, is the 
incident angle for which 6) = 90°. If the incident angle 0; is greater than 
the critical angle, as shown in [link](c), then all of the light is reflected back 
into medium 1, a condition called total internal reflection. 


Note: 

Critical Angle 

The incident angle 0; that produces an angle of refraction of 90° is called 
the critical angle, 0¢. 


Refracted ray 


nte 


1 
| reflection 


(a) A ray of light 
crosses a boundary 
where the speed of 
light increases and 

the index of 
refraction 
decreases. That is, 
ng <n ,. The ray 
bends away from 
the perpendicular. 
(b) The critical 


angle @, is the one 

for which the angle 

of refraction is . (c) 

Total internal 

reflection occurs 

when the incident 
angle is greater 
than the critical 

angle. 


Snell’s law states the relationship between angles and indices of refraction. 
It is given by 
Equation: 


n, sin 0; = ny sin 05. 


When the incident angle equals the critical angle (9; = 9,), the angle of 
refraction is 90° (@2 = 90°). Noting that sin 90°=1, Snell’s law in this case 
becomes 

Equation: 


ny, sin 0, = no. 


The critical angle 0, for a given combination of materials is thus 
Equation: 


0.= sin 1(n2/n1) for n1 > no. 


Total internal reflection occurs for any incident angle greater than the 
critical angle @,, and it can only occur when the second medium has an 
index of refraction less than the first. Note the above equation is written for 
a light ray that travels in medium 1 and reflects from medium 2, as shown 
in the figure. 


Example: 

How Big is the Critical Angle Here? 

What is the critical angle for light traveling in a polystyrene (a type of 
plastic) pipe surrounded by air? 

Strategy 

The index of refraction for polystyrene is found to be 1.49 in [link], and the 
index of refraction of air can be taken to be 1.00, as before. Thus, the 
condition that the second medium (air) has an index of refraction less than 
the first (plastic) is satisfied, and the equation 0, = sin~'(nz/n;) can be 
used to find the critical angle 0,. Here, then, n2 = 1.00 and n; = 1.49. 
Solution 

The critical angle is given by 

Equation: 


6, = sin ‘(n2/n}). 


Substituting the identified values gives 
Equation: 


6. = sin 1(1.00/1.49) = sin 1(0.671) 
42.2°. 


Discussion 

This means that any ray of light inside the plastic that strikes the surface at 
an angle greater than 42.2° will be totally reflected. This will make the 
inside surface of the clear plastic a perfect mirror for such rays without any 
need for the silvering used on common mirrors. Different combinations of 
materials have different critical angles, but any combination with n; > n2 
can produce total internal reflection. The same calculation as made here 
shows that the critical angle for a ray going from water to air is 48.6°, 
while that from diamond to air is 24.4°, and that from flint glass to crown 
glass is 66.3°. There is no total reflection for rays going in the other 
direction—for example, from air to water—since the condition that the 
second medium must have a smaller index of refraction is not satisfied. A 
number of interesting applications of total internal reflection follow. 


Fiber Optics: Endoscopes to Telephones 


Fiber optics is one application of total internal reflection that is in wide use. 
In communications, it is used to transmit telephone, internet, and cable TV 
signals. Fiber optics employs the transmission of light down fibers of 
plastic or glass. Because the fibers are thin, light entering one is likely to 
strike the inside surface at an angle greater than the critical angle and, thus, 
be totally reflected (See [link].) The index of refraction outside the fiber 
must be smaller than inside, a condition that is easily satisfied by coating 
the outside of the fiber with a material having an appropriate refractive 
index. In fact, most fibers have a varying refractive index to allow more 
light to be guided along the fiber through total internal refraction. Rays are 
reflected around corners as shown, making the fibers into tiny light pipes. 


Light entering a 
thin fiber may 
strike the inside 
surface at large or 
grazing angles and 
is completely 
reflected if these 
angles exceed the 
critical angle. Such 
rays continue down 
the fiber, even 
following it around 
corners, since the 
angles of reflection 


and incidence 
remain large. 


Bundles of fibers can be used to transmit an image without a lens, as 
illustrated in [link]. The output of a device called an endoscope is shown in 
[link ](b). Endoscopes are used to explore the body through various orifices 
or minor incisions. Light is transmitted down one fiber bundle to illuminate 
internal parts, and the reflected light is transmitted back out through another 
to be observed. Surgery can be performed, such as arthroscopic surgery on 
the knee joint, employing cutting tools attached to and observed with the 
endoscope. Samples can also be obtained, such as by lassoing an intestinal 
polyp for external examination. 


Fiber optics has revolutionized surgical techniques and observations within 
the body. There are a host of medical diagnostic and therapeutic uses. The 
flexibility of the fiber optic bundle allows it to navigate around difficult and 
small regions in the body, such as the intestines, the heart, blood vessels, 
and joints. Transmission of an intense laser beam to burn away obstructing 
plaques in major arteries as well as delivering light to activate 
chemotherapy drugs are becoming commonplace. Optical fibers have in 
fact enabled microsurgery and remote surgery where the incisions are small 
and the surgeon’s fingers do not need to touch the diseased tissue. 


(a) An image is 
transmitted by a bundle of 
fibers that have fixed 


neighbors. (b) An 

endoscope is used to 

probe the body, both 
transmitting light to the 
interior and returning an 

image such as the one 

shown. (credit: 
Med_Chaos, Wikimedia 
Commons) 


Fibers in bundles are surrounded by a cladding material that has a lower 
index of refraction than the core. (See [link].) The cladding prevents light 
from being transmitted between fibers in a bundle. Without cladding, light 
could pass between fibers in contact, since their indices of refraction are 
identical. Since no light gets into the cladding (there is total internal 
reflection back into the core), none can be transmitted between clad fibers 
that are in contact with one another. The cladding prevents light from 
escaping out of the fiber; instead most of the light is propagated along the 
length of the fiber, minimizing the loss of signal and ensuring that a quality 
image is formed at the other end. The cladding and an additional protective 
layer make optical fibers flexible and durable. 


Light ray 


Cladding 


Fibers in bundles 
are clad by a 
material that has a 
lower index of 
refraction than the 
core to ensure total 
internal reflection, 
even when fibers 
are in contact with 
one another. This 
shows a single fiber 
with its cladding. 


Note: 
Cladding 
The cladding prevents light from being transmitted between fibers in a 


bundle. 


Special tiny lenses that can be attached to the ends of bundles of fibers are 
being designed and fabricated. Light emerging from a fiber bundle can be 
focused and a tiny spot can be imaged. In some cases the spot can be 
scanned, allowing quality imaging of a region inside the body. Special 
minute optical filters inserted at the end of the fiber bundle have the 
capacity to image tens of microns below the surface without cutting the 
surface—non-intrusive diagnostics. This is particularly useful for 
determining the extent of cancers in the stomach and bowel. 


Most telephone conversations and Internet communications are now carried 
by laser signals along optical fibers. Extensive optical fiber cables have 
been placed on the ocean floor and underground to enable optical 
communications. Optical fiber communication systems offer several 
advantages over electrical (copper) based systems, particularly for long 


distances. The fibers can be made so transparent that light can travel many 
kilometers before it becomes dim enough to require amplification—much 
superior to copper conductors. This property of optical fibers is called low 
loss. Lasers emit light with characteristics that allow far more conversations 
in one fiber than are possible with electric signals on a single conductor. 
This property of optical fibers is called high bandwidth. Optical signals in 
one fiber do not produce undesirable effects in other adjacent fibers. This 
property of optical fibers is called reduced crosstalk. We shall explore the 
unique characteristics of laser radiation in a later chapter. 


Corner Reflectors and Diamonds 


A light ray that strikes an object consisting of two mutually perpendicular 
reflecting surfaces is reflected back exactly parallel to the direction from 
which it came. This is true whenever the reflecting surfaces are 
perpendicular, and it is independent of the angle of incidence. Such an 
object, shown in [link], is called a corner reflector, since the light bounces 
from its inside corner. Many inexpensive reflector buttons on bicycles, cars, 
and warning signs have corer reflectors designed to return light in the 
direction from which it originated. It was more expensive for astronauts to 
place one on the moon. Laser signals can be bounced from that corner 
reflector to measure the gradually increasing distance to the moon with 
great precision. 


(a) Astronauts 
placed a corner 
reflector on the 


moon to measure 
its gradually 
increasing orbital 
distance. (credit: 
NASA) (b) The 
bright spots on 
these bicycle safety 
reflectors are 
reflections of the 
flash of the camera 
that took this 
picture on a dark 
night. (credit: Julo, 
Wikimedia 
Commons) 


Corner reflectors are perfectly efficient when the conditions for total 
internal reflection are satisfied. With common materials, it is easy to obtain 
a critical angle that is less than 45°. One use of these perfect mirrors is in 
binoculars, as shown in [link]. Another use is in periscopes found in 
submarines. 


These binoculars 
employ commer 
reflectors with total 
internal reflection 
to get light to the 
observer’s eyes. 


The Sparkle of Diamonds 


Total internal reflection, coupled with a large index of refraction, explains 
why diamonds sparkle more than other materials. The critical angle for a 
diamond-to-air surface is only 24.4°, and so when light enters a diamond, it 
has trouble getting back out. (See [link].) Although light freely enters the 
diamond, it can exit only if it makes an angle less than 24.4°. Facets on 
diamonds are specifically intended to make this unlikely, so that the light 
can exit only in certain places. Good diamonds are very clear, so that the 
light makes many internal reflections and is concentrated at the few places 
it can exit—hence the sparkle. (Zircon is a natural gemstone that has an 
exceptionally large index of refraction, but not as large as diamond, so it is 


not as highly prized. Cubic zirconia is manufactured and has an even higher 
index of refraction (~ 2.17), but still less than that of diamond.) The colors 
you see emerging from a sparkling diamond are not due to the diamond’s 
color, which is usually nearly colorless. Those colors result from dispersion, 
the topic of Dispersion: The Rainbow and Prisms. Colored diamonds get 
their color from structural defects of the crystal lattice and the inclusion of 
minute quantities of graphite and other materials. The Argyle Mine in 
Western Australia produces around 90% of the world’s pink, red, 
champagne, and cognac diamonds, while around 50% of the world’s clear 
diamonds come from central and southern Africa. 


Critical angle 


A wg Diamond 
Total Air 
reflection 


Light cannot easily 
escape a diamond, 
because its critical 
angle with air is so 
small. Most reflections 
are total, and the facets 
are placed so that light 
can exit only in 
particular ways—thus 
concentrating the light 
and making the 
diamond sparkle. 


Note: 

PhET Explorations: Bending Light 

Explore bending of light between two media with different indices of 
refraction. See how changing from air to water to glass changes the 
bending angle. Play with prisms of different shapes and make rainbows. 


https://phet.colorado.edu/sims/html/bending-light/latest/bending- 
light en. html 


Section Summary 


The incident angle that produces an angle of refraction of 90° is called 
critical angle. 

Total internal reflection is a phenomenon that occurs at the boundary 
between two mediums, such that if the incident angle in the first 
medium is greater than the critical angle, then all the light is reflected 
back into that medium. 

Fiber optics involves the transmission of light down fibers of plastic or 
glass, applying the principle of total internal reflection. 

Endoscopes are used to explore the body through various orifices or 
minor incisions, based on the transmission of light through optical 
fibers. 

Cladding prevents light from being transmitted between fibers in a 
bundle. 

Diamonds sparkle due to total internal reflection coupled with a large 
index of refraction. 


Conceptual Questions 


Exercise: 


Problem: 


A ring with a colorless gemstone is dropped into water. The gemstone 
becomes invisible when submerged. Can it be a diamond? Explain. 


Exercise: 
Problem: 
A high-quality diamond may be quite clear and colorless, transmitting 


all visible wavelengths with little absorption. Explain how it can 
sparkle with flashes of brilliant color when illuminated by white light. 


Exercise: 
Problem: 
Is it possible that total internal reflection plays a role in rainbows? 
Explain in terms of indices of refraction and angles, perhaps referring 


to [link]. Some of us have seen the formation of a double rainbow. Is it 
physically possible to observe a triple rainbow? 


Double rainbows are not a very 
common observance. (credit: 
InvictusOU812, Flickr) 


Exercise: 


Problem: 


The most common type of mirage is an illusion that light from faraway 
objects is reflected by a pool of water that is not really there. Mirages 
are generally observed in deserts, when there is a hot layer of air near 
the ground. Given that the refractive index of air is lower for air at 
higher temperatures, explain how mirages can be formed. 


Problems & Exercises 


Exercise: 
Problem: 
Verify that the critical angle for light going from water to air is 48.6°, 


as discussed at the end of [link], regarding the critical angle for light 
traveling in a polystyrene (a type of plastic) pipe surrounded by air. 


Exercise: 
Problem: 
(a) At the end of [link], it was stated that the critical angle for light 


going from diamond to air is 24.4°. Verify this. (b) What is the critical 
angle for light going from zircon to air? 


Exercise: 


Problem: 


An optical fiber uses flint glass clad with crown glass. What is the 
critical angle? 


Solution: 


66.3° 


Exercise: 


Problem: 
At what minimum angle will you get total internal reflection of light 
traveling in water and reflected from ice? 

Exercise: 
Problem: 
Suppose you are using total internal reflection to make an efficient 
comer reflector. If there is air outside and the incident angle is 45.0°, 


what must be the minimum index of refraction of the material from 
which the reflector is made? 


Solution: 


> 1414 
Exercise: 


Problem: 


You can determine the index of refraction of a substance by 
determining its critical angle. (a) What is the index of refraction of a 
substance that has a critical angle of 68.4° when submerged in water? 
What is the substance, based on [link]? (b) What would the critical 
angle be for this substance in air? 


Exercise: 
Problem: 
A ray of light, emitted beneath the surface of an unknown liquid with 
air above it, undergoes total internal reflection as shown in [Link]. 


What is the index of refraction for the liquid and its likely 
identification? 


A light ray inside a liquid 
strikes the surface at the 
critical angle and 
undergoes total internal 
reflection. 


Solution: 


1.50, benzene 
Exercise: 
Problem: 
A light ray entering an optical fiber surrounded by air is first refracted 


and then reflected as shown in [link]. Show that if the fiber is made 
from crown glass, any incident ray will be totally internally reflected. 


A light ray enters the end 
of a fiber, the surface of 
which is perpendicular to 
its sides. Examine the 
conditions under which it 


may be totally internally 
reflected. 


Glossary 


critical angle 
incident angle that produces an angle of refraction of 90° 


fiber optics 
transmission of light down fibers of plastic or glass, applying the 
principle of total internal reflection 


comer reflector 
an object consisting of two mutually perpendicular reflecting surfaces, 
so that the light that enters is reflected back exactly parallel to the 
direction from which it came 


zircon 
natural gemstone with a large index of refraction 


Dispersion: The Rainbow and Prisms 


e Explain the phenomenon of dispersion and discuss its advantages and 
disadvantages. 


Everyone enjoys the spectacle of a rainbow glimmering against a dark stormy sky. How 
does sunlight falling on clear drops of rain get broken into the rainbow of colors we see? 
The same process causes white light to be broken into colors by a clear glass prism or a 

diamond. (See [link].) 


The colors of the 
rainbow (a) and those 
produced by a prism 
(b) are identical. 
(credit: Alfredo55, 
Wikimedia Commons; 
NASA) 


We see about six colors in a rainbow—red, orange, yellow, green, blue, and violet; 
sometimes indigo is listed, too. Those colors are associated with different wavelengths 
of light, as shown in [link]. When our eye receives pure-wavelength light, we tend to 
see only one of the six colors, depending on wavelength. The thousands of other hues 
we can sense in other situations are our eye’s response to various mixtures of 
wavelengths. White light, in particular, is a fairly uniform mixture of all visible 
wavelengths. Sunlight, considered to be white, actually appears to be a bit yellow 
because of its mixture of wavelengths, but it does contain all visible wavelengths. The 
sequence of colors in rainbows is the same sequence as the colors plotted versus 
wavelength in [link]. What this implies is that white light is spread out according to 


wavelength in a rainbow. Dispersion is defined as the spreading of white light into its 
full spectrum of wavelengths. More technically, dispersion occurs whenever there is a 
process that changes the direction of light in a manner that depends on wavelength. 
Dispersion, as a general phenomenon, can occur for any type of wave and always 
involves wavelength-dependent processes. 


Note: 

Dispersion 

Dispersion is defined to be the spreading of white light into its full spectrum of 
wavelengths. 


Visible light 
Orange Green Violet 
Infrared Red Yellow Blue Ultraviolet 
800 700 600 500 400 300 A (nm) 


Even though rainbows are associated with seven 
colors, the rainbow is a continuous distribution of 
colors according to wavelengths. 


Refraction is responsible for dispersion in rainbows and many other situations. The 
angle of refraction depends on the index of refraction, as we saw in The Law of 
Refraction. We know that the index of refraction n depends on the medium. But for a 
given medium, n also depends on wavelength. (See [link]. Note that, for a given 
medium, n increases as wavelength decreases and is greatest for violet light. Thus violet 
light is bent more than red light, as shown for a prism in [link](b), and the light is 
dispersed into the same sequence of wavelengths as seen in [link] and [link]. 


Note: 

Making Connections: Dispersion 

Any type of wave can exhibit dispersion. Sound waves, all types of electromagnetic 
waves, and water waves can be dispersed according to wavelength. Dispersion occurs 
whenever the speed of propagation depends on wavelength, thus separating and 
spreading out various wavelengths. Dispersion may require special circumstances and 
can result in spectacular displays such as in the production of a rainbow. This is also 


true for sound, since all frequencies ordinarily travel at the same speed. If you listen to 
sound through a long tube, such as a vacuum cleaner hose, you can easily hear it is 
dispersed by interaction with the tube. Dispersion, in fact, can reveal a great deal about 
what the wave has encountered that disperses its wavelengths. The dispersion of 
electromagnetic radiation from outer space, for example, has revealed much about what 
exists between the stars—the so-called empty space. 


Red Orange Yellow Green Blue Violet 

(660 (610 (580 (550 (470 (410 
Medium nm) nm) nm) nm) nm) nm) 
Water 1331 1.332 1.333 1.335 1.338 1.342 
Diamond 2.410 2.415 2.417 2.426 2.444 2.458 
Glass, 1512 | 1.514 1.518 1.519 1524 ‘1,530 
crown 
Glass, flint 1.662 1.665 1.667 1.674 1.684 1.698 
Polystyrene 1.488 1.490 1.492 1.493 1.499 1.506 
Quartz, 1.455 1.456 1.458 1.459 1.462 1.468 
fused 


Index of Refraction n in Selected Media at Various Wavelengths 


Glass prism 


Incident >>. 
light i 


Pure dA 


Glass prism 


Incident 
white light 
Red 
(760 nm) 


Violet 
(380 nm) 


(b) 


(a) A pure wavelength 
of light falls onto a 
prism and is refracted 
at both surfaces. (b) 
White light is 
dispersed by the prism 
(shown exaggerated). 
Since the index of 
refraction varies with 
wavelength, the angles 
of refraction vary with 
wavelength. A 
sequence of red to 
violet is produced, 
because the index of 
refraction increases 
steadily with 
decreasing 
wavelength. 


Rainbows are produced by a combination of refraction and reflection. You may have 
noticed that you see a rainbow only when you look away from the sun. Light enters a 
drop of water and is reflected from the back of the drop, as shown in [link]. The light is 
refracted both as it enters and as it leaves the drop. Since the index of refraction of water 


varies with wavelength, the light is dispersed, and a rainbow is observed, as shown in 
[link] (a). (There is no dispersion caused by reflection at the back surface, since the law 
of reflection does not depend on wavelength.) The actual rainbow of colors seen by an 
observer depends on the myriad of rays being refracted and reflected toward the 
observer’s eyes from numerous drops of water. The effect is most spectacular when the 
background is dark, as in stormy weather, but can also be observed in waterfalls and 
lawn sprinklers. The arc of a rainbow comes from the need to be looking at a specific 
angle relative to the direction of the sun, as illustrated in [link] (b). (If there are two 
reflections of light within the water drop, another “secondary” rainbow is produced. 
This rare event produces an arc that lies above the primary rainbow arc—see [link] (c).) 


Note: 
Rainbows 
Rainbows are produced by a combination of refraction and reflection. 


Water 
droplet 


Sunlight 


Violet 


Part of the light falling 
on this water drop 
enters and is reflected 
from the back of the 
drop. This light is 
refracted and 
dispersed both as it 
enters and as it leaves 
the drop. 


(a) Different colors emerge in 
different directions, and so you 
must look at different locations 

to see the various colors of a 

rainbow. (b) The arc of a 
rainbow results from the fact 
that a line between the observer 
and any point on the arc must 
make the correct angle with the 
parallel rays of sunlight to 

receive the refracted rays. (c) 


Double rainbow. (credit: 
Nicholas, Wikimedia 
Commons) 


Dispersion may produce beautiful rainbows, but it can cause problems in optical 
systems. White light used to transmit messages in a fiber is dispersed, spreading out in 
time and eventually overlapping with other messages. Since a laser produces a nearly 
pure wavelength, its light experiences little dispersion, an advantage over white light for 
transmission of information. In contrast, dispersion of electromagnetic waves coming to 
us from outer space can be used to determine the amount of matter they pass through. 
As with many phenomena, dispersion can be useful or a nuisance, depending on the 
situation and our human goals. 


Note: 

PhET Explorations: Geometric Optics 

How does a lens form an image? See how light rays are refracted by a lens. Watch how 
the image changes when you adjust the focal length of the lens, move the object, move 
the lens, or move the screen. 


https://phet.colorado.edu/sims/geometric-optics/geometric-optics en.html 


Section Summary 


¢ The spreading of white light into its full spectrum of wavelengths is called 
dispersion. 

e Rainbows are produced by a combination of refraction and reflection and involve 
the dispersion of sunlight into a continuous distribution of colors. 

e Dispersion produces beautiful rainbows but also causes problems in certain optical 
systems. 


Problems & Exercises 


Exercise: 


Problem: 


(a) What is the ratio of the speed of red light to violet light in diamond, based on 
[link]? (b) What is this ratio in polystyrene? (c) Which is more dispersive? 


Exercise: 
Problem: 


A beam of white light goes from air into water at an incident angle of 75.0°. At 
what angles are the red (660 nm) and violet (410 nm) parts of the light refracted? 


Solution: 


46.5°, red; 46.0°, violet 
Exercise: 
Problem: 
By how much do the critical angles for red (660 nm) and violet (410 nm) light 
differ in a diamond surrounded by air? 
Exercise: 
Problem: 
(a) A narrow beam of light containing yellow (580 nm) and green (550 nm) 
wavelengths goes from polystyrene to air, striking the surface at a 30.0° incident 


angle. What is the angle between the colors when they emerge? (b) How far would 
they have to travel to be separated by 1.00 mm? 


Solution: 
(a) 0.043° 


(b) 1.33 m 
Exercise: 
Problem: 
A parallel beam of light containing orange (610 nm) and violet (410 nm) 


wavelengths goes from fused quartz to water, striking the surface between them at 
a 60.0° incident angle. What is the angle between the two colors in water? 


Exercise: 
Problem: 
A ray of 610 nm light goes from air into fused quartz at an incident angle of 55.0°. 


At what incident angle must 470 nm light enter flint glass to have the same angle 
of refraction? 


Solution: 


TL 

Exercise: 
Problem: 
A narrow beam of light containing red (660 nm) and blue (470 nm) wavelengths 
travels from air through a 1.00 cm thick flat piece of crown glass and back to air 
again. The beam strikes at a 30.0° incident angle. (a) At what angles do the two 


colors emerge? (b) By what distance are the red and blue separated when they 
emerge? 


Exercise: 
Problem: 
A narrow beam of white light enters a prism made of crown glass at a 45.0° 


incident angle, as shown in [link]. At what angles, @p and 0@y, do the red (660 nm) 
and violet (410 nm) components of the light emerge from the prism? 


Incident 


light Red (660 nm) 


Violet 
(410 nm) 


60° 
This prism will disperse 
the white light into a 
rainbow of colors. The 
incident angle is 45.0°, 
and the angles at which 
the red and violet light 
emerge are Op and Oy. 


Solution: 


53.5°, red; 55.2°, violet 


Glossary 


dispersion 
spreading of white light into its full spectrum of wavelengths 


rainbow 
dispersion of sunlight into a continuous distribution of colors according to 
wavelength, produced by the refraction and reflection of sunlight by water droplets 
in the sky 


Image Formation by Lenses 


e List the rules for ray tracking for thin lenses. 
e Illustrate the formation of images using the technique of ray tracking. 
¢ Determine power of a lens given the focal length. 


Lenses are found in a huge array of optical instruments, ranging from a 
simple magnifying glass to the eye to a camera’s zoom lens. In this section, 
we will use the law of refraction to explore the properties of lenses and how 
they form images. 


The word lens derives from the Latin word for a lentil bean, the shape of 
which is similar to the convex lens in [link]. The convex lens shown has 
been shaped so that all light rays that enter it parallel to its axis cross one 
another at a single point on the opposite side of the lens. (The axis is 
defined to be a line normal to the lens at its center, as shown in [link].) Such 
a lens is called a converging (or convex) lens for the converging effect it 
has on light rays. An expanded view of the path of one ray through the lens 
is shown, to illustrate how the ray changes direction both as it enters and as 
it leaves the lens. Since the index of refraction of the lens is greater than 
that of air, the ray moves towards the perpendicular as it enters and away 
from the perpendicular as it leaves. (This is in accordance with the law of 
refraction.) Due to the lens’s shape, light is thus bent toward the axis at both 
surfaces. The point at which the rays cross is defined to be the focal point F 
of the lens. The distance from the center of the lens to its focal point is 
defined to be the focal length of the lens. [link] shows how a converging 
lens, such as that in a magnifying glass, can converge the nearly parallel 
light rays from the sun to a small spot. 


Rays of light entering a converging lens parallel to its 
axis converge at its focal point F. (Ray 2 lies on the 
axis of the lens.) The distance from the center of the 
lens to the focal point is the lens’s focal length f. An 
expanded view of the path taken by ray 1 shows the 

perpendiculars and the angles of incidence and 
refraction at both surfaces. 


Note: 

Converging or Convex Lens 

The lens in which light rays that enter it parallel to its axis cross one 
another at a single point on the opposite side with a converging effect is 
called converging lens. 


Note: 

Focal Point F 

The point at which the light rays cross is called the focal point F of the 
lens. 


Note: 

Focal Length f 

The distance from the center of the lens to its focal point is called focal 
length f. 


Sunlight focused by a 
converging 
magnifying glass can 
burn paper. Light rays 
from the sun are 
nearly parallel and 
cross at the focal point 
of the lens. The more 
powerful the lens, the 
closer to the lens the 
rays will cross. 


The greater effect a lens has on light rays, the more powerful it is said to be. 
For example, a powerful converging lens will focus parallel light rays closer 
to itself and will have a smaller focal length than a weak lens. The light will 
also focus into a smaller and more intense spot for a more powerful lens. 
The power FP of a lens is defined to be the inverse of its focal length. In 
equation form, this is 

Equation: 


Note: 

Power P 

The power FP of a lens is defined to be the inverse of its focal length. In 
equation form, this is 

Equation: 


where f is the focal length of the lens, which must be given in meters (and 
not cm or mm). The power of a lens P has the unit diopters (D), provided 
that the focal length is given in meters. That is, 1 D = 1/m, or 1 Tes 
(Note that this power (optical power, actually) is not the same as power in 
watts defined in Work, Energy, and Energy Resources. It is a concept 
related to the effect of optical devices on light.) Optometrists prescribe 
common spectacles and contact lenses in units of diopters. 


Example: 

What is the Power of a Common Magnifying Glass? 

Suppose you take a magnifying glass out on a sunny day and you find that 
it concentrates sunlight to a small spot 8.00 cm away from the lens. What 
are the focal length and power of the lens? 

Strategy 

The situation here is the same as those shown in [link] and [link]. The Sun 
is so far away that the Sun’s rays are nearly parallel when they reach Earth. 
The magnifying glass is a convex (or converging) lens, focusing the nearly 
parallel rays of sunlight. Thus the focal length of the lens is the distance 
from the lens to the spot, and its power is the inverse of this distance (in 
m). 

Solution 

The focal length of the lens is the distance from the center of the lens to the 
spot, given to be 8.00 cm. Thus, 

Equation: 


f = 8.00 cm. 


To find the power of the lens, we must first convert the focal length to 
meters; then, we substitute this value into the equation for power. This 


gives 
Equation: 
1 il 
eS SS 15 
f 0.0800 m 
Discussion 


This is a relatively powerful lens. The power of a lens in diopters should 
not be confused with the familiar concept of power in watts. It is an 
unfortunate fact that the word “power” is used for two completely different 
concepts. If you examine a prescription for eyeglasses, you will note lens 
powers given in diopters. If you examine the label on a motor, you will 
note energy consumption rate given as a power in watts. 


[link] shows a concave lens and the effect it has on rays of light that enter it 
parallel to its axis (the path taken by ray 2 in the figure is the axis of the 
lens). The concave lens is a diverging lens, because it causes the light rays 
to bend away (diverge) from its axis. In this case, the lens has been shaped 
so that all light rays entering it parallel to its axis appear to originate from 
the same point, F, defined to be the focal point of a diverging lens. The 
distance from the center of the lens to the focal point is again called the 
focal length f of the lens. Note that the focal length and power of a 
diverging lens are defined to be negative. For example, if the distance to F’ 
in [link] is 5.00 cm, then the focal length is f = —5.00 cm and the power of 
the lens is P = —20 D. An expanded view of the path of one ray through 
the lens is shown in the figure to illustrate how the shape of the lens, 
together with the law of refraction, causes the ray to follow its particular 
path and be diverged. 


Rays of light entering a 
diverging lens parallel to 
its axis are diverged, and 
all appear to originate at 
its focal point F. The 
dashed lines are not rays 
—they indicate the 
directions from which the 
rays appear to come. The 
focal length f of a 
diverging lens is negative. 
An expanded view of the 
path taken by ray 1 shows 
the perpendiculars and 
the angles of incidence 
and refraction at both 
surfaces. 


Note: 
Diverging Lens 


A lens that causes the light rays to bend away from its axis is called a 
diverging lens. 


As noted in the initial discussion of the law of refraction in ‘The Law of 
Refraction, the paths of light rays are exactly reversible. This means that the 
direction of the arrows could be reversed for all of the rays in [link] and 
[link]. For example, if a point light source is placed at the focal point of a 
convex lens, as shown in [link], parallel light rays emerge from the other 
side. 


A small light source, 
like a light bulb 
filament, placed at the 
focal point of a convex 
lens, results in parallel 
rays of light emerging 
from the other side. 
The paths are exactly 
the reverse of those 
shown in [link]. This 
technique is used in 
lighthouses and 
sometimes in traffic 
lights to produce a 
directional beam of 
light from a source 
that emits light in all 
directions. 


Ray Tracing and Thin Lenses 


Ray tracing is the technique of determining or following (tracing) the paths 
that light rays take. For rays passing through matter, the law of refraction is 
used to trace the paths. Here we use ray tracing to help us understand the 
action of lenses in situations ranging from forming images on film to 
magnifying small print to correcting nearsightedness. While ray tracing for 
complicated lenses, such as those found in sophisticated cameras, may 
require computer techniques, there is a set of simple rules for tracing rays 
through thin lenses. A thin lens is defined to be one whose thickness allows 
rays to refract, as illustrated in [link], but does not allow properties such as 
dispersion and aberrations. An ideal thin lens has two refracting surfaces 
but the lens is thin enough to assume that light rays bend only once. A thin 
symmetrical lens has two focal points, one on either side and both at the 
same distance from the lens. (See [link].) Another important characteristic 
of a thin lens is that light rays through its center are deflected by a 
negligible amount, as seen in [link]. 


Note: 

Thin Lens 

A thin lens is defined to be one whose thickness allows rays to refract but 
does not allow properties such as dispersion and aberrations. 


Note: 

Take-Home Experiment: A Visit to the Optician 

Look through your eyeglasses (or those of a friend) backward and forward 
and comment on whether they act like thin lenses. 


Thin lenses have the same 
focal length on either 
side. (a) Parallel light 

rays entering a 
converging lens from the 
right cross at its focal 
point on the left. (b) 
Parallel light rays 
entering a diverging lens 
from the right seem to 
come from the focal point 
on the right. 


The light 
ray 
through 
the center 
of a thin 
lens is 
deflected 
bya 
negligible 
amount 
and is 
assumed 
to emerge 
parallel to 
its 
original 
path 
(shown as 
a shaded 
line). 


Using paper, pencil, and a straight edge, ray tracing can accurately describe 
the operation of a lens. The rules for ray tracing for thin lenses are based on 
the illustrations already discussed: 


1. A ray entering a converging lens parallel to its axis passes through the 
focal point F of the lens on the other side. (See rays 1 and 3 in [link].) 

2. A ray entering a diverging lens parallel to its axis seems to come from 
the focal point F. (See rays 1 and 3 in [link].) 

3. A ray passing through the center of either a converging or a diverging 
lens does not change direction. (See [link], and see ray 2 in [link] and 
[link].) 

4. A ray entering a converging lens through its focal point exits parallel 
to its axis. (The reverse of rays 1 and 3 in [link].) 

5. A ray that enters a diverging lens by heading toward the focal point on 
the opposite side exits parallel to the axis. (The reverse of rays 1 and 3 
in [link].) 


Note: 
Rules for Ray Tracing 


1. A ray entering a converging lens parallel to its axis passes through the 
focal point F of the lens on the other side. 

2. A ray entering a diverging lens parallel to its axis seems to come from 
the focal point F. 

3. A ray passing through the center of either a converging or a diverging 
lens does not change direction. 

4. A ray entering a converging lens through its focal point exits parallel 
to its axis. 

5. A ray that enters a diverging lens by heading toward the focal point on 
the opposite side exits parallel to the axis. 


Image Formation by Thin Lenses 


In some circumstances, a lens forms an obvious image, such as when a 

movie projector casts an image onto a screen. In other cases, the image is 
less obvious. Where, for example, is the image formed by eyeglasses? We 
will use ray tracing for thin lenses to illustrate how they form images, and 
we will develop equations to describe the image formation quantitatively. 


Consider an object some distance away from a converging lens, as shown in 
[link]. To find the location and size of the image formed, we trace the paths 
of selected light rays originating from one point on the object, in this case 
the top of the person’s head. The figure shows three rays from the top of the 
object that can be traced using the ray tracing rules given above. (Rays 
leave this point going in many directions, but we concentrate on only a few 
with paths that are easy to trace.) The first ray is one that enters the lens 
parallel to its axis and passes through the focal point on the other side (rule 
1). The second ray passes through the center of the lens without changing 
direction (rule 3). The third ray passes through the nearer focal point on its 
way into the lens and leaves the lens parallel to its axis (rule 4). The three 
rays cross at the same point on the other side of the lens. The image of the 
top of the person’s head is located at this point. All rays that come from the 
Same point on the top of the person’s head are refracted in such a way as to 
cross at the point shown. Rays from another point on the object, such as her 
belt buckle, will also cross at another common point, forming a complete 
image, as shown. Although three rays are traced in [link], only two are 
necessary to locate the image. It is best to trace rays for which there are 
simple ray tracing rules. Before applying ray tracing to other situations, let 
us consider the example shown in [link] in more detail. 


Ray tracing is used to 
locate the image formed 
by a lens. Rays 
originating from the same 
point on the object are 
traced—the three chosen 
rays each follow one of 
the rules for ray tracing, 
so that their paths are 
easy to determine. The 
image is located at the 
point where the rays 


cross. In this case, a real 
image—one that can be 
projected on a screen—is 
formed. 


The image formed in [link] is a real image, meaning that it can be 
projected. That is, light rays from one point on the object actually cross at 
the location of the image and can be projected onto a screen, a piece of film, 
or the retina of an eye, for example. [link] shows how such an image would 
be projected onto film by a camera lens. This figure also shows how a real 
image is projected onto the retina by the lens of an eye. Note that the image 
is there whether it is projected onto a screen or not. 


Note: 

Real Image 

The image in which light rays from one point on the object actually cross 
at the location of the image and can be projected onto a screen, a piece of 
film, or the retina of an eye is called a real image. 


(b) 


Real images can be 
projected. (a) A real 
image of the person is 
projected onto film. (b) 
The converging nature of 
the multiple surfaces that 
make up the eye result in 
the projection of a real 
image on the retina. 


Several important distances appear in [link]. We define d, to be the object 
distance, the distance of an object from the center of a lens. Image distance 
d; is defined to be the distance of the image from the center of a lens. The 
height of the object and height of the image are given the symbols hy and h; 
, respectively. Images that appear upright relative to the object have heights 
that are positive and those that are inverted have negative heights. Using the 
rules of ray tracing and making a scale drawing with paper and pencil, like 
that in [link], we can accurately describe the location and size of an image. 
But the real benefit of ray tracing is in visualizing how images are formed 
in a variety of situations. To obtain numerical information, we use a pair of 


equations that can be derived from a geometric analysis of ray tracing for 
thin lenses. The thin lens equations are 


Equation: 
1 i 
do d; 7 f 
and 
Equation: 
hj di 
— SS Sn. 
ho do 


We define the ratio of image height to object height (h;/h.) to be the 
magnification m. (The minus sign in the equation above will be discussed 
shortly.) The thin lens equations are broadly applicable to all situations 
involving thin lenses (and “thin” mirrors, as we will see later). We will 
explore many features of image formation in the following worked 
examples. 


Note: 

Image Distance 

The distance of the image from the center of the lens is called image 
distance. 


Note: 
Thin Lens Equations and Magnification 
Equation: 
1 s pepe 
dy d; f 


Equation: 


Example: 

Finding the Image of a Light Bulb Filament by Ray Tracing and by the 
Thin Lens Equations 

A clear glass light bulb is placed 0.750 m from a convex lens having a 
0.500 m focal length, as shown in [link]. Use ray tracing to get an 
approximate location for the image. Then use the thin lens equations to 
calculate (a) the location of the image and (b) its magnification. Verify that 
ray tracing and the thin lens equations produce consistent results. 


Light bulb — 


>» f=0.50m 


d, = 1.50 m——___3j 


A light bulb placed 0.750 m from a lens having 
a 0.500 m focal length produces a real image on 
a poster board as discussed in the example 
above. Ray tracing predicts the image location 
and size. 


Strategy and Concept 

Since the object is placed farther away from a converging lens than the 
focal length of the lens, this situation is analogous to those illustrated in 
[link] and [link]. Ray tracing to scale should produce similar results for d;. 
Numerical solutions for d,; and m can be obtained using the thin lens 
equations, noting that d, = 0.750 m and f = 0.500 m. 


Solutions (Ray tracing) 

The ray tracing to scale in [link] shows two rays from a point on the bulb’s 
filament crossing about 1.50 m on the far side of the lens. Thus the image 
distance d; is about 1.50 m. Similarly, the image height based on ray 
tracing is greater than the object height by about a factor of 2, and the 
image is inverted. Thus m is about —2. The minus sign indicates that the 
image is inverted. 

The thin lens equations can be used to find d; from the given information: 
Equation: 


1 i iy! 
d, d; 7 i 
Rearranging to isolate d; gives 
Equation: 
1 1 1 
d; 7 f d. 


Entering known quantities gives a value for 1/d;: 
Equation: 


1 1 1 0.667 


ad  0500m 0.750m  m 


This must be inverted to find d;: 
Equation: 


m 
d; = — = ]1.50m. 
0.667 


Note that another way to find d; is to rearrange the equation: 
Equation: 


= 
a oP dy 


This yields the equation for the image distance as: 


Equation: 


Note that there is no inverting here. 

The thin lens equations can be used to find the magnification ™m, since both 
d; and d, are known. Entering their values gives 

Equation: 


Discussion 

Note that the minus sign causes the magnification to be negative when the 
image is inverted. Ray tracing and the use of the thin lens equations 
produce consistent results. The thin lens equations give the most precise 
results, being limited only by the accuracy of the given information. Ray 
tracing is limited by the accuracy with which you can draw, but it is highly 
useful both conceptually and visually. 


Real images, such as the one considered in the previous example, are 
formed by converging lenses whenever an object is farther from the lens 
than its focal length. This is true for movie projectors, cameras, and the eye. 
We shall refer to these as case 1 images. A case 1 image is formed when 

d, > f and f is positive, as in [link](a). (A summary of the three cases or 
types of image formation appears at the end of this section.) 


A different type of image is formed when an object, such as a person's face, 
is held close to a convex lens. The image is upright and larger than the 
object, as seen in [link](b), and so the lens is called a magnifier. If you 
slowly pull the magnifier away from the face, you will see that the 
magnification steadily increases until the image begins to blur. Pulling the 
magnifier even farther away produces an inverted image as seen in [link] 
(a). The distance at which the image blurs, and beyond which it inverts, is 
the focal length of the lens. To use a convex lens as a magnifier, the object 


must be closer to the converging lens than its focal length. This is called a 
case 2 image. A case 2 image is formed when d, < f and f is positive. 


(a) When a converging 
lens is held farther 
away from the face 
than the lens’s focal 
length, an inverted 

image is formed. This 

is acase 1 image. Note 
that the image is in 
focus but the face is 
not, because the image 
is much closer to the 
camera taking this 
photograph than the 
face. (credit: 
DaMongMan, Flickr) 
(b) A magnified image 


of a face is produced 
by placing it closer to 
the converging lens 
than its focal length. 
This is a case 2 image. 
(credit: Casey Fleser, 
Flickr) 


[link] uses ray tracing to show how an image is formed when an object is 
held closer to a converging lens than its focal length. Rays coming from a 
common point on the object continue to diverge after passing through the 
lens, but all appear to originate from a point at the location of the image. 
The image is on the same side of the lens as the object and is farther away 
from the lens than the object. This image, like all case 2 images, cannot be 
projected and, hence, is called a virtual image. Light rays only appear to 
originate at a virtual image; they do not actually pass through that location 
in space. A screen placed at the location of a virtual image will receive only 
diffuse light from the object, not focused rays from the lens. Additionally, a 
screen placed on the opposite side of the lens will receive rays that are still 
diverging, and so no image will be projected on it. We can see the 
magnified image with our eyes, because the lens of the eye converges the 
rays into a real image projected on our retina. Finally, we note that a virtual 
image is upright and larger than the object, meaning that the magnification 
is positive and greater than 1. 


Ray tracing predicts the 
image location and size 
for an object held closer 
to a converging lens than 
its focal length. Ray 1 
enters parallel to the axis 
and exits through the 
focal point on the 
opposite side, while ray 2 
passes through the center 
of the lens without 
changing path. The two 
rays continue to diverge 
on the other side of the 
lens, but both appear to 
come from a common 
point, locating the 
upright, magnified, 


virtual image. This is a 
case 2 image. 


Note: 

Virtual Image 

An image that is on the same side of the lens as the object and cannot be 
projected on a screen is called a virtual image. 


Example: 

Image Produced by a Magnifying Glass 

Suppose the book page in [link] (a) is held 7.50 cm from a convex lens of 
focal length 10.0 cm, such as a typical magnifying glass might have. What 
magnification is produced? 

Strategy and Concept 

We are given that d, = 7.50 cm and f = 10.0 cm, so we have a situation 
where the object is placed closer to the lens than its focal length. We 
therefore expect to get a case 2 virtual image with a positive magnification 
that is greater than 1. Ray tracing produces an image like that shown in 
[link], but we will use the thin lens equations to get numerical solutions in 
this example. 

Solution 

To find the magnification m, we try to use magnification equation, 

m = —d;/d . We do not have a value for d;, so that we must first find the 
location of the image using lens equation. (The procedure is the same as 
followed in the preceding example, where d, and f were known.) 
Rearranging the magnification equation to isolate d; gives 

Equation: 


Entering known values, we obtain a value for 1/d;: 
Equation: 


1 1 1 0.0333 


d; ~ 10.0cm  7.50cm cm 
This must be inverted to find d;: 
Equation: 
cm 
d; = — 0.0333 —30.0 cm. 


Now the thin lens equation can be used to find the magnification m, since 
both d; and d, are known. Entering their values gives 
Equation: 


dj —30. 
Pe i ee CE 
d, 7.50 cm 


Discussion 

A number of results in this example are true of all case 2 images, as well as 
being consistent with [link]. Magnification is indeed positive (as 
predicted), meaning the image is upright. The magnification is also greater 
than 1, meaning that the image is larger than the object—in this case, by a 
factor of 4. Note that the image distance is negative. This means the image 
is on the same side of the lens as the object. Thus the image cannot be 
projected and is virtual. (Negative values of d; occur for virtual images.) 
The image is farther from the lens than the object, since the image distance 
is greater in magnitude than the object distance. The location of the image 
is not obvious when you look through a magnifier. In fact, since the image 
is bigger than the object, you may think the image is closer than the object. 
But the image is farther away, a fact that is useful in correcting 
farsightedness, as we shall see in a later section. 


A third type of image is formed by a diverging or concave lens. Try looking 
through eyeglasses meant to correct nearsightedness. (See [link].) You will 
see an image that is upright but smaller than the object. This means that the 
magnification is positive but less than 1. The ray diagram in [link] shows 
that the image is on the same side of the lens as the object and, hence, 


cannot be projected—it is a virtual image. Note that the image is closer to 
the lens than the object. This is a case 3 image, formed for any object by a 
negative focal length or diverging lens. 


A car viewed through a 
concave or diverging lens 
looks upright. This is a 
case 3 image. (credit: 
Daniel Oines, Flickr) 


Ray tracing predicts the image 
location and size for a concave or 
diverging lens. Ray 1 enters 
parallel to the axis and is bent so 
that it appears to originate from the 
focal point. Ray 2 passes through 
the center of the lens without 
changing path. The two rays 
appear to come from a common 
point, locating the upright image. 
This is a case 3 image, which is 
closer to the lens than the object 
and smaller in height. 


Example: 


Image Produced by a Concave Lens 

Suppose an object such as a book page is held 7.50 cm from a concave lens 
of focal length —10.0 cm. Such a lens could be used in eyeglasses to correct 
pronounced nearsightedness. What magnification is produced? 

Strategy and Concept 

This example is identical to the preceding one, except that the focal length 
is negative for a concave or diverging lens. The method of solution is thus 
the same, but the results are different in important ways. 

Solution 

To find the magnification m, we must first find the image distance d; using 
thin lens equation 


Equation: 
ae 1 
ab Pe tla 
or its alternative rearrangement 
Equation: 
d 
d= Fda 
do aa f 


We are given that f = —10.0 cm and d, = 7.50 cm. Entering these yields 
a value for 1/d;: 


Equation: 
1 1 1 __ —0.2333 
d —10.0cm 7.50cm cm ~ 
This must be inverted to find d;: 
Equation: 
ea oc 
0.2333 
Or 


Equation: 


_(7.5)(-10) ae 
Se IC) 75/17.5 = —4.29 cm. 


Now the magnification equation can be used to find the magnification m, 
since both d; and d, are known. Entering their values gives 
Equation: 


d; —4,.29 cm 
Se al 
ar SET ae 


Discussion 

A number of results in this example are true of all case 3 images, as well as 
being consistent with [link]. Magnification is positive (as predicted), 
meaning the image is upright. The magnification is also less than 1, 
meaning the image is smaller than the object—in this case, a little over half 
its size. The image distance is negative, meaning the image is on the same 
side of the lens as the object. (The image is virtual.) The image is closer to 
the lens than the object, since the image distance is smaller in magnitude 
than the object distance. The location of the image is not obvious when you 
look through a concave lens. In fact, since the image is smaller than the 
object, you may think it is farther away. But the image is closer than the 
object, a fact that is useful in correcting nearsightedness, as we shall see in 
a later section. 


[link] summarizes the three types of images formed by single thin lenses. 
These are referred to as case 1, 2, and 3 images. Convex (converging) 
lenses can form either real or virtual images (cases 1 and 2, respectively), 
whereas concave (diverging) lenses can form only virtual images (always 
case 3). Real images are always inverted, but they can be either larger or 
smaller than the object. For example, a slide projector forms an image 
larger than the slide, whereas a camera makes an image smaller than the 
object being photographed. Virtual images are always upright and cannot be 
projected. Virtual images are larger than the object only in case 2, where a 
convex lens is used. The virtual image produced by a concave lens is 


always smaller than the object—a case 3 image. We can see and photograph 
virtual images only by using an additional lens to form a real image. 


Formed Image 
Type when type d; m 
f 
Case aay ah : 
1 positive, real positive negative 
d, >f 
ci - 
positive 
ae positive, virtual negative 
2 m>1 
dy < f 
C f positive 
ase 
irtual negative 
3 virtua gativ ee 
negative 


Three Types of Images Formed By Thin Lenses 


In Image Formation by Mirrors, we shall see that mirrors can form exactly 
the same types of images as lenses. 


Note: 


Take-Home Experiment: Concentrating Sunlight 

Find several lenses and determine whether they are converging or 
diverging. In general those that are thicker near the edges are diverging and 
those that are thicker near the center are converging. On a bright sunny day 
take the converging lenses outside and try focusing the sunlight onto a 
piece of paper. Determine the focal lengths of the lenses. Be careful 
because the paper may start to burn, depending on the type of lens you 
have selected. 


Problem-Solving Strategies for Lenses 


Step 1. Examine the situation to determine that image formation by a lens is 
involved. 


Step 2. Determine whether ray tracing, the thin lens equations, or both are 
to be employed. A sketch is very useful even if ray tracing is not 
specifically required by the problem. Write symbols and values on the 
sketch. 


Step 3. Identify exactly what needs to be determined in the problem 
(identify the unknowns). 


Step 4. Make alist of what is given or can be inferred from the problem as 
stated (identify the knowns). It is helpful to determine whether the situation 
involves a case 1, 2, or 3 image. While these are just names for types of 
images, they have certain characteristics (given in [link]) that can be of 
great use in solving problems. 


Step 5. If ray tracing is required, use the ray tracing rules listed near the 
beginning of this section. 


Step 6. Most quantitative problems require the use of the thin lens 
equations. These are solved in the usual manner by substituting knowns and 
solving for unknowns. Several worked examples serve as guides. 


Step 7. Check to see if the answer is reasonable: Does it make sense? If you 
have identified the type of image (case 1, 2, or 3), you should assess 
whether your answer is consistent with the type of image, magnification, 
and so on. 


Note: 

Misconception Alert 

We do not realize that light rays are coming from every part of the object, 
passing through every part of the lens, and all can be used to form the final 
image. 

We generally feel the entire lens, or mirror, is needed to form an image. 
Actually, half a lens will form the same, though a fainter, image. 


Section Summary 


e Light rays entering a converging lens parallel to its axis cross one 
another at a single point on the opposite side. 

e For a converging lens, the focal point is the point at which converging 
light rays cross; for a diverging lens, the focal point is the point from 
which diverging light rays appear to originate. 

e The distance from the center of the lens to its focal point is called the 
focal length f. 

¢ Power P of a lens is defined to be the inverse of its focal length, 
P=H+ 

f 

e A lens that causes the light rays to bend away from its axis is called a 
diverging lens. 

e Ray tracing is the technique of graphically determining the paths that 
light rays take. 

e The image in which light rays from one point on the object actually 
cross at the location of the image and can be projected onto a screen, a 
piece of film, or the retina of an eye is called a real image. 

e Thin lens equations are z + = = + and so = =e =m 


‘Oo 


(magnification). 


e The distance of the image from the center of the lens is called image 
distance. 

e An image that is on the same side of the lens as the object and cannot 
be projected on a screen is called a virtual image. 


Conceptual Questions 


Exercise: 
Problem: 
It can be argued that a flat piece of glass, such as in a window, is like a 


lens with an infinite focal length. If so, where does it form an image? 
That is, how are d; and d, related? 


Exercise: 
Problem: 
You can often see a reflection when looking at a sheet of glass, 


particularly if it is darker on the other side. Explain why you can often 
see a double image in such circumstances. 


Exercise: 
Problem: 
When you focus a camera, you adjust the distance of the lens from the 


film. If the camera lens acts like a thin lens, why can it not be a fixed 
distance from the film for both near and distant objects? 


Exercise: 
Problem: 
A thin lens has two focal points, one on either side, at equal distances 
from its center, and should behave the same for light entering from 


either side. Look through your eyeglasses (or those of a friend) 
backward and forward and comment on whether they are thin lenses. 


Exercise: 


Problem: 


Will the focal length of a lens change when it is submerged in water? 
Explain. 


Problems & Exercises 


Exercise: 
Problem: 
What is the power in diopters of a camera lens that has a 50.0 mm 
focal length? 
Exercise: 
Problem: 


Your camera’s zoom lens has an adjustable focal length ranging from 
80.0 to 200 mm. What is its range of powers? 


Solution: 


5.00 to 12.5 D 
Exercise: 
Problem: 
What is the focal length of 1.75 D reading glasses found on the rack in 
a pharmacy? 
Exercise: 
Problem: 


You note that your prescription for new eyeglasses is —4.50 D. What 
will their focal length be? 


Solution: 


—0.222 m 
Exercise: 
Problem: 
How far from the lens must the film in a camera be, if the lens has a 
35.0 mm focal length and is being used to photograph a flower 75.0 


cm away? Explicitly show how you follow the steps in the Problem- 
Solving Strategy for lenses. 


Exercise: 
Problem: 
A certain slide projector has a 100 mm focal length lens. (a) How far 
away is the screen, if a slide is placed 103 mm from the lens and 
produces a sharp image? (b) If the slide is 24.0 by 36.0 mm, what are 


the dimensions of the image? Explicitly show how you follow the 
steps in the Problem-Solving Strategy for lenses. 


Solution: 
(a) 3.43 m 


(b) 0.800 by 1.20 m 
Exercise: 


Problem: 


A doctor examines a mole with a 15.0 cm focal length magnifying 
glass held 13.5 cm from the mole (a) Where is the image? (b) What is 
its magnification? (c) How big is the image of a 5.00 mm diameter 
mole? 


Solution: 


(a) —1.35 m (on the object side of the lens). 


(b) +10.0 


(c) 5.00 cm 
Exercise: 
Problem: 


How far from a piece of paper must you hold your father’s 2.25 D 
reading glasses to try to burn a hole in the paper with sunlight? 


Solution: 


44.4 cm 

Exercise: 
Problem: 
A camera with a 50.0 mm focal length lens is being used to 
photograph a person standing 3.00 m away. (a) How far from the lens 
must the film be? (b) If the film is 36.0 mm high, what fraction of a 


1.75 m tall person will fit on it? (c) Discuss how reasonable this seems, 
based on your experience in taking or posing for photographs. 


Exercise: 


Problem: 


A camera lens used for taking close-up photographs has a focal length 
of 22.0 mm. The farthest it can be placed from the film is 33.0 mm. (a) 
What is the closest object that can be photographed? (b) What is the 
magnification of this closest object? 


Solution: 
(a) 6.60 cm 


(b) -0.333 


Exercise: 


Problem: 


Suppose your 50.0 mm focal length camera lens is 51.0 mm away 
from the film in the camera. (a) How far away is an object that is in 
focus? (b) What is the height of the object if its image is 2.00 cm high? 


Exercise: 
Problem: 
(a) What is the focal length of a magnifying glass that produces a 
magnification of 3.00 when held 5.00 cm from an object, such as a rare 
coin? (b) Calculate the power of the magnifier in diopters. (c) Discuss 
how this power compares to those for store-bought reading glasses 


(typically 1.0 to 4.0 D). Is the magnifier’s power greater, and should it 
be? 


Solution: 
(a) +7.50 cm 
(b) 13.3 D 


(c) Much greater 
Exercise: 
Problem: 
What magnification will be produced by a lens of power —4.00 D (such 
as might be used to correct myopia) if an object is held 25.0 cm away? 
Exercise: 
Problem: 
In [link], the magnification of a book held 7.50 cm from a 10.0 cm 
focal length lens was found to be 3.00. (a) Find the magnification for 
the book when it is held 8.50 cm from the magnifier. (b) Do the same 


for when it is held 9.50 cm from the magnifier. (c) Comment on the 
trend in m as the object distance increases as in these two calculations. 


Solution: 
(a) +6.67 
(b) +20.0 
(c) The magnification increases without limit (to infinity) as the object 
distance increases to the limit of the focal distance. 
Exercise: 
Problem: 
Suppose a 200 mm focal length telephoto lens is being used to 
photograph mountains 10.0 km away. (a) Where is the image? (b) 


What is the height of the image of a 1000 m high cliff on one of the 
mountains? 


Exercise: 
Problem: 
A camera with a 100 mm focal length lens is used to photograph the 
sun and moon. What is the height of the image of the sun on the film, 


given the sun is 1.40 x 10° km in diameter and is 1.50 x 10° km 
away? 


Solution: 


—0.933 mm 
Exercise: 


Problem: 


Combine thin lens equations to show that the magnification for a thin 
lens is determined by its focal length and the object distance and is 


given bym = f/(f — do). 


Glossary 


converging lens 
a convex lens in which light rays that enter it parallel to its axis 
converge at a single point on the opposite side 


diverging lens 
a concave lens in which light rays that enter it parallel to its axis bend 
away (diverge) from its axis 


focal point 
for a converging lens or mirror, the point at which converging light 
rays cross; for a diverging lens or mirror, the point from which 
diverging light rays appear to originate 


focal length 
distance from the center of a lens or curved mirror to its focal point 


magnification 
ratio of image height to object height 


power 
inverse of focal length 


real image 
image that can be projected 


virtual image 
image that cannot be projected 


Image Formation by Mirrors 


¢ Illustrate image formation in a flat mirror. 

e Explain with ray diagrams the formation of an image using spherical 
mirrors. 

¢ Determine focal length and magnification given radius of curvature, 
distance of object and image. 


We only have to look as far as the nearest bathroom to find an example of 
an image formed by a mirror. Images in flat mirrors are the same size as the 
object and are located behind the mirror. Like lenses, mirrors can form a 
variety of images. For example, dental mirrors may produce a magnified 
image, just as makeup mirrors do. Security mirrors in shops, on the other 
hand, form images that are smaller than the object. We will use the law of 
reflection to understand how mirrors form images, and we will find that 
mirror images are analogous to those formed by lenses. 


[link] helps illustrate how a flat mirror forms an image. Two rays are shown 
emerging from the same point, striking the mirror, and being reflected into 
the observer’s eye. The rays can diverge slightly, and both still get into the 
eye. If the rays are extrapolated backward, they seem to originate from a 
common point behind the mirror, locating the image. (The paths of the 
reflected rays into the eye are the same as if they had come directly from 
that point behind the mirror.) Using the law of reflection—the angle of 
reflection equals the angle of incidence—we can see that the image and 
object are the same distance from the mirror. This is a virtual image, since it 
cannot be projected—the rays only appear to originate from a common 
point behind the mirror. Obviously, if you walk behind the mirror, you 
cannot see the image, since the rays do not go there. But in front of the 
mirror, the rays behave exactly as if they had come from behind the mirror, 
so that is where the image is situated. 


Flat mirror ~/\. 


Two sets of rays from common points on an object 
are reflected by a flat mirror into the eye of an 
observer. The reflected rays seem to originate from 
behind the mirror, locating the virtual image. 


Now let us consider the focal length of a mirror—for example, the concave 
spherical mirrors in [link]. Rays of light that strike the surface follow the 
law of reflection. For a mirror that is large compared with its radius of 
curvature, as in [link](a), we see that the reflected rays do not cross at the 
Same point, and the mirror does not have a well-defined focal point. If the 
mirror had the shape of a parabola, the rays would all cross at a single point, 
and the mirror would have a well-defined focal point. But parabolic mirrors 
are much more expensive to make than spherical mirrors. The solution is to 
use a mirror that is small compared with its radius of curvature, as shown in 
[link](b). (This is the mirror equivalent of the thin lens approximation.) To a 
very good approximation, this mirror has a well-defined focal point at F that 
is the focal distance f from the center of the mirror. The focal length f of a 
concave mirror is positive, since it is a converging mirror. 


(a) Parallel rays reflected from a large spherical 
mirror do not all cross at a common point. (b) If a 
spherical mirror is small compared with its radius 

of curvature, parallel rays are focused to a 
common point. The distance of the focal point 
from the center of the mirror is its focal length f. 

Since this mirror is converging, it has a positive 

focal length. 


Just as for lenses, the shorter the focal length, the more powerful the mirror; 
thus, P = 1/f for a mirror, too. A more strongly curved mirror has a 
shorter focal length and a greater power. Using the law of reflection and 
some simple trigonometry, it can be shown that the focal length is half the 
radius of curvature, or 

Equation: 


where fF is the radius of curvature of a spherical mirror. The smaller the 
radius of curvature, the smaller the focal length and, thus, the more 
powerful the mirror. 


The convex mirror shown in [link] also has a focal point. Parallel rays of 
light reflected from the mirror seem to originate from the point F at the 


focal distance f behind the mirror. The focal length and power of a convex 
mirror are negative, since it is a diverging mirror. 


(negative) 


Parallel rays of light 
reflected from a convex 
spherical mirror (small in 
size compared with its 
radius of curvature) seem 
to originate from a well- 
defined focal point at the 
focal distance f behind 
the mirror. Convex 
mirrors diverge light rays 
and, thus, have a negative 
focal length. 


Ray tracing is as useful for mirrors as for lenses. The rules for ray tracing 
for mirrors are based on the illustrations just discussed: 


1. A ray approaching a concave converging mirror parallel to its axis is 
reflected through the focal point F of the mirror on the same side. (See 
rays 1 and 3 in [link](b).) 

2. A ray approaching a convex diverging mirror parallel to its axis is 
reflected so that it seems to come from the focal point F behind the 
mirror. (See rays 1 and 3 in [link].) 

3. Any ray striking the center of a mirror is followed by applying the law 
of reflection; it makes the same angle with the axis when leaving as 
when approaching. (See ray 2 in [link].) 

4. A ray approaching a concave converging mirror through its focal point 
is reflected parallel to its axis. (The reverse of rays 1 and 3 in [link].) 

5. A ray approaching a convex diverging mirror by heading toward its 
focal point on the opposite side is reflected parallel to the axis. (The 
reverse of rays 1 and 3 in [link].) 


We will use ray tracing to illustrate how images are formed by mirrors, and 
we Can use ray tracing quantitatively to obtain numerical information. But 
since we assume each mirror is small compared with its radius of curvature, 
we can use the thin lens equations for mirrors just as we did for lenses. 


Consider the situation shown in [link], concave spherical mirror reflection, 
in which an object is placed farther from a concave (converging) mirror 
than its focal length. That is, f is positive and d, > f, so that we may expect 
an image similar to the case 1 real image formed by a converging lens. Ray 
tracing in [link] shows that the rays from a common point on the object all 
cross at a point on the same side of the mirror as the object. Thus a real 
image can be projected onto a screen placed at this location. The image 
distance is positive, and the image is inverted, so its magnification is 
negative. This is a case 1 image for mirrors. It differs from the case 1 image 
for lenses only in that the image is on the same side of the mirror as the 
object. It is otherwise identical. 


A case 1 image for a 
mirror. An object is 
farther from the 
converging mirror than its 
focal length. Rays from a 
common point on the 
object are traced using the 
rules in the text. Ray 1 
approaches parallel to the 
axis, ray 2 strikes the 
center of the mirror, and 
ray 3 goes through the 
focal point on the way 
toward the mirror. All 
three rays cross at the 
same point after being 
reflected, locating the 
inverted real image. 
Although three rays are 
shown, only two of the 
three are needed to locate 
the image and determine 
its height. 


Example: 


A Concave Reflector 

Electric room heaters use a concave mirror to reflect infrared (IR) radiation 
from hot coils. Note that IR follows the same law of reflection as visible 
light. Given that the mirror has a radius of curvature of 50.0 cm and 
produces an image of the coils 3.00 m away from the mirror, where are the 
coils? 

Strategy and Concept 

We are given that the concave mirror projects a real image of the coils at an 
image distance d; = 3.00 m. The coils are the object, and we are asked to 
find their location—that is, to find the object distance d,. We are also given 
the radius of curvature of the mirror, so that its focal length is 

f = R/2 = 25.0 cm (positive since the mirror is concave or converging). 
Assuming the mirror is small compared with its radius of curvature, we can 
use the thin lens equations, to solve this problem. 

Solution 

Since d; and f are known, thin lens equation can be used to find dy: 
Equation: 


1 m el 
do d; 7 ‘ia 
Rearranging to isolate d, gives 
Equation: 
ee 1 
d. 7 f di 


Entering known quantities gives a value for 1/d,: 
Equation: 


ae 
d, 0.250m 3.0m mm 


This must be inverted to find d,: 
Equation: 


Discussion 

Note that the object (the filament) is farther from the mirror than the 
mirror’s focal length. This is a case 1 image (d, > f and f positive), 
consistent with the fact that a real image is formed. You will get the most 
concentrated thermal energy directly in front of the mirror and 3.00 m 
away from it. Generally, this is not desirable, since it could cause burns. 
Usually, you want the rays to emerge parallel, and this is accomplished by 
having the filament at the focal point of the mirror. 

Note that the filament here is not much farther from the mirror than its 
focal length and that the image produced is considerably farther away. This 
is exactly analogous to a slide projector. Placing a slide only slightly 
farther away from the projector lens than its focal length produces an 
image significantly farther away. As the object gets closer to the focal 
distance, the image gets farther away. In fact, as the object distance 
approaches the focal length, the image distance approaches infinity and the 
rays are sent out parallel to one another. 


Example: 

Solar Electric Generating System 

One of the solar technologies used today for generating electricity is a 
device (called a parabolic trough or concentrating collector) that 
concentrates the sunlight onto a blackened pipe that contains a fluid. This 
heated fluid is pumped to a heat exchanger, where its heat energy is 
transferred to another system that is used to generate steam—and so 
generate electricity through a conventional steam cycle. [link] shows such 
a working system in southern California. Concave mirrors are used to 
concentrate the sunlight onto the pipe. The mirror has the approximate 
shape of a section of a cylinder. For the problem, assume that the mirror is 
exactly one-quarter of a full cylinder. 


a. If we wish to place the fluid-carrying pipe 40.0 cm from the concave 
mirror at the mirror’s focal point, what will be the radius of curvature 
of the mirror? 

b. Per meter of pipe, what will be the amount of sunlight concentrated 
onto the pipe, assuming the insolation (incident solar radiation) is 


0.900 kW /m?? 

c. If the fluid-carrying pipe has a 2.00-cm diameter, what will be the 
temperature increase of the fluid per meter of pipe over a period of 
one minute? Assume all the solar radiation incident on the reflector is 
absorbed by the pipe, and that the fluid is mineral oil. 


Strategy 

To solve an Integrated Concept Problem we must first identify the physical 
principles involved. Part (a) is related to the current topic. Part (b) involves 
a little math, primarily geometry. Part (c) requires an understanding of heat 
and density. 

Solution to (a) 

To a good approximation for a concave or semi-spherical surface, the point 
where the parallel rays from the sun converge will be at the focal point, so 
i 27 —-o0r0.em* 

Solution to (b) 

The insolation is 900 W/ m”. We must find the cross-sectional area A of 


the concave mirror, since the power delivered is 900 W/ m? x A. The 
mirror in this case is a quarter-section of a cylinder, so the area for a length 
L of the mirror is A = +(27R)L. The area for a length of 1.00 m is then 


Equation: 


3.14 
5 R(1.00 i) (0.800 m)(1.00 m) = 1.26 m?. 


The insolation on the 1.00-m length of pipe is then 
Equation: 


(2.00 x ie) (1.26 m?) — 1130 W. 
m 


Solution to (c) 

The increase in temperature is given by Q = mc AT. The mass m of the 
mineral oil in the one-meter section of pipe is 

Equation: 


i — Ne pn(4)*(1.00 m) 
= (8.00 x 10? kg/m’) (3.14) (0.0100 m)?(1.00 m) 


= 0.251 kg. 
Therefore, the increase in temperature in one minute is 
Equation: 
AT = Q/mce 
= (1130 W)(60.0 s) 
~ (0.251 kg)(1670 J-kg/°C) 
= 162°C. 


Discussion for (c) 
An array of such pipes in the California desert can provide a thermal 
output of 250 MW ona sunny day, with fluids reaching temperatures as 
high as 400°C. We are considering only one meter of pipe here, and 
ignoring heat losses along the pipe. 


Parabolic trough collectors are 
used to generate electricity in 
southern California. (credit: 
kjkolb, Wikimedia Commons) 


What happens if an object is closer to a concave mirror than its focal 
length? This is analogous to a case 2 image for lenses (dy < f and f 
positive), which is a magnifier. In fact, this is how makeup mirrors act as 
magnifiers. [link](a) uses ray tracing to locate the image of an object 
placed close to a concave mirror. Rays from a common point on the object 


are reflected in such a manner that they appear to be coming from behind 
the mirror, meaning that the image is virtual and cannot be projected. As 
with a magnifying glass, the image is upright and larger than the object. 
This is a case 2 image for mirrors and is exactly analogous to that for 
lenses. 


(b) 


(a) Case 2 images for mirrors are 
formed when a converging mirror has 
an object closer to it than its focal 
length. Ray 1 approaches parallel to 
the axis, ray 2 strikes the center of the 
mirror, and ray 3 approaches the 
mirror as if it came from the focal 
point. (b) A magnifying mirror 
showing the reflection. (credit: Mike 
Melrose, Flickr) 


All three rays appear to originate from the same point after being reflected, 
locating the upright virtual image behind the mirror and showing it to be 
larger than the object. (b) Makeup mirrors are perhaps the most common 
use of a concave mirror to produce a larger, upright image. 

A convex mirror is a diverging mirror (f is negative) and forms only one 
type of image. It is a case 3 image—one that is upright and smaller than 
the object, just as for diverging lenses. [link](a) uses ray tracing to 
illustrate the location and size of the case 3 image for mirrors. Since the 
image is behind the mirror, it cannot be projected and is thus a virtual 
image. It is also seen to be smaller than the object. 


Case 3 images for mirrors 
are formed by any convex 
mirror. Ray 1 approaches 
parallel to the axis, ray 2 
strikes the center of the 


mirror, and ray 3 approaches 
toward the focal point. All 
three rays appear to originate 
from the same point after 
being reflected, locating the 
upright virtual image behind 
the mirror and showing it to 
be smaller than the object. 
(b) Security mirrors are 
convex, producing a smaller, 
upright image. Because the 
image is smaller, a larger 
area is imaged compared to 
what would be observed for 
a flat mirror (and hence 
security is improved). 
(credit: Laura D’ Alessandro, 
Flickr) 


Example: 

Image in a Convex Mirror 

A keratometer is a device used to measure the curvature of the cornea, 
particularly for fitting contact lenses. Light is reflected from the cornea, 
which acts like a convex mirror, and the keratometer measures the 
magnification of the image. The smaller the magnification, the smaller the 
radius of curvature of the cornea. If the light source is 12.0 cm from the 
comea and the image’s magnification is 0.0320, what is the cornea’s radius 
of curvature? 

Strategy 

If we can find the focal length of the convex mirror formed by the cornea, 
we can find its radius of curvature (the radius of curvature is twice the 
focal length of a spherical mirror). We are given that the object distance is 


d, = 12.0 cm and that m = 0.0320. We first solve for the image distance 
d;, and then for f. 


Solution 
m = —d;/d,. Solving this expression for d; gives 
Equation: 

d; = —md, 
Entering known values yields 
Equation: 

d; =— (0.0320)(12.0 cm) = -0.384 cm. 

Equation: 

eee mn 1 

f a do d; 


Substituting known values, 
Equation: 


1 1 aeiby: 


Fite 0G a 


This must be inverted to find f: 
Equation: 


v= cm 
9559 


f = ~0.400 cm. 


The radius of curvature is twice the focal length, so that 
Equation: 


R=2)| f |= 0.800 cm. 


Discussion 

Although the focal length f of a convex mirror is defined to be negative, 
we take the absolute value to give us a positive value for R. The radius of 
curvature found here is reasonable for a cornea. The distance from cornea 


to retina in an adult eye is about 2.0 cm. In practice, many corneas are not 
spherical, complicating the job of fitting contact lenses. Note that the 
image distance here is negative, consistent with the fact that the image is 
behind the mirror, where it cannot be projected. In this section’s Problems 
and Exercises, you will show that for a fixed object distance, the smaller 
the radius of curvature, the smaller the magnification. 

The three types of images formed by mirrors (cases 1, 2, and 3) are exactly 
analogous to those formed by lenses, as summarized in the table at the end 
of Image Formation by Lenses. It is easiest to concentrate on only three 
types of images—then remember that concave mirrors act like convex 
lenses, whereas convex mirrors act like concave lenses. 


Note: 

Take-Home Experiment: Concave Mirrors Close to Home 

Find a flashlight and identify the curved mirror used in it. Find another 
flashlight and shine the first flashlight onto the second one, which is turned 
off. Estimate the focal length of the mirror. You might try shining a 
flashlight on the curved mirror behind the headlight of a car, keeping the 
headlight switched off, and determine its focal length. 


Problem-Solving Strategy for Mirrors 


Step 1. Examine the situation to determine that image formation by a mirror 
is involved. 


Step 2. Refer to the Problem-Solving Strategies for Lenses. The same 
strategies are valid for mirrors as for lenses with one qualification—use the 
ray tracing rules for mirrors listed earlier in this section. 


Section Summary 


¢ The characteristics of an image formed by a flat mirror are: (a) The 
image and object are the same distance from the mirror, (b) The image 


is a virtual image, and (c) The image is situated behind the mirror. 
e Image length is half the radius of curvature. 
Equation: 


e A convex mirror is a diverging mirror and forms only one type of 
image, namely a virtual image. 


Conceptual Questions 


Exercise: 
Problem: 
What are the differences between real and virtual images? How can 


you tell (by looking) whether an image formed by a single lens or 
mirror is real or virtual? 


Exercise: 
Problem: 
Can you see a virtual image? Can you photograph one? Can one be 


projected onto a screen with additional lenses or mirrors? Explain your 
responses. 


Exercise: 


Problem: 


Is it necessary to project a real image onto a screen for it to exist? 
Exercise: 


Problem: 


At what distance is an image always located—at d,, d;, or f? 


Exercise: 


Problem: 
Under what circumstances will an image be located at the focal point 
of a lens or mirror? 

Exercise: 
Problem: 
What is meant by a negative magnification? What is meant by a 
magnification that is less than 1 in magnitude? 

Exercise: 
Problem: 
Can a case 1 image be larger than the object even though its 
magnification is always negative? Explain. 

Exercise: 
Problem: 
[link] shows a light bulb between two mirrors. One mirror produces a 
beam of light with parallel rays; the other keeps light from escaping 


without being put into the beam. Where is the filament of the light in 
relation to the focal point or radius of curvature of each mirror? 


The two mirrors trap most 
of the bulb’s light and 
form a directional beam 
as in a headlight. 


Exercise: 
Problem: 
Devise an arrangement of mirrors allowing you to see the back of your 
head. What is the minimum number of mirrors needed for this task? 
Exercise: 
Problem: 
If you wish to see your entire body in a flat mirror (from head to toe), 


how tall should the mirror be? Does its size depend upon your distance 
away from the mirror? Provide a sketch. 


Exercise: 
Problem: 
It can be argued that a flat mirror has an infinite focal length. If so, 
where does it form an image? That is, how are d; and dy related? 
Exercise: 
Problem: 
Why are diverging mirrors often used for rear-view mirrors in 


vehicles? What is the main disadvantage of using such a mirror 
compared with a flat one? 


Problems & Exercises 


Exercise: 


Problem: 


What is the focal length of a makeup mirror that has a power of 1.50 
D? 


Solution: 


+0.667 m 
Exercise: 
Problem: 
Some telephoto cameras use a mirror rather than a lens. What radius of 


curvature mirror is needed to replace a 800 mm focal length telephoto 
lens? 


Exercise: 
Problem: 
(a) Calculate the focal length of the mirror formed by the shiny back of 


a spoon that has a 3.00 cm radius of curvature. (b) What is its power in 
diopters? 


Solution: 
(a)-1.5 x 102m 
(b)-66.7 D 


Exercise: 
Problem: 
Find the magnification of the heater element in [link]. Note that its 
large magnitude helps spread out the reflected energy. 

Exercise: 
Problem: 
What is the focal length of a makeup mirror that produces a 
magnification of 1.50 when a person’s face is 12.0 cm away? 


Explicitly show how you follow the steps in the Problem-Solving 
Strategy for Mirrors. 


Solution: 


+0.360 m (concave) 
Exercise: 


Problem: 


A shopper standing 3.00 m from a convex security mirror sees his 
image with a magnification of 0.250. (a) Where is his image? (b) What 
is the focal length of the mirror? (c) What is its radius of curvature? 
Explicitly show how you follow the steps in the Problem-Solving 
Strategy for Mirrors. 


Exercise: 
Problem: 
An object 1.50 cm high is held 3.00 cm from a person’s cornea, and its 
reflected image is measured to be 0.167 cm high. (a) What is the 
magnification? (b) Where is the image? (c) Find the radius of 
curvature of the convex mirror formed by the comea. (Note that this 
technique is used by optometrists to measure the curvature of the 


comea for contact lens fitting. The instrument used is called a 
keratometer, or curve measurer.) 


Solution: 
(a) +0.111 
(b) -0.334 cm (behind “mirror” 


(c) 0.752cm 
Exercise: 


Problem: 


Ray tracing for a flat mirror shows that the image is located a distance 
behind the mirror equal to the distance of the object from the mirror. 
This is stated d; = —do, since this is a negative image distance (it is a 
virtual image). (a) What is the focal length of a flat mirror? (b) What is 
its power? 


Exercise: 
Problem: 
Show that for a flat mirror h; = ho, knowing that the image is a 


distance behind the mirror equal in magnitude to the distance of the 
object from the mirror. 


Solution: 
Equation: 
h; d; —d d. 
m= — = —-— = -— = SK H=1Sh=h 
hy do dy dy 
Exercise: 
Problem: 


Use the law of reflection to prove that the focal length of a mirror is 
half its radius of curvature. That is, prove that f = R/2. Note this is 
true for a spherical mirror only if its diameter is small compared with 
its radius of curvature. 


Exercise: 


Problem: 


Referring to the electric room heater considered in the first example in 
this section, calculate the intensity of IR radiation in W/ m? projected 
by the concave mirror on a person 3.00 m away. Assume that the 
heating element radiates 1500 W and has an area of 100 cm2, and that 
half of the radiated power is reflected and focused by the mirror. 


Solution: 


6.82 kW/m’? 


Exercise: 


Problem: 


Consider a 250-W heat lamp fixed to the ceiling in a bathroom. If the 
filament in one light burns out then the remaining three still work. 
Construct a problem in which you determine the resistance of each 
filament in order to obtain a certain intensity projected on the 
bathroom floor. The ceiling is 3.0 m high. The problem will need to 
involve concave mirrors behind the filaments. Your instructor may 
wish to guide you on the level of complexity to consider in the 
electrical components. 


Glossary 


converging mirror 
a concave mirror in which light rays that strike it parallel to its axis 
converge at one or more points along the axis 


diverging mirror 
a convex mirror in which light rays that strike it parallel to its axis 
bend away (diverge) from its axis 


law of reflection 
angle of reflection equals the angle of incidence 


Introduction to Vision and Optical Instruments 
class="introduction" 


A scientist 
examines 
minute 
details on the 
surface of a 
disk drive at 
a 
magnificatio 
n of 100,000 
times. The 
image was 
produced 
using an 
electron 
microscope. 
(credit: 
Robert 
Scoble) 


Explore how the image on the computer screen is formed. How is the image 
formation on the computer screen different from the image formation in 
your eye as you look down the microscope? How can videos of living cell 
processes be taken for viewing later on, and by many different people? 


Seeing faces and objects we love and cherish is a delight—one’s favorite 
teddy bear, a picture on the wall, or the sun rising over the mountains. 
Intricate images help us understand nature and are invaluable for 
developing techniques and technologies in order to improve the quality of 
life. The image of a red blood cell that almost fills the cross-sectional area 
of a tiny capillary makes us wonder how blood makes it through and not get 
stuck. We are able to see bacteria and viruses and understand their structure. 
It is the knowledge of physics that provides fundamental understanding and 
models required to develop new techniques and instruments. Therefore, 
physics is called an enabling science—a science that enables development 
and advancement in other areas. It is through optics and imaging that 
physics enables advancement in major areas of biosciences. This chapter 
illustrates the enabling nature of physics through an understanding of how a 


human eye is able to see and how we are able to use optical instruments to 
see beyond what is possible with the naked eye. It is convenient to 
categorize these instruments on the basis of geometric optics (see 
Geometric Optics) and wave optics (see Wave Optics). 


Physics of the Eye 


e Explain the image formation by the eye. 

e Explain why peripheral images lack detail and color. 

¢ Define refractive indices. 

e Analyze the accommodation of the eye for distant and near vision. 


The eye is perhaps the most interesting of all optical instruments. The eye is 
remarkable in how it forms images and in the richness of detail and color it 
can detect. However, our eyes commonly need some correction, to reach 
what is called “normal” vision, but should be called ideal rather than 
normal. Image formation by our eyes and common vision correction are 
easy to analyze with the optics discussed in Geometric Optics. 


[link] shows the basic anatomy of the eye. The cornea and lens form a 
system that, to a good approximation, acts as a single thin lens. For clear 
vision, a real image must be projected onto the light-sensitive retina, which 
lies at a fixed distance from the lens. The lens of the eye adjusts its power to 
produce an image on the retina for objects at different distances. The center 
of the image falls on the fovea, which has the greatest density of light 
receptors and the greatest acuity (sharpness) in the visual field. The variable 
opening (or pupil) of the eye along with chemical adaptation allows the eye 
to detect light intensities from the lowest observable to 10° times greater 
(without damage). This is an incredible range of detection. Our eyes 
perform a vast number of functions, such as sense direction, movement, 
sophisticated colors, and distance. Processing of visual nerve impulses 
begins with interconnections in the retina and continues in the brain. The 
optic nerve conveys signals received by the eye to the brain. 


Aqueous 
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ron nerve 
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The cornea and lens of an eye act 
together to form a real image on 
the light-sensing retina, which has 
its densest concentration of 
receptors in the fovea and a blind 
spot over the optic nerve. The 
power of the lens of an eye is 
adjustable to provide an image on 
the retina for varying object 
distances. Layers of tissues with 
varying indices of refraction in the 
lens are shown here. However, 
they have been omitted from other 
pictures for clarity. 


Refractive indices are crucial to image formation using lenses. [link] shows 
refractive indices relevant to the eye. The biggest change in the refractive 
index, and bending of rays, occurs at the cornea rather than the lens. The 
ray diagram in [link] shows image formation by the cornea and lens of the 
eye. The rays bend according to the refractive indices provided in [link]. 
The cornea provides about two-thirds of the power of the eye, owing to the 
fact that speed of light changes considerably while traveling from air into 
cornea. The lens provides the remaining power needed to produce an image 
on the retina. The cornea and lens can be treated as a single thin lens, even 


though the light rays pass through several layers of material (such as 
cornea, aqueous humor, several layers in the lens, and vitreous humor), 
changing direction at each interface. The image formed is much like the one 
produced by a single convex lens. This is a case 1 image. Images formed in 
the eye are inverted but the brain inverts them once more to make them 
seem upright. 


Material Index of Refraction 
Water 1.33 

Air 1.0 

Cornea 1.38 

— | 


Dene 1.41 average (varies throughout the lens, greatest 
in center) 

Vitreous 1.34 

humor 


Refractive Indices Relevant to the Eye 


An image is formed on the retina with 
light rays converging most at the 
cornea and upon entering and exiting 
the lens. Rays from the top and 
bottom of the object are traced and 
produce an inverted real image on the 
retina. The distance to the object is 
drawn smaller than scale. 


As noted, the image must fall precisely on the retina to produce clear vision 
— that is, the image distance d; must equal the lens-to-retina distance. 
Because the lens-to-retina distance does not change, the image distance d; 
must be the same for objects at all distances. The eye manages this by 
varying the power (and focal length) of the lens to accommodate for objects 
at various distances. The process of adjusting the eye’s focal length is called 
accommodation. A person with normal (ideal) vision can see objects 
clearly at distances ranging from 25 cm to essentially infinity. However, 
although the near point (the shortest distance at which a sharp focus can be 
obtained) increases with age (becoming meters for some older people), we 
will consider it to be 25 cm in our treatment here. 


[link] shows the accommodation of the eye for distant and near vision. 
Since light rays from a nearby object can diverge and still enter the eye, the 
lens must be more converging (more powerful) for close vision than for 
distant vision. To be more converging, the lens is made thicker by the action 
of the ciliary muscle surrounding it. The eye is most relaxed when viewing 


distant objects, one reason that microscopes and telescopes are designed to 
produce distant images. Vision of very distant objects is called totally 
relaxed, while close vision is termed accommodated, with the closest vision 
being fully accommodated. 


|} d, (very large) 
(a) 


|} d, (very small) 
(b) 


Relaxed and accommodated vision for distant and 
close objects. (a) Light rays from the same point 
on a distant object must be nearly parallel while 

entering the eye and more easily converge to 
produce an image on the retina. (b) Light rays 
from a nearby object can diverge more and still 
enter the eye. A more powerful lens is needed to 
converge them on the retina than if they were 
parallel. 


We will use the thin lens equations to examine image formation by the eye 
quantitatively. First, note the power of a lens is given as p = 1/f, so we 
rewrite the thin lens equations as 


Equation: 
| 1 
P=—+— 
dd 
and 
Equation: 
h; d; 
SS Sm. 
ho d, 


We understand that d; must equal the lens-to-retina distance to obtain clear 
vision, and that normal vision is possible for objects at distances 
d, = 25 cm to infinity. 


Note: 

Take-Home Experiment: The Pupil 

Look at the central transparent area of someone’s eye, the pupil, in normal 
room light. Estimate the diameter of the pupil. Now turn off the lights and 
darken the room. After a few minutes turn on the lights and promptly 
estimate the diameter of the pupil. What happens to the pupil as the eye 
adjusts to the room light? Explain your observations. 


The eye can detect an impressive amount of detail, considering how small 
the image is on the retina. To get some idea of how small the image can be, 
consider the following example. 


Example: 

Size of Image on Retina 

What is the size of the image on the retina of a 1.20 x 10 ? cm diameter 
human hair, held at arm’s length (60.0 cm) away? Take the lens-to-retina 
distance to be 2.00 cm. 

Strategy 

We want to find the height of the image h;, given the height of the object is 
h, = 1.20 x 10-2 cm. We also know that the object is 60.0 cm away, so 
that d, = 60.0 cm. For clear vision, the image distance must equal the 
lens-to-retina distance, and so d; = 2.00 cm . The equation 


fh =— “i — mcan be used to find h; with the known information. 
Solution 
The only unknown variable in the equation = = os Sees. 
Equation: 

A di 

hy d, 
Rearranging to isolate h; yields 
Equation: 

d: 
hehe ai 
Substituting the known values gives 
Equation: 
a) 2.00 
= —4.00 x 10°*cm. 

Discussion 


This truly small image is not the smallest discernible—that is, the limit to 
visual acuity is even smaller than this. Limitations on visual acuity have to 
do with the wave properties of light and will be discussed in the next 
chapter. Some limitation is also due to the inherent anatomy of the eye and 
processing that occurs in our brain. 


Example: 

Power Range of the Eye 

Calculate the power of the eye when viewing objects at the greatest and 
smallest distances possible with normal vision, assuming a lens-to-retina 
distance of 2.00 cm (a typical value). 

Strategy 

For clear vision, the image must be on the retina, and so d; = 2.00 cm 
here. For distant vision, d, ~ oo, and for close vision, dy = 25.0 cm, as 


discussed earlier. The equation P = - = “- as written just above, can be 


used directly to solve for P in both cases, since we know d; and dy. Power 
has units of diopters, where 1 D = 1/m, and so we should express all 
distances in meters. 


Solution 
For distant vision, 
Equation: 
1 1 1 1 
P= —+— = — + —. 
do a d; oe s 0.0200 m 


Since 1/oo = 0, this gives 
Equation: 


P=0+50.0/m = 50.0 D (distant vision). 


Now, for close vision, 


Equation: 
1 a 1 1 
P= d, as d, + 0.250m + 90200m 
= fot Ale, oe — 4.00 D+ 50.0 D 
= 54.0D (close vision). 
Discussion 


For an eye with this typical 2.00 cm lens-to-retina distance, the power of 
the eye ranges from 50.0 D (for distant totally relaxed vision) to 54.0 D 
(for close fully accommodated vision), which is an 8% increase. This 
increase in power for close vision is consistent with the preceding 


discussion and the ray tracing in [link]. An 8% ability to accommodate is 
considered normal but is typical for people who are about 40 years old. 
Younger people have greater accommodation ability, whereas older people 
gradually lose the ability to accommodate. When an optometrist identifies 
accommodation as a problem in elder people, it is most likely due to 
stiffening of the lens. The lens of the eye changes with age in ways that 
tend to preserve the ability to see distant objects clearly but do not allow 
the eye to accommodate for close vision, a condition called presbyopia 
(literally, elder eye). To correct this vision defect, we place a converging, 
positive power lens in front of the eye, such as found in reading glasses. 
Commonly available reading glasses are rated by their power in diopters, 
typically ranging from 1.0 to 3.5 D. 


Section Summary 


e Image formation by the eye is adequately described by the thin lens 
equations: 
Equation: 


Pa Scan ed 
dy. i he dy 


e The eye produces a real image on the retina by adjusting its focal 
length and power in a process called accommodation. 

e For close vision, the eye is fully accommodated and has its greatest 
power, whereas for distant vision, it is totally relaxed and has its 
smallest power. 

e The loss of the ability to accommodate with age is called presbyopia, 
which is corrected by the use of a converging lens to add power for 
close vision. 


Conceptual Questions 


Exercise: 


Problem: 


If the lens of a person’s eye is removed because of cataracts (as has 
been done since ancient times), why would you expect a spectacle lens 
of about 16 D to be prescribed? 


Exercise: 
Problem: 
A cataract is cloudiness in the lens of the eye. Is light dispersed or 
diffused by it? 
Exercise: 
Problem: 
When laser light is shone into a relaxed normal-vision eye to repair a 


tear by spot-welding the retina to the back of the eye, the rays entering 
the eye must be parallel. Why? 


Exercise: 
Problem: 
How does the power of a dry contact lens compare with its power 
when resting on the tear layer of the eye? Explain. 
Exercise: 
Problem: 


Why is your vision so blurry when you open your eyes while 
swimming under water? How does a face mask enable clear vision? 


Problem Exercises 


Unless otherwise stated, the lens-to-retina distance is 2.00 cm. 
Exercise: 


Problem: 
What is the power of the eye when viewing an object 50.0 cm away? 


Solution: 


52:0-D 
Exercise: 


Problem: 


Calculate the power of the eye when viewing an object 3.00 m away. 
Exercise: 

Problem: 

(a) The print in many books averages 3.50 mm in height. How high is 


the image of the print on the retina when the book is held 30.0 cm 
from the eye? 


(b) Compare the size of the print to the sizes of rods and cones in the 
fovea and discuss the possible details observable in the letters. (The 
eye-brain system can perform better because of interconnections and 
higher order image processing.) 


Solution: 
(a) —0.233 mm 


(b) The size of the rods and the cones is smaller than the image height, 
so we can distinguish letters on a page. 


Exercise: 


Problem: 


Suppose a certain person’s visual acuity is such that he can see objects 
clearly that form an image 4.00 pm high on his retina. What is the 
maximum distance at which he can read the 75.0 cm high letters on the 
side of an airplane? 


Exercise: 


Problem: 


People who do very detailed work close up, such as jewellers, often 
can see objects clearly at much closer distance than the normal 25 cm. 


(a) What is the power of the eyes of a woman who can see an object 
clearly at a distance of only 8.00 cm? 


(b) What is the size of an image of a 1.00 mm object, such as lettering 
inside a ring, held at this distance? 


(c) What would the size of the image be if the object were held at the 
normal 25.0 cm distance? 


Solution: 
(a) +62.5 D 
(b) —0.250 mm 


(c) 0.0800 mm 


Glossary 


accommodation 
the ability of the eye to adjust its focal length is known as 
accommodation 


presbyopia 


a condition in which the lens of the eye becomes progressively unable 
to focus on objects close to the viewer 


Vision Correction 


e Identify and discuss common vision defects. 
e Explain nearsightedness and farsightedness corrections. 
e Explain laser vision correction. 


The need for some type of vision correction is very common. Common 
vision defects are easy to understand, and some are simple to correct. [link] 
illustrates two common vision defects. Nearsightedness, or myopia, is the 
inability to see distant objects clearly while close objects are clear. The eye 
overconverges the nearly parallel rays from a distant object, and the rays 
cross in front of the retina. More divergent rays from a close object are 
converged on the retina for a clear image. The distance to the farthest object 
that can be seen clearly is called the far point of the eye (normally infinity). 
Farsightedness, or hyperopia, is the inability to see close objects clearly 
while distant objects may be clear. A farsighted eye does not converge 
sufficient rays from a close object to make the rays meet on the retina. Less 
diverging rays from a distant object can be converged for a clear image. The 
distance to the closest object that can be seen clearly is called the near 
point of the eye (normally 25 cm). 


Lens too strong Eye too long 
= — & 
(a) Myopia 

Lens too weak Eye too short 


S 


(b) Hyperopia 


(a) The nearsighted (myopic) eye converges 
rays from a distant object in front of the 
retina; thus, they are diverging when they 


strike the retina, producing a blurry image. 
This can be caused by the lens of the eye 

being too powerful or the length of the eye 

being too great. (b) The farsighted 
(hyperopic) eye is unable to converge the 
rays from a close object by the time they 
strike the retina, producing blurry close 
vision. This can be caused by insufficient 
power in the lens or by the eye being too 
short. 


Since the nearsighted eye over converges light rays, the correction for 
nearsightedness is to place a diverging spectacle lens in front of the eye. 
This reduces the power of an eye that is too powerful. Another way of 
thinking about this is that a diverging spectacle lens produces a case 3 
image, which is closer to the eye than the object (see [link]). To determine 
the spectacle power needed for correction, you must know the person’s far 
point—that is, you must know the greatest distance at which the person can 
see clearly. Then the image produced by a spectacle lens must be at this 
distance or closer for the nearsighted person to be able to see it clearly. It is 
worth noting that wearing glasses does not change the eye in any way. The 
eyeglass lens is simply used to create an image of the object at a distance 
where the nearsighted person can see it clearly. Whereas someone not 
wearing glasses can see clearly objects that fall between their near point and 
their far point, someone wearing glasses can see images that fall between 
their near point and their far point. 


I 
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Correction of nearsightedness 
requires a diverging lens that 
compensates for the 
overconvergence by the eye. 
The diverging lens produces an 
image closer to the eye than the 
object, so that the nearsighted 
person can see it clearly. 


Example: 

Correcting Nearsightedness 

What power of spectacle lens is needed to correct the vision of a 
nearsighted person whose far point is 30.0 cm? Assume the spectacle 
(corrective) lens is held 1.50 cm away from the eye by eyeglass frames. 
Strategy 

You want this nearsighted person to be able to see very distant objects 
clearly. That means the spectacle lens must produce an image 30.0 cm 
from the eye for an object very far away. An image 30.0 cm from the eye 
will be 28.5 cm to the left of the spectacle lens (see [link]). Therefore, we 


must get dj = —28.5 cm when d, & oo. The image distance is negative, 
because it is on the same side of the spectacle as the object. 

Solution 

Since d; and d, are known, the power of the spectacle lens can be found 
using P = = =F + as written earlier: 


i 


Equation: 
p= 1 n 1 7 il n 1 
do di co —0.285m™ 
Since 1/oo= 0, we obtain: 
Equation: 
P=0-3.51/m = —3.51 D. 
Discussion 


The negative power indicates a diverging (or concave) lens, as expected. 
The spectacle produces a case 3 image closer to the eye, where the person 
can see it. If you examine eyeglasses for nearsighted people, you will find 
the lenses are thinnest in the center. Additionally, if you examine a 
prescription for eyeglasses for nearsighted people, you will find that the 
prescribed power is negative and given in units of diopters. 


Since the farsighted eye under converges light rays, the correction for 
farsightedness is to place a converging spectacle lens in front of the eye. 
This increases the power of an eye that is too weak. Another way of 
thinking about this is that a converging spectacle lens produces a case 2 
image, which is farther from the eye than the object (see [link]). To 
determine the spectacle power needed for correction, you must know the 
person’s near point—that is, you must know the smallest distance at which 
the person can see clearly. Then the image produced by a spectacle lens 
must be at this distance or farther for the farsighted person to be able to see 
it clearly. 


Image 


Correction of farsightedness 
uses a converging lens that 
compensates for the under 

convergence by the eye. The 

converging lens produces an 
image farther from the eye than 
the object, so that the farsighted 
person can see it clearly. 


Example: 

Correcting Farsightedness 

What power of spectacle lens is needed to allow a farsighted person, whose 
near point is 1.00 m, to see an object clearly that is 25.0 cm away? Assume 
the spectacle (corrective) lens is held 1.50 cm away from the eye by 
eyeglass frames. 

Strategy 

When an object is held 25.0 cm from the person’s eyes, the spectacle lens 
must produce an image 1.00 m away (the near point). An image 1.00 m 


from the eye will be 98.5 cm to the left of the spectacle lens because the 
spectacle lens is 1.50 cm from the eye (see [link]). Therefore, 

d, = —98.5 cm. The image distance is negative, because it is on the same 
side of the spectacle as the object. The object is 23.5 cm to the left of the 
spectacle, so that d, = 23.5 cm. 

Solution 

Since d; and d, are known, the power of the spectacle lens can be found 


using P = aa 4p +: 


Equation: 
1 1 1 1 
1 fo (= (phan ean 
= 4.26D — 1.02D =3.24 D. 
Discussion 


The positive power indicates a converging (convex) lens, as expected. The 
convex spectacle produces a case 2 image farther from the eye, where the 
person can see it. If you examine eyeglasses of farsighted people, you will 
find the lenses to be thickest in the center. In addition, a prescription of 
eyeglasses for farsighted people has a prescribed power that is positive. 


Another common vision defect is astigmatism, an unevenness or 
asymmetry in the focus of the eye. For example, rays passing through a 
vertical region of the eye may focus closer than rays passing through a 
horizontal region, resulting in the image appearing elongated. This is 
mostly due to irregularities in the shape of the cornea but can also be due to 
lens irregularities or unevenness in the retina. Because of these 
irregularities, different parts of the lens system produce images at different 
locations. The eye-brain system can compensate for some of these 
irregularities, but they generally manifest themselves as less distinct vision 
or sharper images along certain axes. [link] shows a chart used to detect 
astigmatism. Astigmatism can be at least partially corrected with a spectacle 
having the opposite irregularity of the eye. If an eyeglass prescription has a 
cylindrical correction, it is there to correct astigmatism. The normal 
corrections for short- or farsightedness are spherical corrections, uniform 
along all axes. 


WL 
ike 


This chart 
can detect 
astigmatism, 
unevenness 
in the focus 
of the eye. 
Check each 
of your eyes 
separately by 
looking at the 
center cross 
(without 
spectacles if 
you wear 
them). If 
lines along 
some axes 
appear darker 
or clearer 
than others, 
you have an 
astigmatism. 
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Contact lenses have advantages over glasses beyond their cosmetic aspects. 
One problem with glasses is that as the eye moves, it is not at a fixed 
distance from the spectacle lens. Contacts rest on and move with the eye, 
eliminating this problem. Because contacts cover a significant portion of the 


cornea, they provide superior peripheral vision compared with eyeglasses. 
Contacts also correct some comeal astigmatism caused by surface 
irregularities. The tear layer between the smooth contact and the cornea fills 
in the irregularities. Since the index of refraction of the tear layer and the 
comea are very similar, you now have a regular optical surface in place of 
an irregular one. If the curvature of a contact lens is not the same as the 
comea (as may be necessary with some individuals to obtain a comfortable 
fit), the tear layer between the contact and comea acts as a lens. If the tear 
layer is thinner in the center than at the edges, it has a negative power, for 
example. Skilled optometrists will adjust the power of the contact to 
compensate. 


Laser vision correction has progressed rapidly in the last few years. It is 
the latest and by far the most successful in a series of procedures that 
correct vision by reshaping the cornea. As noted at the beginning of this 
section, the cornea accounts for about two-thirds of the power of the eye. 
Thus, small adjustments of its curvature have the same effect as putting a 
lens in front of the eye. To a reasonable approximation, the power of 
multiple lenses placed close together equals the sum of their powers. For 
example, a concave spectacle lens (for nearsightedness) having 

P = —3.00 D has the same effect on vision as reducing the power of the 
eye itself by 3.00 D. So to correct the eye for nearsightedness, the cornea is 
flattened to reduce its power. Similarly, to correct for farsightedness, the 
curvature of the cormmea is enhanced to increase the power of the eye—the 
same effect as the positive power spectacle lens used for farsightedness. 
Laser vision correction uses high intensity electromagnetic radiation to 
ablate (to remove material from the surface) and reshape the corneal 
surfaces. 


Today, the most commonly used laser vision correction procedure is Laser 
in situ Keratomileusis (LASIK). The top layer of the cornea is surgically 
peeled back and the underlying tissue ablated by multiple bursts of finely 
controlled ultraviolet radiation produced by an excimer laser. Lasers are 
used because they not only produce well-focused intense light, but they also 
emit very pure wavelength electromagnetic radiation that can be controlled 
more accurately than mixed wavelength light. The 193 nm wavelength UV 
commonly used is extremely and strongly absorbed by corneal tissue, 


allowing precise evaporation of very thin layers. A computer controlled 
program applies more bursts, usually at a rate of 10 per second, to the areas 
that require deeper removal. Typically a spot less than 1 mm in diameter 
and about 0.3 pm in thickness is removed by each burst. Nearsightedness, 
farsightedness, and astigmatism can be corrected with an accuracy that 
produces normal distant vision in more than 90% of the patients, in many 
cases right away. The corneal flap is replaced; healing takes place rapidly 
and is nearly painless. More than 1 million Americans per year undergo 
LASIK (see [link]). 


Laser vision 


correction is 
being 
performed 
using the 
LASIK 
procedure. 
Reshaping of 
the cornea by 
laser ablation is 
based ona 
careful 
assessment of 
the patient’s 
vision and is 
computer 
controlled. The 


upper corneal 
layer is 
temporarily 
peeled back 
and minimally 
disturbed in 
LASIK, 
providing for 
more rapid and 
less painful 
healing of the 
less sensitive 
tissues below. 
(credit: U.S. 
Navy photo by 
Mass 
Communicatio 
n Specialist 1st 
Class Brien 
Aho) 


Section Summary 


e Nearsightedness, or myopia, is the inability to see distant objects and is 
corrected with a diverging lens to reduce power. 

e Farsightedness, or hyperopia, is the inability to see close objects and is 
corrected with a converging lens to increase power. 

e In myopia and hyperopia, the corrective lenses produce images at a 
distance that the person can see clearly—the far point and near point, 
respectively. 


Conceptual Questions 


Exercise: 


Problem: 


It has become common to replace the cataract-clouded lens of the eye 
with an internal lens. This intraocular lens can be chosen so that the 
person has perfect distant vision. Will the person be able to read 
without glasses? If the person was nearsighted, is the power of the 
intraocular lens greater or less than the removed lens? 


Exercise: 
Problem: 
If the cornea is to be reshaped (this can be done surgically or with 


contact lenses) to correct myopia, should its curvature be made greater 
or smaller? Explain. Also explain how hyperopia can be corrected. 


Exercise: 
Problem: 
If there is a fixed percent uncertainty in LASIK reshaping of the 
cornea, why would you expect those people with the greatest 


correction to have a poorer chance of normal distant vision after the 
procedure? 


Exercise: 


Problem: 

A person with presbyopia has lost some or all of the ability to 
accommodate the power of the eye. If such a person’s distant vision is 
corrected with LASIK, will she still need reading glasses? Explain. 


Problem Exercises 


Exercise: 


Problem: 


What is the far point of a person whose eyes have a relaxed power of 
50,5:192 


Solution: 


2.00 m 

Exercise: 
Problem: 
What is the near point of a person whose eyes have an accommodated 
power of 53.5 D? 

Exercise: 
Problem: 
(a) A laser vision correction reshaping the cornea of a myopic patient 
reduces the power of his eye by 9.00 D, with a +5.0% uncertainty in 
the final correction. What is the range of diopters for spectacle lenses 
that this person might need after LASIK procedure? (b) Was the 


person nearsighted or farsighted before the procedure? How do you 
know? 


Solution: 
(a) +0.45 D 
(b) The person was nearsighted because the patient was myopic and 
the power was reduced. 
Exercise: 
Problem: 
In a LASIK vision correction, the power of a patient’s eye is increased 


by 3.00 D. Assuming this produces normal close vision, what was the 
patient’s near point before the procedure? 


Exercise: 
Problem: 
What was the previous far point of a patient who had laser vision 


correction that reduced the power of her eye by 7.00 D, producing 
normal distant vision for her? 


Solution: 


0.143 m 
Exercise: 
Problem: 
A severely myopic patient has a far point of 5.00 cm. By how many 


diopters should the power of his eye be reduced in laser vision 
correction to obtain normal distant vision for him? 


Exercise: 
Problem: 


A student’s eyes, while reading the blackboard, have a power of 51.0 
D. How far is the board from his eyes? 


Solution: 


1.00 m 
Exercise: 
Problem: 
The power of a physician’s eyes is 53.0 D while examining a patient. 
How far from her eyes is the feature being examined? 


Exercise: 


Problem: 


A young woman with normal distant vision has a 10.0% ability to 
accommodate (that is, increase) the power of her eyes. What is the 
closest object she can see clearly? 


Solution: 


20.0 cm 
Exercise: 
Problem: 
The far point of a myopic administrator is 50.0 cm. (a) What is the 


relaxed power of his eyes? (b) If he has the normal 8.00% ability to 
accommodate, what is the closest object he can see clearly? 


Exercise: 
Problem: 


A very myopic man has a far point of 20.0 cm. What power contact 
lens (when on the eye) will correct his distant vision? 


Solution: 


—9.00 D 
Exercise: 
Problem: 
Repeat the previous problem for eyeglasses held 1.50 cm from the 
eyes. 
Exercise: 
Problem: 


A myopic person sees that her contact lens prescription is —4.00 D. 
What is her far point? 


Solution: 


25.0 cm 
Exercise: 
Problem: 
Repeat the previous problem for glasses that are 1.75 cm from the 
eyes. 
Exercise: 
Problem: 
The contact lens prescription for a mildly farsighted person is 0.750 D, 
and the person has a near point of 29.0 cm. What is the power of the 


tear layer between the cornea and the lens if the correction is ideal, 
taking the tear layer into account? 


Solution: 


—0.198 D 
Exercise: 
Problem: 
A nearsighted man cannot see objects clearly beyond 20 cm from his 


eyes. How close must he stand to a mirror in order to see what he is 
doing when he shaves? 


Exercise: 


Problem: 


A mother sees that her child’s contact lens prescription is 0.750 D. 
What is the child’s near point? 


Solution: 


30.8 cm 


Exercise: 
Problem: 
Repeat the previous problem for glasses that are 2.20 cm from the 
eyes. 

Exercise: 
Problem: 
The contact lens prescription for a nearsighted person is —4.00 D and 
the person has a far point of 22.5 cm. What is the power of the tear 


layer between the cornea and the lens if the correction is ideal, taking 
the tear layer into account? 


Solution: 


—0.444 D 


Exercise: 


Problem: Unreasonable Results 


A boy has a near point of 50 cm and a far point of 500 cm. Will a 
—4.00 D lens correct his far point to infinity? 


Glossary 


nearsightedness 
another term for myopia, a visual defect in which distant objects 
appear blurred because their images are focused in front of the retina 
rather than being focused on the retina 


myopia 
a visual defect in which distant objects appear blurred because their 
images are focused in front of the retina rather than being focused on 
the retina 


far point 
the object point imaged by the eye onto the retina in an 
unaccommodated eye 


farsightedness 
another term for hyperopia, the condition of an eye where incoming 
rays of light reach the retina before they converge into a focused image 


hyperopia 
the condition of an eye where incoming rays of light reach the retina 
before they converge into a focused image 


near point 
the point nearest the eye at which an object is accurately focused on 
the retina at full accommodation 


astigmatism 
the result of an inability of the cornea to properly focus an image onto 
the retina 


laser vision correction 
a medical procedure used to correct astigmatism and eyesight 
deficiencies such as myopia and hyperopia 


Color and Color Vision 


e Explain the simple theory of color vision. 
¢ Outline the coloring properties of light sources. 
e Describe the retinex theory of color vision. 


The gift of vision is made richer by the existence of color. Objects and 
lights abound with thousands of hues that stimulate our eyes, brains, and 
emotions. Two basic questions are addressed in this brief treatment—what 
does color mean in scientific terms, and how do we, as humans, perceive it? 


Simple Theory of Color Vision 


We have already noted that color is associated with the wavelength of 
visible electromagnetic radiation. When our eyes receive pure-wavelength 
light, we tend to see only a few colors. Six of these (most often listed) are 
red, orange, yellow, green, blue, and violet. These are the rainbow of colors 
produced when white light is dispersed according to different wavelengths. 
There are thousands of other hues that we can perceive. These include 
brown, teal, gold, pink, and white. One simple theory of color vision 
implies that all these hues are our eye’s response to different combinations 
of wavelengths. This is true to an extent, but we find that color perception is 
even subtler than our eye’s response for various wavelengths of light. 


The two major types of light-sensing cells (photoreceptors) in the retina are 
rods and cones. Rods are more sensitive than cones by a factor of about 
1000 and are solely responsible for peripheral vision as well as vision in 
very dark environments. They are also important for motion detection. 
There are about 120 million rods in the human retina. Rods do not yield 
color information. You may notice that you lose color vision when it is very 
dark, but you retain the ability to discern grey scales. 


Note: 
Take-Home Experiment: Rods and Cones 


1. Go into a darkened room from a brightly lit room, or from outside in 
the Sun. How long did it take to start seeing shapes more clearly? 
What about color? Return to the bright room. Did it take a few 
minutes before you could see things clearly? 

2. Demonstrate the sensitivity of foveal vision. Look at the letter G in 
the word ROGERS. What about the clarity of the letters on either side 
of G? 


Cones are most concentrated in the fovea, the central region of the retina. 
There are no rods here. The fovea is at the center of the macula, a5 mm 
diameter region responsible for our central vision. The cones work best in 
bright light and are responsible for high resolution vision. There are about 6 
million cones in the human retina. There are three types of cones, and each 
type is sensitive to different ranges of wavelengths, as illustrated in [link]. 
A simplified theory of color vision is that there are three primary colors 
corresponding to the three types of cones. The thousands of other hues that 
we can distinguish among are created by various combinations of 
stimulations of the three types of cones. Color television uses a three-color 
system in which the screen is covered with equal numbers of red, green, and 
blue phosphor dots. The broad range of hues a viewer sees is produced by 
various combinations of these three colors. For example, you will perceive 
yellow when red and green are illuminated with the correct ratio of 
intensities. White may be sensed when all three are illuminated. Then, it 
would seem that all hues can be produced by adding three primary colors in 
various proportions. But there is an indication that color vision is more 
sophisticated. There is no unique set of three primary colors. Another set 
that works is yellow, green, and blue. A further indication of the need for a 
more complex theory of color vision is that various different combinations 
can produce the same hue. Yellow can be sensed with yellow light, or with 
a combination of red and green, and also with white light from which violet 
has been removed. The three-primary-colors aspect of color vision is well 
established; more sophisticated theories expand on it rather than deny it. 
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The image shows the 
relative sensitivity of 
the three types of 
cones, which are 
named according to 
wavelengths of 
greatest sensitivity. 
Rods are about 1000 
times more sensitive, 
and their curve peaks 
at about 500 nm. 
Evidence for the three 
types of cones comes 
from direct 
measurements in 
animal and human 
eyes and testing of 
color blind people. 


Consider why various objects display color—that is, why are feathers blue 
and red in a crimson rosella? The true color of an object is defined by its 
absorptive or reflective characteristics. [link] shows white light falling on 
three different objects, one pure blue, one pure red, and one black, as well 
as pure red light falling on a white object. Other hues are created by more 


complex absorption characteristics. Pink, for example on a galah cockatoo, 
can be due to weak absorption of all colors except red. An object can appear 
a different color under non-white illumination. For example, a pure blue 
object illuminated with pure red light will appear black, because it absorbs 
all the red light falling on it. But, the true color of the object is blue, which 
is independent of illumination. 
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Absorption characteristics determine 
the true color of an object. Here, three 
objects are illuminated by white light, 
and one by pure red light. White is the 

equal mixture of all visible 
wavelengths; black is the absence of 
light. 


Similarly, light sources have colors that are defined by the wavelengths 
they produce. A helium-neon laser emits pure red light. In fact, the phrase 
“pure red light” is defined by having a sharp constrained spectrum, a 
characteristic of laser light. The Sun produces a broad yellowish spectrum, 
fluorescent lights emit bluish-white light, and incandescent lights emit 
reddish-white hues as seen in [link]. As you would expect, you sense these 
colors when viewing the light source directly or when illuminating a white 
object with them. All of this fits neatly into the simplified theory that a 
combination of wavelengths produces various hues. 


Note: 

Take-Home Experiment: Exploring Color Addition 

This activity is best done with plastic sheets of different colors as they 
allow more light to pass through to our eyes. However, thin sheets of paper 
and fabric can also be used. Overlay different colors of the material and 
hold them up to a white light. Using the theory described above, explain 
the colors you observe. You could also try mixing different crayon colors. 
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Emission spectra for various light 
sources are shown. Curve A is 
average sunlight at Earth’s surface, 
curve B is light from a fluorescent 
lamp, and curve C is the output of an 
incandescent light. The spike for a 
helium-neon laser (curve D) is due to 
its pure wavelength emission. The 
spikes in the fluorescent output are 
due to atomic spectra—a topic that 
will be explored later. 


Color Constancy and a Modified Theory of Color Vision 


The eye-brain color-sensing system can, by comparing various objects in its 
view, perceive the true color of an object under varying lighting conditions 
—an ability that is called color constancy. We can sense that a white 
tablecloth, for example, is white whether it is illuminated by sunlight, 
fluorescent light, or candlelight. The wavelengths entering the eye are quite 
different in each case, as the graphs in [link] imply, but our color vision can 
detect the true color by comparing the tablecloth with its surroundings. 


Theories that take color constancy into account are based on a large body of 
anatomical evidence as well as perceptual studies. There are nerve 
connections among the light receptors on the retina, and there are far fewer 
nerve connections to the brain than there are rods and cones. This means 
that there is signal processing in the eye before information is sent to the 
brain. For example, the eye makes comparisons between adjacent light 
receptors and is very sensitive to edges as seen in [link]. Rather than 
responding simply to the light entering the eye, which is uniform in the 
various rectangles in this figure, the eye responds to the edges and senses 
false darkness variations. 
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Although the 
grey strips are 
uniformly 
shaded, as 
indicated by the 
graph 
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they do not 
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at all. Instead, 
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perceived darker 
on the dark side 
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the edge, as 
shown in the 
bottom graph. 
This is due to 
nerve impulse 
processing in 
the eye. 


One theory that takes various factors into account was advanced by Edwin 
Land (1909 — 1991), the creative founder of the Polaroid Corporation. Land 
proposed, based partly on his many elegant experiments, that the three types 
of cones are organized into systems called retinexes. Each retinex forms an 
image that is compared with the others, and the eye-brain system thus can 
compare a candle-illuminated white table cloth with its generally reddish 
surroundings and determine that it is actually white. This retinex theory of 
color vision is an example of modified theories of color vision that attempt 
to account for its subtleties. One striking experiment performed by Land 
demonstrates that some type of image comparison may produce color 


vision. Two pictures are taken of a scene on black-and-white film, one using 
a red filter, the other a blue filter. Resulting black-and-white slides are then 
projected and superimposed on a screen, producing a black-and-white 
image, as expected. Then a red filter is placed in front of the slide taken 
with a red filter, and the images are again superimposed on a screen. You 
would expect an image in various shades of pink, but instead, the image 
appears to humans in full color with all the hues of the original scene. This 
implies that color vision can be induced by comparison of the black-and- 
white and red images. Color vision is not completely understood or 
explained, and the retinex theory is not totally accepted. It is apparent that 
color vision is much subtler than what a first look might imply. 


Note: 

PhET Explorations: Color Vision 

Make a whole rainbow by mixing red, green, and blue light. Change the 
wavelength of a monochromatic beam or filter white light. View the light 
as a solid beam, or see the individual photons. 


https://phet.colorado.edu/sims/html/color-vision/latest/color- 
vision _en.html 


Section Summary 


e The eye has four types of light receptors—rods and three types of 
color-sensitive cones. 

e The rods are good for night vision, peripheral vision, and motion 
changes, while the cones are responsible for central vision and color. 

e We perceive many hues, from light having mixtures of wavelengths. 

e A simplified theory of color vision states that there are three primary 
colors, which correspond to the three types of cones, and that various 
combinations of the primary colors produce all the hues. 

e The true color of an object is related to its relative absorption of 
various wavelengths of light. The color of a light source is related to 
the wavelengths it produces. 


¢ Color constancy is the ability of the eye-brain system to discern the 
true color of an object illuminated by various light sources. 

e The retinex theory of color vision explains color constancy by 
postulating the existence of three retinexes or image systems, 
associated with the three types of cones that are compared to obtain 
sophisticated information. 


Conceptual Questions 


Exercise: 
Problem: 
A pure red object on a black background seems to disappear when 
illuminated with pure green light. Explain why. 


Exercise: 


Problem: What is color constancy, and what are its limitations? 
Exercise: 

Problem: 

There are different types of color blindness related to the malfunction 

of different types of cones. Why would it be particularly useful to 


study those rare individuals who are color blind only in one eye or who 
have a different type of color blindness in each eye? 


Exercise: 


Problem: 
Propose a way to study the function of the rods alone, given they can 
sense light about 1000 times dimmer than the cones. 

Glossary 


hues 


identity of a color as it relates specifically to the spectrum 


rods and cones 
two types of photoreceptors in the human retina; rods are responsible 
for vision at low light levels, while cones are active at higher light 
levels 


simplified theory of color vision 
a theory that states that there are three primary colors, which 
correspond to the three types of cones 


color constancy 
a part of the visual perception system that allows people to perceive 
color in a variety of conditions and to see some consistency in the 
color 


retinex 
a theory proposed to explain color and brightness perception and 
constancies; is a combination of the words retina and cortex, which are 
the two areas responsible for the processing of visual information 


retinex theory of color vision 
the ability to perceive color in an ambient-colored environment 


Microscopes 


e Investigate different types of microscopes. 
¢ Learn how image is formed in a compound microscope. 


Although the eye is marvelous in its ability to see objects large and small, it 
obviously has limitations to the smallest details it can detect. Human desire 
to see beyond what is possible with the naked eye led to the use of optical 
instruments. In this section we will examine microscopes, instruments for 
enlarging the detail that we cannot see with the unaided eye. The 
microscope is a multiple-element system having more than a single lens or 
mirror. (See [link]) A microscope can be made from two convex lenses. The 
image formed by the first element becomes the object for the second 
element. The second element forms its own image, which is the object for 
the third element, and so on. Ray tracing helps to visualize the image 
formed. If the device is composed of thin lenses and mirrors that obey the 
thin lens equations, then it is not difficult to describe their behavior 
numerically. 


Multiple lenses and 
mirrors are used in this 
microscope. (credit: U.S. 
Navy photo by Tom 
Watanabe) 


Microscopes were first developed in the early 1600s by eyeglass makers in 
The Netherlands and Denmark. The simplest compound microscope is 


constructed from two convex lenses as shown schematically in [link]. The 
first lens is called the objective lens, and has typical magnification values 
from 5x to 100. In standard microscopes, the objectives are mounted 
such that when you switch between objectives, the sample remains in focus. 
Objectives arranged in this way are described as parfocal. The second, the 
eyepiece, also referred to as the ocular, has several lenses which slide inside 
a cylindrical barrel. The focusing ability is provided by the movement of 
both the objective lens and the eyepiece. The purpose of a microscope is to 
magnify small objects, and both lenses contribute to the final magnification. 
Additionally, the final enlarged image is produced in a location far enough 
from the observer to be easily viewed, since the eye cannot focus on objects 
or images that are too close. 


Eyepiece 


A compound microscope composed of two lenses, an 
objective and an eyepiece. The objective forms a case 1 
image that is larger than the object. This first image is 
the object for the eyepiece. The eyepiece forms a case 2 
final image that is further magnified. 


To see how the microscope in [link] forms an image, we consider its two 
lenses in succession. The object is slightly farther away from the objective 
lens than its focal length f,, producing a case 1 image that is larger than the 


object. This first image is the object for the second lens, or eyepiece. The 
eyepiece is intentionally located so it can further magnify the image. The 
eyepiece is placed so that the first image is closer to it than its focal length 
f.. Thus the eyepiece acts as a magnifying glass, and the final image is 
made even larger. The final image remains inverted, but it is farther from 
the observer, making it easy to view (the eye is most relaxed when viewing 
distant objects and normally cannot focus closer than 25 cm). Since each 
lens produces a magnification that multiplies the height of the image, it is 
apparent that the overall magnification m is the product of the individual 
magnifications: 

Equation: 


M = MMe, 


where m, is the magnification of the objective and m, is the magnification 
of the eyepiece. This equation can be generalized for any combination of 
thin lenses and mirrors that obey the thin lens equations. 


Note: 

Overall Magnification 

The overall magnification of a multiple-element system is the product of 
the individual magnifications of its elements. 


Example: 

Microscope Magnification 

Calculate the magnification of an object placed 6.20 mm from a compound 
microscope that has a 6.00 mm focal length objective and a 50.0 mm focal 
length eyepiece. The objective and eyepiece are separated by 23.0 cm. 
Strategy and Concept 

This situation is similar to that shown in [link]. To find the overall 
magnification, we must find the magnification of the objective, then the 
magnification of the eyepiece. This involves using the thin lens equation. 
Solution 


The magnification of the objective lens is given as 
Equation: 


where d, and d; are the object and image distances, respectively, for the 
objective lens as labeled in [link]. The object distance is given to be 

d, = 6.20 mm, but the image distance d; is not known. Isolating d;, we 
have 

Equation: 


where f, is the focal length of the objective lens. Substituting known 
values gives 
Equation: 

a 1 _ 1 _ 0.00538 

d; 600mm 620mm mm — 


We invert this to find d;: 
Equation: 


d; = 186 mm. 


Substituting this into the expression for m, gives 
Equation: 


SS = 
do 6.20 mm 


Now we must find the magnification of the eyepiece, which is given by 
Equation: 


where d;/ and d,/ are the image and object distances for the eyepiece (see 
[link]). The object distance is the distance of the first image from the 
eyepiece. Since the first image is 186 mm to the right of the objective and 
the eyepiece is 230 mm to the right of the objective, the object distance is 
d,/= 230 mm — 186 mm = 44.0 mm. This places the first image closer 
to the eyepiece than its focal length, so that the eyepiece will form a case 2 
image as shown in the figure. We still need to find the location of the final 
image d;/ in order to find the magnification. This is done as before to 
obtain a value for 1/dj/: 


Equation: 
ee ee 1 _ 1 _ 0.00273 
dit fe dt 500mm 440mm — mm 
Inverting gives 
Equation: 
mm 
SS = af aa, 
0.00273 
The eyepiece’s magnification is thus 
Equation: 
d;! —367 
iy Soe 2 8. 
d,! 44.0 mm 
So the overall magnification is 
Equation: 
M = MomMe = (—30.0)(8.33) = —250. 
Discussion 


Both the objective and the eyepiece contribute to the overall magnification, 
which is large and negative, consistent with [link], where the image is seen 
to be large and inverted. In this case, the image is virtual and inverted, 
which cannot happen for a single element (case 2 and case 3 images for 
single elements are virtual and upright). The final image is 367 mm (0.367 
m) to the left of the eyepiece. Had the eyepiece been placed farther from 


the objective, it could have formed a case 1 image to the right. Such an 
image could be projected on a screen, but it would be behind the head of 
the person in the figure and not appropriate for direct viewing. The 
procedure used to solve this example is applicable in any multiple-element 
system. Each element is treated in turn, with each forming an image that 
becomes the object for the next element. The process is not more difficult 
than for single lenses or mirrors, only lengthier. 


Normal optical microscopes can magnify up to 1500 with a theoretical 
resolution of —0.2 um. The lenses can be quite complicated and are 
composed of multiple elements to reduce aberrations. Microscope objective 
lenses are particularly important as they primarily gather light from the 
specimen. Three parameters describe microscope objectives: the numerical 
aperture (NA), the magnification (m), and the working distance. The NA 
is related to the light gathering ability of a lens and is obtained using the 
angle of acceptance 6 formed by the maximum cone of rays focusing on the 
specimen (see [link](a)) and is given by 

Equation: 


NA = nsina, 


where 7 is the refractive index of the medium between the lens and the 
specimen and a = 6/2. As the angle of acceptance given by 0 increases, 
NA becomes larger and more light is gathered from a smaller focal region 
giving higher resolution. A 0.75NA objective gives more detail than a 
0.10. A objective. 


(a) (b) 


(a) The numerical aperture (NA) of a 
microscope objective lens refers to the light- 
gathering ability of the lens and is calculated 

using half the angle of acceptance 0. (b) 
Here, a is half the acceptance angle for light 
rays from a specimen entering a camera 
lens, and D is the diameter of the aperture 
that controls the light entering the lens. 


While the numerical aperture can be used to compare resolutions of various 
objectives, it does not indicate how far the lens could be from the specimen. 
This is specified by the “working distance,” which is the distance (in mm 
usually) from the front lens element of the objective to the specimen, or 
cover glass. The higher the NA the closer the lens will be to the specimen 
and the more chances there are of breaking the cover slip and damaging 
both the specimen and the lens. The focal length of an objective lens is 
different than the working distance. This is because objective lenses are 
made of a combination of lenses and the focal length is measured from 
inside the barrel. The working distance is a parameter that microscopists 
can use more readily as it is measured from the outermost lens. The 
working distance decreases as the NA and magnification both increase. 


The term f/# in general is called the f-number and is used to denote the 
light per unit area reaching the image plane. In photography, an image of an 
object at infinity is formed at the focal point and the f-number is given by 
the ratio of the focal length f of the lens and the diameter D of the aperture 
controlling the light into the lens (see [link](b)). If the acceptance angle is 
small the NA of the lens can also be used as given below. 

Equation: 


ne 
Itt = 5 © ON” 


As the f-number decreases, the camera is able to gather light from a larger 
angle, giving wide-angle photography. As usual there is a trade-off. A 
greater f /# means less light reaches the image plane. A setting of f/16 
usually allows one to take pictures in bright sunlight as the aperture 
diameter is small. In optical fibers, light needs to be focused into the fiber. 
[link] shows the angle used in calculating the NA of an optical fiber. 
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Light rays enter an optical fiber. The numerical aperture 
of the optical fiber can be determined by using the angle 


Omax : 


Can the NA be larger than 1.00? The answer is ‘yes’ if we use immersion 
lenses in which a medium such as oil, glycerine or water is placed between 
the objective and the microscope cover slip. This minimizes the mismatch 
in refractive indices as light rays go through different media, generally 
providing a greater light-gathering ability and an increase in resolution. 
[link] shows light rays when using air and immersion lenses. 


objective 


(c) 


Light rays from a 
specimen entering the 
objective. Paths for 
immersion medium of 
air (a), water (b) 
(n = 1.33), and oil 
(c) (n = 1.51) are 
shown. The water and 
oil immersions allow 
more rays to enter the 
objective, increasing 
the resolution. 


When using a microscope we do not see the entire extent of the sample. 
Depending on the eyepiece and objective lens we see a restricted region 
which we say is the field of view. The objective is then manipulated in two- 
dimensions above the sample to view other regions of the sample. 
Electronic scanning of either the objective or the sample is used in scanning 
microscopy. The image formed at each point during the scanning is 
combined using a computer to generate an image of a larger region of the 
sample at a selected magnification. 


When using a microscope, we rely on gathering light to form an image. 
Hence most specimens need to be illuminated, particularly at higher 
magnifications, when observing details that are so small that they reflect 
only small amounts of light. To make such objects easily visible, the 
intensity of light falling on them needs to be increased. Special illuminating 


systems called condensers are used for this purpose. The type of condenser 
that is suitable for an application depends on how the specimen is 
examined, whether by transmission, scattering or reflecting. See [link] for 
an example of each. White light sources are common and lasers are often 
used. Laser light illumination tends to be quite intense and it is important to 
ensure that the light does not result in the degradation of the specimen. 
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Illumination of a specimen in a microscope. (a) 
Transmitted light from a condenser lens. (b) 
Transmitted light from a mirror condenser. (c) Dark 
field illumination by scattering (the illuminating beam 
misses the objective lens). (d) High magnification 
illumination with reflected light — normally laser 
light. 


We normally associate microscopes with visible light but x ray and electron 
microscopes provide greater resolution. The focusing and basic physics is 
the same as that just described, even though the lenses require different 
technology. The electron microscope requires vacuum chambers so that the 
electrons can proceed unheeded. Magnifications of 50 million times provide 
the ability to determine positions of individual atoms within materials. An 
electron microscope is shown in [link]. We do not use our eyes to form 
images; rather images are recorded electronically and displayed on 
computers. In fact observing and saving images formed by optical 
microscopes on computers is now done routinely. Video recordings of what 
occurs in a microscope can be made for viewing by many people at later 
dates. Physics provides the science and tools needed to generate the 
sequence of time-lapse images of meiosis similar to the sequence sketched 
in [link]. 


An electron microscope 
has the capability to 
image individual atoms 
on a material. The 


microscope uses vacuum 
technology, sophisticated 
detectors and state of the 
art image processing 
software. (credit: Dave 
Pape) 


The image shows a sequence of 
events that takes place during 
meiosis. (credit: PatriciaR, 
Wikimedia Commons; National 
Center for Biotechnology 
Information) 


Note: 

Take-Home Experiment: Make a Lens 

Look through a clear glass or plastic bottle and describe what you see. 
Now fill the bottle with water and describe what you see. Use the water 
bottle as a lens to produce the image of a bright object and estimate the 
focal length of the water bottle lens. How is the focal length a function of 
the depth of water in the bottle? 


Section Summary 


e The microscope is a multiple-element system having more than a 
single lens or mirror. 

e Many optical devices contain more than a single lens or mirror. These 
are analysed by considering each element sequentially. The image 
formed by the first is the object for the second, and so on. The same 
ray tracing and thin lens techniques apply to each lens element. 


e The overall magnification of a multiple-element system is the product 
of the magnifications of its individual elements. For a two-element 
system with an objective and an eyepiece, this is 
Equation: 


Mm = MoMe; 


where mM, is the magnification of the objective and m, is the 
magnification of the eyepiece, such as for a microscope. 

e Microscopes are instruments for allowing us to see detail we would not 
be able to see with the unaided eye and consist of a range of 
components. 

e The eyepiece and objective contribute to the magnification. The 
numerical aperture (NA) of an objective is given by 
Equation: 


NA = nsina 


where n is the refractive index and a the angle of acceptance. 

e Immersion techniques are often used to improve the light gathering 
ability of microscopes. The specimen is illuminated by transmitted, 
scattered or reflected light though a condenser. 

e The f /# describes the light gathering ability of a lens. It is given by 
Equation: 


Conceptual Questions 


Exercise: 
Problem: 
Geometric optics describes the interaction of light with macroscopic 


objects. Why, then, is it correct to use geometric optics to analyse a 
microscope’s image? 


Exercise: 
Problem: 
The image produced by the microscope in [link] cannot be projected. 
Could extra lenses or mirrors project it? Explain. 
Exercise: 
Problem: 
Why not have the objective of a microscope form a case 2 image with 


a large magnification? (Hint: Consider the location of that image and 
the difficulty that would pose for using the eyepiece as a magnifier.) 


Exercise: 


Problem: What advantages do oil immersion objectives offer? 
Exercise: 
Problem: 


How does the NA of a microscope compare with the NA of an optical 
fiber? 


Problem Exercises 


Exercise: 
Problem: 
A microscope with an overall magnification of 800 has an objective 
that magnifies by 200. (a) What is the magnification of the eyepiece? 
(b) If there are two other objectives that can be used, having 


magnifications of 100 and 400, what other total magnifications are 
possible? 


Solution: 


(a) 4.00 


(b) 1600 
Exercise: 
Problem: 
(a) What magnification is produced by a 0.150 cm focal length 
microscope objective that is 0.155 cm from the object being viewed? 


(b) What is the overall magnification if an 8x eyepiece (one that 
produces a magnification of 8.00) is used? 


Exercise: 
Problem: 
(a) Where does an object need to be placed relative to a microscope for 
its 0.500 cm focal length objective to produce a magnification of —400 


? (b) Where should the 5.00 cm focal length eyepiece be placed to 
produce a further fourfold (4.00) magnification? 


Solution: 
(a) 0.501 cm 


(b) Eyepiece should be 204 cm behind the objective lens. 
Exercise: 
Problem: 
You switch from a 1.40NA 60 oil immersion objective to a 
1.40N A 60 oil immersion objective. What are the acceptance angles 


for each? Compare and comment on the values. Which would you use 
first to locate the target area on your specimen? 


Exercise: 


Problem: 


An amoeba is 0.305 cm away from the 0.300 cm focal length objective 
lens of a microscope. (a) Where is the image formed by the objective 
lens? (b) What is this image’s magnification? (c) An eyepiece with a 
2.00 cm focal length is placed 20.0 cm from the objective. Where is 
the final image? (d) What magnification is produced by the eyepiece? 
(e) What is the overall magnification? (See [link].) 


Solution: 

(a) +18.3 cm (on the eyepiece side of the objective lens) 
(b) -60.0 

(c) -11.3 cm (on the objective side of the eyepiece) 

(d) +6.67 


(e) -400 
Exercise: 
Problem: 
You are using a standard microscope with a 0.10 NA 4x objective and 
switch to a0.65N A 40x objective. What are the acceptance angles 


for each? Compare and comment on the values. Which would you use 
first to locate the target area on of your specimen? (See [link].) 


Exercise: 


Problem: Unreasonable Results 


Your friends show you an image through a microscope. They tell you 
that the microscope has an objective with a 0.500 cm focal length and 
an eyepiece with a 5.00 cm focal length. The resulting overall 
magnification is 250,000. Are these viable values for a microscope? 


Glossary 


compound microscope 
a microscope constructed from two convex lenses, the first serving as 
the ocular lens(close to the eye) and the second serving as the 
objective lens 


objective lens 
the lens nearest to the object being examined 


eyepiece 
the lens or combination of lenses in an optical instrument nearest to the 
eye of the observer 


numerical aperture 
a number or measure that expresses the ability of a lens to resolve fine 
detail in an object being observed. Derived by mathematical formula 
Equation: 


NA = nsina, 


where n is the refractive index of the medium between the lens and the 
specimen and a = 6/2 


Telescopes 


e Outline the invention of a telescope. 
e Describe the working of a telescope. 


Telescopes are meant for viewing distant objects, producing an image that is 
larger than the image that can be seen with the unaided eye. Telescopes 
gather far more light than the eye, allowing dim objects to be observed with 
greater magnification and better resolution. Although Galileo is often 
credited with inventing the telescope, he actually did not. What he did was 
more important. He constructed several early telescopes, was the first to 
study the heavens with them, and made monumental discoveries using 
them. Among these are the moons of Jupiter, the craters and mountains on 
the Moon, the details of sunspots, and the fact that the Milky Way is 
composed of vast numbers of individual stars. 


[link](a) shows a telescope made of two lenses, the convex objective and 
the concave eyepiece, the same construction used by Galileo. Such an 
arrangement produces an upright image and is used in spyglasses and opera 
glasses. 


Incoming 
parallel rays 


Final image Objective Eyepiece 


(a) 


raat 
Object 


very 
distant 


~~" Eyepiece 


Final image 
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(a) Galileo made telescopes with a convex 
objective and a concave eyepiece. These produce 
an upright image and are used in spyglasses. (b) 
Most simple telescopes have two convex lenses. 
The objective forms a case 1 image that is the 
object for the eyepiece. The eyepiece forms a case 
2 final image that is magnified. 


The most common two-lens telescope, like the simple microscope, uses two 
convex lenses and is shown in [link](b). The object is so far away from the 
telescope that it is essentially at infinity compared with the focal lengths of 
the lenses (dy & oo). The first image is thus produced at d; = fy, as shown 
in the figure. To prove this, note that 

Equation: 


1 | 


df 


1 
dy fo 
Because 1/co = 0, this simplifies to 
Equation: 


which implies that d; = f,, as claimed. It is true that for any distant object 
and any lens or mirror, the image is at the focal length. 


The first image formed by a telescope objective as seen in [link](b) will not 
be large compared with what you might see by looking at the object 
directly. For example, the spot formed by sunlight focused on a piece of 
paper by a magnifying glass is the image of the Sun, and it is small. The 
telescope eyepiece (like the microscope eyepiece) magnifies this first 
image. The distance between the eyepiece and the objective lens is made 
slightly less than the sum of their focal lengths so that the first image is 
closer to the eyepiece than its focal length. That is, d,/ is less than f., and 
so the eyepiece forms a case 2 image that is large and to the left for easy 
viewing. If the angle subtended by an object as viewed by the unaided eye 
is 8, and the angle subtended by the telescope image is 0/, then the angular 
magnification 1 is defined to be their ratio. That is, M = 67/6. It can be 
shown that the angular magnification of a telescope is related to the focal 
lengths of the objective and eyepiece; and is given by 

Equation: 


The minus sign indicates the image is inverted. To obtain the greatest 
angular magnification, it is best to have a long focal length objective and a 
short focal length eyepiece. The greater the angular magnification /, the 
larger an object will appear when viewed through a telescope, making more 


details visible. Limits to observable details are imposed by many factors, 
including lens quality and atmospheric disturbance. 


The image in most telescopes is inverted, which is unimportant for 
observing the stars but a real problem for other applications, such as 
telescopes on ships or telescopic gun sights. If an upright image is needed, 
Galileo’s arrangement in [link](a) can be used. But a more common 
arrangement is to use a third convex lens as an eyepiece, increasing the 
distance between the first two and inverting the image once again as seen in 
[link]. 


Second | | 


image 
\u/ image Pe 


Objective Erecting Eyepiece 
lens 


This arrangement of three lenses in a telescope produces an 
upright final image. The first two lenses are far enough apart 
that the second lens inverts the image of the first one more 
time. The third lens acts as a magnifier and keeps the image 
upright and in a location that is easy to view. 


A telescope can also be made with a concave mirror as its first element or 
objective, since a concave mirror acts like a convex lens as seen in [link]. 
Flat mirrors are often employed in optical instruments to make them more 
compact or to send light to cameras and other sensing devices. There are 
many advantages to using mirrors rather than lenses for telescope 
objectives. Mirrors can be constructed much larger than lenses and can, 
thus, gather large amounts of light, as needed to view distant galaxies, for 
example. Large and relatively flat mirrors have very long focal lengths, so 
that great angular magnification is possible. 


Concave 
mirror 
(objective) 


Eyepiece , 
(lens) 


A two-element 
telescope 
composed of a 
mirror as the 
objective and a lens 
for the eyepiece is 
shown. This 
telescope forms an 
image in the same 
manner as the two- 
convex-lens 
telescope already 
discussed, but it 
does not suffer 
from chromatic 
aberrations. Such 
telescopes can 
gather more light, 
since larger mirrors 
than lenses can be 
constructed. 


Telescopes, like microscopes, can utilize a range of frequencies from the 
electromagnetic spectrum. [link](a) shows the Australia Telescope Compact 


Array, which uses six 22-m antennas for mapping the southern skies using 
radio waves. [link](b) shows the focusing of x rays on the Chandra X-ray 
Observatory—a satellite orbiting earth since 1999 and looking at high 
temperature events as exploding stars, quasars, and black holes. X rays, 
with much more energy and shorter wavelengths than RF and light, are 
mainly absorbed and not reflected when incident perpendicular to the 
medium. But they can be reflected when incident at small glancing angles, 
much like a rock will skip on a lake if thrown at a small angle. The mirrors 
for the Chandra consist of a long barrelled pathway and 4 pairs of mirrors to 
focus the rays at a point 10 meters away from the entrance. The mirrors are 
extremely smooth and consist of a glass ceramic base with a thin coating of 
metal (iridium). Four pairs of precision manufactured mirrors are 
exquisitely shaped and aligned so that x rays ricochet off the mirrors like 
bullets off a wall, focusing on a spot. 


(a) The Australia 
Telescope Compact Array 
at Narrabri (500 km NW 
of Sydney). (credit: Ian 


Bailey) (b) The focusing 
of x rays on the Chandra 
Observatory, a satellite 
orbiting earth. X rays 
ricochet off 4 pairs of 
mirrors forming a 
barrelled pathway leading 
to the focus point. (credit: 
NASA) 


A current exciting development is a collaborative effort involving 17 
countries to construct a Square Kilometre Array (SKA) of telescopes 
capable of covering from 80 MHz to 2 GHz. The initial stage of the project 
is the construction of the Australian Square Kilometre Array Pathfinder in 
Western Australia (see [link]). The project will use cutting-edge 
technologies such as adaptive optics in which the lens or mirror is 
constructed from lots of carefully aligned tiny lenses and mirrors that can 
be manipulated using computers. A range of rapidly changing distortions 
can be minimized by deforming or tilting the tiny lenses and mirrors. The 
use of adaptive optics in vision correction is a current area of research. 


les == 


An artist’s impression of the 
Australian Square Kilometre 
Array Pathfinder in Western 


Australia is displayed. (credit: 
SPDO, XILOSTUDIOS) 


Section Summary 


e Simple telescopes can be made with two lenses. They are used for 
viewing objects at large distances and utilize the entire range of the 
electromagnetic spectrum. 

e The angular magnification M for a telescope is given by 
Equation: 


ee 
0 Tes 


where @ is the angle subtended by an object viewed by the unaided 
eye, 0/ is the angle subtended by a magnified image, and f, and f, are 
the focal lengths of the objective and the eyepiece. 


Conceptual Questions 


Exercise: 


Problem: 

If you want your microscope or telescope to project a real image onto a 
screen, how would you change the placement of the eyepiece relative 
to the objective? 


Problem Exercises 


Unless otherwise stated, the lens-to-retina distance is 2.00 cm. 
Exercise: 


Problem: 


What is the angular magnification of a telescope that has a 100 cm 
focal length objective and a 2.50 cm focal length eyepiece? 


Solution: 


—A40.0 
Exercise: 
Problem: 
Find the distance between the objective and eyepiece lenses in the 
telescope in the above problem needed to produce a final image very 


far from the observer, where vision is most relaxed. Note that a 
telescope is normally used to view very distant objects. 


Exercise: 
Problem: 
A large reflecting telescope has an objective mirror with a 10.0 m 


radius of curvature. What angular magnification does it produce when 
a 3.00 m focal length eyepiece is used? 


Solution: 


—1.67 

Exercise: 
Problem: 
A small telescope has a concave mirror with a 2.00 m radius of 
curvature for its objective. Its eyepiece is a 4.00 cm focal length lens. 
(a) What is the telescope’s angular magnification? (b) What angle is 


subtended by a 25,000 km diameter sunspot? (c) What is the angle of 
its telescopic image? 


Exercise: 


Problem: 


A 7.5x binocular produces an angular magnification of —’7.50, acting 
like a telescope. (Mirrors are used to make the image upright.) If the 
binoculars have objective lenses with a 75.0 cm focal length, what is 
the focal length of the eyepiece lenses? 


Solution: 


+10.0 cm 


Exercise: 


Problem: Construct Your Own Problem 


Consider a telescope of the type used by Galileo, having a convex 
objective and a concave eyepiece as illustrated in [link ](a). Construct a 
problem in which you calculate the location and size of the image 
produced. Among the things to be considered are the focal lengths of 
the lenses and their relative placements as well as the size and location 
of the object. Verify that the angular magnification is greater than one. 
That is, the angle subtended at the eye by the image is greater than the 
angle subtended by the object. 


Glossary 


adaptive optics 
optical technology in which computers adjust the lenses and mirrors in 
a device to correct for image distortions 


angular magnification 
a ratio related to the focal lengths of the objective and eyepiece and 


given as M = o4 


Aberrations 
e Describe optical aberration. 


Real lenses behave somewhat differently from how they are modeled using 
the thin lens equations, producing aberrations. An aberration is a distortion 
in an image. There are a variety of aberrations due to a lens size, material, 
thickness, and position of the object. One common type of aberration is 
chromatic aberration, which is related to color. Since the index of refraction 
of lenses depends on color or wavelength, images are produced at different 
places and with different magnifications for different colors. (The law of 
reflection is independent of wavelength, and so mirrors do not have this 
problem. This is another advantage for mirrors in optical systems such as 
telescopes.) [link](a) shows chromatic aberration for a single convex lens 
and its partial correction with a two-lens system. Violet rays are bent more 
than red, since they have a higher index of refraction and are thus focused 
closer to the lens. The diverging lens partially corrects this, although it is 
usually not possible to do so completely. Lenses of different materials and 
having different dispersions may be used. For example an achromatic 
doublet consisting of a converging lens made of crown glass and a 
diverging lens made of flint glass in contact can dramatically reduce 
chromatic aberration (see [link](b)). 


Quite often in an imaging system the object is off-center. Consequently, 
different parts of a lens or mirror do not refract or reflect the image to the 
same point. This type of aberration is called a coma and is shown in [Link]. 
The image in this case often appears pear-shaped. Another common 
aberration is spherical aberration where rays converging from the outer 
edges of a lens converge to a focus closer to the lens and rays closer to the 
axis focus further (see [link]). Aberrations due to astigmatism in the lenses 
of the eyes are discussed in Vision Correction, and a chart used to detect 
astigmatism is shown in [link]. Such aberrations and can also be an issue 
with manufactured lenses. 


(a) Chromatic aberration is 
caused by the dependence of a 
lens’s index of refraction on 
color (wavelength). The lens is 
more powerful for violet (V) 
than for red (R), producing 
images with different locations 
and magnifications. (b) 
Multiple-lens systems can 
partially correct chromatic 
aberrations, but they may 
require lenses of different 
materials and add to the 
expense of optical systems such 
as cameras. 


A coma is an 
aberration caused by 
an object that is off- 

center, often resulting 
in a pear-shaped 
image. The rays 
originate from points 
that are not on the 
optical axis and they 
do not converge at one 
common focal point. 


— 


Spherical aberration is 
caused by rays 
focusing at different 
distances from the 
lens. 


The image produced by an optical system needs to be bright enough to be 
discerned. It is often a challenge to obtain a sufficiently bright image. The 
brightness is determined by the amount of light passing through the optical 
system. The optical components determining the brightness are the diameter 
of the lens and the diameter of pupils, diaphragms or aperture stops placed 


in front of lenses. Optical systems often have entrance and exit pupils to 
specifically reduce aberrations but they inevitably reduce brightness as 
well. Consequently, optical systems need to strike a balance between the 
various components used. The iris in the eye dilates and constricts, acting as 
an entrance pupil. You can see objects more clearly by looking through a 
small hole made with your hand in the shape of a fist. Squinting, or using a 
small hole in a piece of paper, also will make the object sharper. 


So how are aberrations corrected? The lenses may also have specially 
shaped surfaces, as opposed to the simple spherical shape that is relatively 
easy to produce. Expensive camera lenses are large in diameter, so that they 
can gather more light, and need several elements to correct for various 
aberrations. Further, advances in materials science have resulted in lenses 
with a range of refractive indices—technically referred to as graded index 
(GRIN) lenses. Spectacles often have the ability to provide a range of 
focusing ability using similar techniques. GRIN lenses are particularly 
important at the end of optical fibers in endoscopes. Advanced computing 
techniques allow for a range of corrections on images after the image has 
been collected and certain characteristics of the optical system are known. 
Some of these techniques are sophisticated versions of what are available 
on commercial packages like Adobe Photoshop. 


Section Summary 


e Aberrations or image distortions can arise due to the finite thickness of 
optical instruments, imperfections in the optical components, and 
limitations on the ways in which the components are used. 

e The means for correcting aberrations range from better components to 
computational techniques. 


Conceptual Questions 


Exercise: 


Problem: 


List the various types of aberrations. What causes them and how can 
each be reduced? 


Problem Exercises 


Exercise: 


Problem: Integrated Concepts 


(a) During laser vision correction, a brief burst of 193 nm ultraviolet 
light is projected onto the cornea of the patient. It makes a spot 1.00 
mm in diameter and deposits 0.500 mJ of energy. Calculate the depth 
of the layer ablated, assuming the corneal tissue has the same 
properties as water and is initially at ° . The tissue’s temperature 
is increased to ° and evaporated without further temperature 
increase. 


(b) Does your answer imply that the shape of the comea can be finely 
controlled? 


Solution: 


(a) UW 


(b) Yes, this thickness implies that the shape of the cornea can be very 
finely controlled, producing normal distant vision in more than 90% of 
patients. 


Glossary 
aberration 


failure of rays to converge at one focus because of limitations or 
defects in a lens or mirror 


Introduction to Wave Optics 
class="introduction" 


The colors 
reflected 
by this 
compact 
disc vary 
with angle 
and are 
not caused 
by 
pigments. 
Colors 
such as 
these are 
direct 
evidence 
of the 
wave 
character 
of light. 
(credit: 
Infopro, 
Wikimedi 
a 
Commons 


) 


Examine a compact disc under white light, noting the colors observed and 
locations of the colors. Determine if the spectra are formed by diffraction 
from circular lines centered at the middle of the disc and, if so, what is their 
spacing. If not, determine the type of spacing. Also with the CD, explore 
the spectra of a few light sources, such as a candle flame, incandescent 
bulb, halogen light, and fluorescent light. Knowing the spacing of the rows 
of pits in the compact disc, estimate the maximum spacing that will allow 
the given number of megabytes of information to be stored. 


If you have ever looked at the reds, blues, and greens in a sunlit soap bubble 
and wondered how straw-colored soapy water could produce them, you 
have hit upon one of the many phenomena that can only be explained by the 
wave character of light (see [link]). The same is true for the colors seen in 
an oil slick or in the light reflected from a compact disc. These and other 
interesting phenomena, such as the dispersion of white light into a rainbow 
of colors when passed through a narrow slit, cannot be explained fully by 
geometric optics. In these cases, light interacts with small objects and 
exhibits its wave characteristics. The branch of optics that considers the 


behavior of light when it exhibits wave characteristics (particularly when it 
interacts with small objects) is called wave optics (sometimes called 
physical optics). It is the topic of this chapter. 


These soap bubbles exhibit 
brilliant colors when exposed to 
sunlight. How are the colors 
produced if they are not 
pigments in the soap? (credit: 
Scott Robinson, Flickr) 


The Wave Aspect of Light: Interference 


e Discuss the wave character of light. 
e Identify the changes when light enters a medium. 


We know that visible light is the type of electromagnetic wave to which our 
eyes respond. Like all other electromagnetic waves, it obeys the equation 
Equation: 


c fA 


where c is the speed of light in vacuum, f is the frequency 
of the electromagnetic waves, and J is its wavelength. The range of visible 
wavelengths is approximately 380 to 760 nm. As is true for all waves, light 
travels in straight lines and acts like a ray when it interacts with objects 
several times as large as its wavelength. However, when it interacts with 
smaller objects, it displays its wave characteristics prominently. Interference 
is the hallmark of a wave, and in [link] both the ray and wave 
characteristics of light can be seen. The laser beam emitted by the 
observatory epitomizes a ray, traveling in a straight line. However, passing 
a pure-wavelength beam through vertical slits with a size close to the 
wavelength of the beam reveals the wave character of light, as the beam 
spreads out horizontally into a pattern of bright and dark regions caused by 
systematic constructive and destructive interference. Rather than spreading 
out, a ray would continue traveling straight ahead after passing through 
Slits. 


Note: 

Making Connections: Waves 

The most certain indication of a wave is interference. This wave 
characteristic is most prominent when the wave interacts with an object 
that is not large compared with the wavelength. Interference is observed 
for water waves, sound waves, light waves, and (as we will see in Special 
Relativity) for matter waves, such as electrons scattered from a crystal. 


(a) The laser beam 
emitted by an observatory 
acts like a ray, traveling 
in a Straight line. This 
laser beam is from the 
Paranal Observatory of 
the European Southern 
Observatory. (credit: Yuri 
Beletsky, European 
Southern Observatory) 
(b) A laser beam passing 
through a grid of vertical 
slits produces an 
interference pattern— 
characteristic of a wave. 
(credit: Shim'on and 
Slava Rybka, Wikimedia 
Commons) 


Light has wave characteristics in various media as well as in a vacuum. 
When light goes from a vacuum to some medium, like water, its speed and 


wavelength change, but its frequency f remains the same. (We can think of 
light as a forced oscillation that must have the frequency of the original 
source.) The speed of light ina mediumisv cc n, where n is its index of 
refraction. If we divide both sides of equationc fA by n, we get 

cn vu fX n. This implies thatv fA, where A is the wavelength 
in a medium and that 

Equation: 


a 
; ee 
n 


where A is the wavelength in vacuum and n is the medium’s index of 
refraction. Therefore, the wavelength of light is smaller in any medium than 


it is in vacuum. In water, for example, which has n , the range of 
visible wavelengths is to , Or 
» . Although wavelengths change while traveling from 


one medium to another, colors do not, since colors are associated with 
frequency. 


Section Summary 


e Wave optics is the branch of optics that must be used when light 
interacts with small objects or whenever the wave characteristics of 
light are considered. 

e Wave characteristics are those associated with interference and 
diffraction. 

e Visible light is the type of electromagnetic wave to which our eyes 
respond and has a wavelength in the range of 380 to 760 nm. 

e Like all EM waves, the following relationship is valid in vacuum: 


c fA, where c is the speed of light, f is the 
frequency of the electromagnetic wave, and J is its wavelength in 
vacuum. 


e The wavelength A of light in a medium with index of refraction 7 is 
A An. Its frequency is the same as in vacuum. 


Conceptual Questions 


Exercise: 


Problem: 


What type of experimental evidence indicates that light is a wave? 
Exercise: 
Problem: 


Give an example of a wave characteristic of light that is easily 
observed outside the laboratory. 


Problems & Exercises 


Exercise: 
Problem: 


Show that when light passes from air to water, its wavelength 
decreases to 0.750 times its original value. 


Solution: 


Exercise: 


Problem: 


Find the range of visible wavelengths of light in crown glass. 
Exercise: 

Problem: 

What is the index of refraction of a material for which the wavelength 


of light is 0.671 times its value in a vacuum? Identify the likely 
substance. 


Solution: 


1.49, Polystyrene 
Exercise: 
Problem: 
Analysis of an interference effect in a clear solid shows that the 
wavelength of light in the solid is 329 nm. Knowing this light comes 


from a He-Ne laser and has a wavelength of 633 nm in air, is the 
substance zircon or diamond? 


Exercise: 


Problem: 


What is the ratio of thicknesses of crown glass and water that would 
contain the same number of wavelengths of light? 


Solution: 


0.877 glass to water 


Glossary 


wavelength in a medium 
Xr » n, where A is the wavelength in vacuum, and n is the index of 
refraction of the medium 


Huygens's Principle: Diffraction 


e Discuss the propagation of transverse waves. 
e Discuss Huygens’s principle. 
e Explain the bending of light. 


[link] shows how a transverse wave looks as viewed from above and from 
the side. A light wave can be imagined to propagate like this, although we 
do not actually see it wiggling through space. From above, we view the 
wavefronts (or wave crests) as we would by looking down on the ocean 
waves. The side view would be a graph of the electric or magnetic field. 
The view from above is perhaps the most useful in developing concepts 
about wave optics. 


View from above View from side 


Overall view 


A transverse wave, such as an 
electromagnetic wave like light, 
as viewed from above and from 

the side. The direction of 
propagation is perpendicular to 
the wavefronts (or wave crests) 
and is represented by an arrow 
like a ray. 


The Dutch scientist Christiaan Huygens (1629-1695) developed a useful 
technique for determining in detail how and where waves propagate. 


Starting from some known position, Huygens’s principle states that: 


Every point on a wavefront is a source of wavelets that spread out in 
the forward direction at the same speed as the wave itself. The new 
wavefront is a line tangent to all of the wavelets. 


[link] shows how Huygens’s principle is applied. A wavefront is the long 
edge that moves, for example, the crest or the trough. Each point on the 
wavefront emits a semicircular wave that moves at the propagation speed v. 
These are drawn at a time ¢ later, so that they have moved a distance s = vt 
. The new wavefront is a line tangent to the wavelets and is where we 
would expect the wave to be a time ¢ later. Huygens’s principle works for 
all types of waves, including water waves, sound waves, and light waves. 
We will find it useful not only in describing how light waves propagate, but 
also in explaining the laws of reflection and refraction. In addition, we will 
see that Huygens’s principle tells us how and where light rays interfere. 


New wavefront 


Old wavefront 


Huygens’s 
principle 
applied to a 
straight 
wavefront. 
Each point 
on the 
wavefront 


emits a 
semicircular 
wavelet that 

moves a 

distance 

s =vt. The 
new 
wavefront is 
a line tangent 
to the 
wavelets. 


[link] shows how a mirror reflects an incoming wave at an angle equal to 
the incident angle, verifying the law of reflection. As the wavefront strikes 
the mirror, wavelets are first emitted from the left part of the mirror and 
then the right. The wavelets closer to the left have had time to travel farther, 
producing a wavefront traveling in the direction shown. 


Huygens’s principle 
applied to a straight 
wavefront striking a 
mirror. The wavelets 
shown were emitted as 
each point on the 
wavefront struck the 
mirror. The tangent to 
these wavelets shows 


that the new wavefront 
has been reflected at 
an angle equal to the 
incident angle. The 
direction of 
propagation is 
perpendicular to the 
wavefront, as shown 
by the downward- 
pointing arrows. 


The law of refraction can be explained by applying Huygens’s principle to a 
wavefront passing from one medium to another (see [link]). Each wavelet 
in the figure was emitted when the wavefront crossed the interface between 
the media. Since the speed of light is smaller in the second medium, the 
waves do not travel as far in a given time, and the new wavefront changes 
direction as shown. This explains why a ray changes direction to become 
closer to the perpendicular when light slows down. Snell’s law can be 
derived from the geometry in [link], but this is left as an exercise for 


ambitious readers. 
6, 4 


Surface Medium 1 


Medium 2 


Huygens’s principle 
applied to a straight 
wavefront traveling from 
one medium to another 
where its speed is less. 
The ray bends toward the 
perpendicular, since the 


wavelets have a lower 
speed in the second 
medium. 


What happens when a wave passes through an opening, such as light 
shining through an open door into a dark room? For light, we expect to see 
a sharp shadow of the doorway on the floor of the room, and we expect no 
light to bend around corners into other parts of the room. When sound 
passes through a door, we expect to hear it everywhere in the room and, 
thus, expect that sound spreads out when passing through such an opening 
(see [link]). What is the difference between the behavior of sound waves 
and light waves in this case? The answer is that light has very short 
wavelengths and acts like a ray. Sound has wavelengths on the order of the 
size of the door and bends around corners (for frequency of 1000 Hz, 

A = c/f = (330 m/s)/(1000 s-!) = 0.33 m, about three times smaller 
than the width of the doorway). 


Straight- Sound 
edge 
shadows 


Plane 
wavefront 


of sound S& @ 


Listener hears sound 
around the corner 


Wall with doorway Same wall and doorway 


(a) (b) 


(a) Light passing through a doorway 
makes a sharp outline on the floor. Since 
light’s wavelength is very small 
compared with the size of the door, it acts 
like a ray. (b) Sound waves bend into all 
parts of the room, a wave effect, because 
their wavelength is similar to the size of 
the door. 


If we pass light through smaller openings, often called slits, we can use 

Huygens’s principle to see that light bends as sound does (see [link]). The 
bending of a wave around the edges of an opening or an obstacle is called 
diffraction. Diffraction is a wave characteristic and occurs for all types of 
waves. If diffraction is observed for some phenomenon, it is evidence that 
the phenomenon is a wave. Thus the horizontal diffraction of the laser beam 
after it passes through slits in [link] is evidence that light is a wave. 


he 


Huygens’s principle 
applied to a straight 
wavefront striking an 
opening. The edges of the 
wavefront bend after 
passing through the 
opening, a process called 
diffraction. The amount 
of bending is more 
extreme for a small 
opening, consistent with 
the fact that wave 
characteristics are most 
noticeable for interactions 
with objects about the 
Same size as the 
wavelength. 


Section Summary 


e An accurate technique for determining how and where waves 
propagate is given by Huygens’s principle: Every point on a wavefront 
is a source of wavelets that spread out in the forward direction at the 
same speed as the wave itself. The new wavefront is a line tangent to 
all of the wavelets. 

e Diffraction is the bending of a wave around the edges of an opening or 
other obstacle. 


Conceptual Questions 


Exercise: 
Problem: 
How do wave effects depend on the size of the object with which the 


wave interacts? For example, why does sound bend around the corner 
of a building while light does not? 


Exercise: 


Problem: 


Under what conditions can light be modeled like a ray? Like a wave? 
Exercise: 

Problem: 

Go outside in the sunlight and observe your shadow. It has fuzzy edges 

even if you do not. Is this a diffraction effect? Explain. 
Exercise: 

Problem: 

Why does the wavelength of light decrease when it passes from 


vacuum into a medium? State which attributes change and which stay 
the same and, thus, require the wavelength to decrease. 


Exercise: 


Problem: Does Huygens’s principle apply to all types of waves? 


Glossary 


diffraction 
the bending of a wave around the edges of an opening or an obstacle 


Huygens’s principle 
every point on a wavefront is a source of wavelets that spread out in 
the forward direction at the same speed as the wave itself. The new 
wavefront is a line tangent to all of the wavelets 


Young’s Double Slit Experiment 


e Explain the phenomena of interference. 
e Define constructive interference for a double slit and destructive 
interference for a double slit. 


Although Christiaan Huygens thought that light was a wave, Isaac Newton 
did not. Newton felt that there were other explanations for color, and for the 
interference and diffraction effects that were observable at the time. Owing 
to Newton’s tremendous stature, his view generally prevailed. The fact that 
Huygens’s principle worked was not considered evidence that was direct 
enough to prove that light is a wave. The acceptance of the wave character 
of light came many years later when, in 1801, the English physicist and 
physician Thomas Young (1773-1829) did his now-classic double slit 
experiment (see [link]). 

> 


a 


Young’s double slit 
experiment. Here 
pure-wavelength 

light sent through a 

pair of vertical slits 

is diffracted into a 

pattern on the 
screen of numerous 
vertical lines spread 
out horizontally. 

Without diffraction 
and interference, 

the light would 


simply make two 
lines on the screen. 


Why do we not ordinarily observe wave behavior for light, such as 
observed in Young’s double slit experiment? First, light must interact with 
something small, such as the closely spaced slits used by Young, to show 
pronounced wave effects. Furthermore, Young first passed light from a 
single source (the Sun) through a single slit to make the light somewhat 
coherent. By coherent, we mean waves are in phase or have a definite 
phase relationship. Incoherent means the waves have random phase 
relationships. Why did Young then pass the light through a double slit? The 
answer to this question is that two slits provide two coherent light sources 
that then interfere constructively or destructively. Young used sunlight, 
where each wavelength forms its own pattern, making the effect more 
difficult to see. We illustrate the double slit experiment with monochromatic 
(single A) light to clarify the effect. [link] shows the pure constructive and 
destructive interference of two waves having the same wavelength and 
amplitude. 


Wave 1 


Wave 2 


Resultant 


(b) 


The amplitudes of waves 
add. (a) Pure constructive 
interference is obtained 
when identical waves are in 
phase. (b) Pure destructive 
interference occurs when 
identical waves are exactly 
out of phase, or shifted by 
half a wavelength. 


When light passes through narrow slits, it is diffracted into semicircular 
waves, as shown in [link ](a). Pure constructive interference occurs where 
the waves are crest to crest or trough to trough. Pure destructive 
interference occurs where they are crest to trough. The light must fall on a 
screen and be scattered into our eyes for us to see the pattern. An analogous 
pattern for water waves is shown in [link](b). Note that regions of 
constructive and destructive interference move out from the slits at well- 
defined angles to the original beam. These angles depend on wavelength 
and the distance between the slits, as we shall see below. 


Screen 


(a) (b) (c) 


Double slits produce two coherent sources of waves that 
interfere. (a) Light spreads out (diffracts) from each slit, 
because the slits are narrow. These waves overlap and 


interfere constructively (bright lines) and destructively 
(dark regions). We can only see this if the light falls onto 
a screen and is scattered into our eyes. (b) Double slit 
interference pattern for water waves are nearly identical 
to that for light. Wave action is greatest in regions of 
constructive interference and least in regions of 
destructive interference. (c) When light that has passed 
through double slits falls on a screen, we see a pattern 
such as this. (credit: PASCO) 


To understand the double slit interference pattern, we consider how two 
waves travel from the slits to the screen, as illustrated in [link]. Each slit is a 
different distance from a given point on the screen. Thus different numbers 
of wavelengths fit into each path. Waves start out from the slits in phase 
(crest to crest), but they may end up out of phase (crest to trough) at the 
screen if the paths differ in length by half a wavelength, interfering 
destructively as shown in [link](a). If the paths differ by a whole 
wavelength, then the waves arrive in phase (crest to crest) at the screen, 
interfering constructively as shown in [link](b). More generally, if the paths 
taken by the two waves differ by any half-integral number of wavelengths [ 
(1/2)A, (3/2)A, (5/2)A, etc.], then destructive interference occurs. 
Similarly, if the paths taken by the two waves differ by any integral number 
of wavelengths (A, 2A, 3A, etc.), then constructive interference occurs. 


Note: 

Take-Home Experiment: Using Fingers as Slits 

Look at a light, such as a street lamp or incandescent bulb, through the 
narrow gap between two fingers held close together. What type of pattern 
do you see? How does it change when you allow the fingers to move a 
little farther apart? Is it more distinct for a monochromatic source, such as 
the yellow light from a sodium vapor lamp, than for an incandescent bulb? 


Dark Bright 
(destructive (constructive 
interference) interference) 


Waves follow different paths 
from the slits to a common 
point on a screen. (a) 
Destructive interference occurs 
here, because one path is a half 
wavelength longer than the 
other. The waves start in phase 
but arrive out of phase. (b) 
Constructive interference occurs 
here because one path is a 
whole wavelength longer than 
the other. The waves start out 
and arrive in phase. 


[link] shows how to determine the path length difference for waves 
traveling from two slits to a common point on a screen. If the screen is a 
large distance away compared with the distance between the slits, then the 
angle @ between the path and a line from the slits to the screen (see the 
figure) is nearly the same for each path. The difference between the paths is 
shown in the figure; simple trigonometry shows it to be d sin 0, where d is 
the distance between the slits. To obtain constructive interference for a 
double slit, the path length difference must be an integral multiple of the 
wavelength, or 

Equation: 


d sin@ = mi, form =0,1, —1,2, — 2, ... (constructive). 


Similarly, to obtain destructive interference for a double slit, the path 
length difference must be a half-integral multiple of the wavelength, or 
Equation: 


1 
d sin@ = G + 3) A, form =0,1, —1,2, —2,... (destructive), 


where J is the wavelength of the light, d is the distance between slits, and 0 
is the angle from the original direction of the beam as discussed above. We 
call m the order of the interference. For example, m = 4 is fourth-order 
interference. 


A¢=dsin@ 


Screen 


The paths from each 
slit to a common point 
on the screen differ by 

an amount d sin 0, 
assuming the distance 
to the screen is much 

greater than the 
distance between slits 

(not to scale here). 


The equations for double slit interference imply that a series of bright and 
dark lines are formed. For vertical slits, the light spreads out horizontally on 


either side of the incident beam into a pattern called interference fringes, 
illustrated in [link]. The intensity of the bright fringes falls off on either 
side, being brightest at the center. The closer the slits are, the more is the 
spreading of the bright fringes. We can see this by examining the equation 
Equation: 


d sin? = mA, form =0,1, —1,2, —2,.... 


For fixed A and m, the smaller d is, the larger 0 must be, since 

sin 9 = m2/ d. This is consistent with our contention that wave effects are 
most noticeable when the object the wave encounters (here, slits a distance 
d apart) is small. Small d gives large 0, hence a large effect. 


2 ->| 


The interference pattern for a double 
slit has an intensity that falls off with 
angle. The photograph shows multiple 
bright and dark lines, or fringes, 
formed by light passing through a 
double slit. 


Example: 
Finding a Wavelength from an Interference Pattern 


Suppose you pass light from a He-Ne laser through two slits separated by 
0.0100 mm and find that the third bright line on a screen is formed at an 
angle of 10.95° relative to the incident beam. What is the wavelength of 
the light? 

Strategy 

The third bright line is due to third-order constructive interference, which 
means that m = 3. We are given d = 0.0100 mm and 9 = 10.95°. The 
wavelength can thus be found using the equation d sin 8 = mA for 
constructive interference. 

Solution 

The equation isd sin 8 = mA. Solving for the wavelength A gives 
Equation: 


d sin 0 
eo sin 0 
m 
Substituting known values yields 
Equation: 
ees (0.0100 seals 10.95°) 
= 6.33 x 10-4 mm = 633 nm. 

Discussion 


To three digits, this is the wavelength of light emitted by the common He- 
Ne laser. Not by coincidence, this red color is similar to that emitted by 
neon lights. More important, however, is the fact that interference patterns 
can be used to measure wavelength. Young did this for visible 
wavelengths. This analytical technique is still widely used to measure 
electromagnetic spectra. For a given order, the angle for constructive 
interference increases with A, so that spectra (measurements of intensity 
versus wavelength) can be obtained. 


Example: 
Calculating Highest Order Possible 


Interference patterns do not have an infinite number of lines, since there is 
a limit to how big m can be. What is the highest-order constructive 
interference possible with the system described in the preceding example? 
Strategy and Concept 

The equation d sin 9 = md (form =0,1, — 1, 2, — 2, ...) describes 
constructive interference. For fixed values of d and 4, the larger m is, the 
larger sin 0 is. However, the maximum value that sin @ can have is 1, for 
an angle of 90°. (Larger angles imply that light goes backward and does 
not reach the screen at all.) Let us find which m corresponds to this 
maximum diffraction angle. 


Solution 
Solving the equation d sin 8 = mA for m gives 
Equation: 
_ dsin@é 
esa 


Taking sin 8 = 1 and substituting the values of d and A from the preceding 
example gives 


Equation: 
0.0100 1 
ee UD eee a 
633 nm 
Therefore, the largest integer m can be is 15, or 
Equation: 
TiS: 
Discussion 


The number of fringes depends on the wavelength and slit separation. The 
number of fringes will be very large for large slit separations. However, if 
the slit separation becomes much greater than the wavelength, the intensity 
of the interference pattern changes so that the screen has two bright lines 
cast by the slits, as expected when light behaves like a ray. We also note 
that the fringes get fainter further away from the center. Consequently, not 
all 15 fringes may be observable. 


Section Summary 


e Young’s double slit experiment gave definitive proof of the wave 
character of light. 

e An interference pattern is obtained by the superposition of light from 
two slits. 

e There is constructive interference when 
d sin 9 = md (form = 0,1, —1,2, — 2, ...), where d is the 
distance between the slits, 0 is the angle relative to the incident 
direction, and m is the order of the interference. 

e There is destructive interference when 
d sin@ = (m+ 4)A (for m=0,1, —1,2, — 2, ...). 


Conceptual Questions 


Exercise: 
Problem: 
Young’s double slit experiment breaks a single light beam into two 


sources. Would the same pattern be obtained for two independent 
sources of light, such as the headlights of a distant car? Explain. 


Exercise: 
Problem: 
Suppose you use the same double slit to perform Young’s double slit 
experiment in air and then repeat the experiment in water. Do the 


angles to the same parts of the interference pattern get larger or 
smaller? Does the color of the light change? Explain. 


Exercise: 
Problem: 
Is it possible to create a situation in which there is only destructive 
interference? Explain. 


Exercise: 


Problem: 


[link] shows the central part of the interference pattern for a pure 
wavelength of red light projected onto a double slit. The pattern is 
actually a combination of single slit and double slit interference. Note 
that the bright spots are evenly spaced. Is this a double slit or single slit 
characteristic? Note that some of the bright spots are dim on either side 
of the center. Is this a single slit or double slit characteristic? Which is 
smaller, the slit width or the separation between slits? Explain your 


responses. 


This double slit interference 
pattern also shows signs of 
single slit interference. (credit: 
PASCO) 


Problems & Exercises 


Exercise: 


Problem: 


At what angle is the first-order maximum for 450-nm wavelength blue 
light falling on double slits separated by 0.0500 mm? 


Solution: 


0.516° 


Exercise: 


Problem: 


Calculate the angle for the third-order maximum of 580-nm 
wavelength yellow light falling on double slits separated by 0.100 mm. 


Exercise: 
Problem: 


What is the separation between two slits for which 610-nm orange 
light has its first maximum at an angle of 30.0°? 


Solution: 


1.22 x 10°m 
Exercise: 
Problem: 
Find the distance between two slits that produces the first minimum for 
410-nm violet light at an angle of 45.0°. 
Exercise: 
Problem: 
Calculate the wavelength of light that has its third minimum at an 
angle of 30.0° when falling on double slits separated by 3.00 pm. 


Explicitly, show how you follow the steps in Problem-Solving 
Strategies for Wave Optics. 


Solution: 


600 nm 
Exercise: 
Problem: 


What is the wavelength of light falling on double slits separated by 
2.00 ym if the third-order maximum is at an angle of 60.0°? 


Exercise: 


Problem: 
At what angle is the fourth-order maximum for the situation in [link]? 


Solution: 


2.06° 
Exercise: 
Problem: 
What is the highest-order maximum for 400-nm light falling on double 
slits separated by 25.0 um? 
Exercise: 
Problem: 
Find the largest wavelength of light falling on double slits separated by 


1.20 um for which there is a first-order maximum. Is this in the visible 
part of the spectrum? 


Solution: 


1200 nm (not visible) 
Exercise: 
Problem: 
What is the smallest separation between two slits that will produce a 
second-order maximum for 720-nm red light? 
Exercise: 
Problem: 


(a) What is the smallest separation between two slits that will produce 
a second-order maximum for any visible light? (b) For all visible light? 


Solution: 
(a) 760 nm 


(b) 1520 nm 
Exercise: 


Problem: 


(a) If the first-order maximum for pure-wavelength light falling on a 
double slit is at an angle of 10.0°, at what angle is the second-order 
maximum? (b) What is the angle of the first minimum? (c) What is the 
highest-order maximum possible here? 


Exercise: 


Problem: 


[link] shows a double slit located a distance x from a screen, with the 
distance from the center of the screen given by y. When the distance d 
between the slits is relatively large, there will be numerous bright 
spots, called fringes. Show that, for small angles (where sin 6 = 0, 
with @ in radians), the distance between fringes is given by 

Ay aad. 


The distance between 
adjacent fringes is 
Ay = xX/d, assuming the 


slit separation d is large 
compared with J. 


Solution: 


For small angles sin 9 — tan 0 ~ 6 (in radians). 


For two adjacent fringes we have, 
Equation: 


d sin 06, = mA 
and 
Equation: 


d sin 0n41 = (mM+1)A 


Subtracting these equations gives 


Equation: 
d(sin 0441 — sin 0) = [((m+1) —m]r 
d(6n4+1—Om) = A 
tan 0, = me Om => d( 4 — 4m) =A 
d=} > Ay= 
Exercise: 
Problem: 


Using the result of the problem above, calculate the distance between 
fringes for 633-nm light falling on double slits separated by 0.0800 
mm, located 3.00 m from a screen as in [link]. 


Exercise: 


Problem: 


Using the result of the problem two problems prior, find the 
wavelength of light that produces fringes 7.50 mm apart on a screen 
2.00 m from double slits separated by 0.120 mm (see [link]). 


Solution: 


450 nm 


Glossary 


coherent 
waves are in phase or have a definite phase relationship 


constructive interference for a double slit 
the path length difference must be an integral multiple of the 
wavelength 


destructive interference for a double slit 
the path length difference must be a half-integral multiple of the 
wavelength 


incoherent 
waves have random phase relationships 


order 
the integer m used in the equations for constructive and destructive 
interference for a double slit 


Multiple Slit Diffraction 


e Discuss the pattern obtained from diffraction grating. 
e Explain diffraction grating effects. 


An interesting thing happens if you pass light through a large number of 
evenly spaced parallel slits, called a diffraction grating. An interference 
pattern is created that is very similar to the one formed by a double slit (see 
[link]). A diffraction grating can be manufactured by scratching glass with a 
sharp tool in a number of precisely positioned parallel lines, with the 
untouched regions acting like slits. These can be photographically mass 
produced rather cheaply. Diffraction gratings work both for transmission of 
light, as in [link], and for reflection of light, as on butterfly wings and the 
Australian opal in [link] or the CD pictured in the opening photograph of 
this chapter, [link]. In addition to their use as novelty items, diffraction 
gratings are commonly used for spectroscopic dispersion and analysis of 
light. What makes them particularly useful is the fact that they form a 
sharper pattern than double slits do. That is, their bright regions are 
narrower and brighter, while their dark regions are darker. [link] shows 
idealized graphs demonstrating the sharper pattern. Natural diffraction 
gratings occur in the feathers of certain birds. Tiny, finger-like structures in 
regular patterns act as reflection gratings, producing constructive 
interference that gives the feathers colors not solely due to their 
pigmentation. This is called iridescence. 


Second-order 
rainbow 


First-order 
rainbow 


white 
First-order 
rainbow 


Second-order 
rainbow 


(a) (b) 


= 
— 
= 
co Central 
= 
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A diffraction grating is a large 
number of evenly spaced 
parallel slits. (a) Light passing 
through is diffracted in a pattern 
similar to a double slit, with 


bright regions at various angles. 
(b) The pattern obtained for 
white light incident on a 
grating. The central maximum 
is white, and the higher-order 
maxima disperse white light 
into a rainbow of colors. 


(a) (b) 


(a) This Australian opal and (b) 

the butterfly wings have rows of 

reflectors that act like reflection 

gratings, reflecting different 
colors at different angles. 
(credits: (a) Opals-On- 
Black.com, via Flickr (b) 
whologwhy, Flickr) 


Double slit 


Grating 


0 m=1 


m=1 m 
(b) 


Idealized graphs 
of the intensity 
of light passing 
through a double 
slit (a) anda 
diffraction 
grating (b) for 
monochromatic 
light. Maxima 
can be produced 
at the same 
angles, but those 
for the 
diffraction 
grating are 
narrower and 
hence sharper. 
The maxima 
become narrower 
and the regions 
between darker 
as the number of 
slits is increased. 


The analysis of a diffraction grating is very similar to that for a double slit 
(see [link]). As we know from our discussion of double slits in Young's 
Double Slit Experiment, light is diffracted by each slit and spreads out after 
passing through. Rays traveling in the same direction (at an angle @ relative 
to the incident direction) are shown in the figure. Each of these rays travels 
a different distance to a common point on a screen far away. The rays start 
in phase, and they can be in or out of phase when they reach a screen, 
depending on the difference in the path lengths traveled. As seen in the 
figure, each ray travels a distance d sin 6 different from that of its neighbor, 
where d is the distance between slits. If this distance equals an integral 
number of wavelengths, the rays all arrive in phase, and constructive 
interference (a maximum) is obtained. Thus, the condition necessary to 
obtain constructive interference for a diffraction grating is 

Equation: 


d sin @ = mi, for m = 0, 1, -1, 2, -2,... (constructive), 


where d is the distance between slits in the grating, A is the wavelength of 
light, and m is the order of the maximum. Note that this is exactly the same 
equation as for double slits separated by d. However, the slits are usually 
closer in diffraction gratings than in double slits, producing fewer maxima 
at larger angles. 

Ae = dsin@é 


Diffraction grating 
showing light rays 
from each slit 
traveling in the 
same direction. 
Each ray travels a 
different distance to 
reach a common 
point on a screen 
(not shown). Each 
ray travels a 
distance d sin 0 
different from that 
of its neighbor. 


Where are diffraction gratings used? Diffraction gratings are key 
components of monochromators used, for example, in optical imaging of 
particular wavelengths from biological or medical samples. A diffraction 
grating can be chosen to specifically analyze a wavelength emitted by 
molecules in diseased cells in a biopsy sample or to help excite strategic 
molecules in the sample with a selected frequency of light. Another vital 
use is in optical fiber technologies where fibers are designed to provide 
optimum performance at specific wavelengths. A range of diffraction 
gratings are available for selecting specific wavelengths for such use. 


Note: 

Take-Home Experiment: Rainbows on a CD 

The spacing d of the grooves in a CD or DVD can be well determined by 
using a laser and the equation d sin 8 = mA, for m = 0, 1, —1, 2, —2,... 
. However, we can still make a good estimate of this spacing by using 
white light and the rainbow of colors that comes from the interference. 
Reflect sunlight from a CD onto a wall and use your best judgment of the 
location of a strongly diffracted color to find the separation d. 


Example: 

Calculating Typical Diffraction Grating Effects 

Diffraction gratings with 10,000 lines per centimeter are readily available. 
Suppose you have one, and you send a beam of white light through it to a 
screen 2.00 m away. (a) Find the angles for the first-order diffraction of the 
shortest and longest wavelengths of visible light (380 and 760 nm). (b) 
What is the distance between the ends of the rainbow of visible light 
produced on the screen for first-order interference? (See [link].) 


Screen 


The diffraction grating 
considered in this 
example produces a 
rainbow of colors on a 
screen a distance 
x = 2.00 m from the 
grating. The distances 
along the screen are 
measured perpendicular 
to the x-direction. In 
other words, the rainbow 
pattern extends out of 
the page. 


Strategy 
The angles can be found using the equation 
Equation: 


d sin 0 = mA (for m = 0, 1, -1, 2, -2, ...) 


once a value for the slit spacing d has been determined. Since there are 
10,000 lines per centimeter, each line is separated by 1/10,000 of a 
centimeter. Once the angles are found, the distances along the screen can 
be found using simple trigonometry. 

Solution for (a) 

The distance between slits is d = (1 cm)/10,000 = 1.00 x 10-4 cm or 
1.00 x 10° m. Let us call the two angles @y for violet (380 nm) and 0g 
for red (760 nm). Solving the equation d sin 6 = m4 for sin Oy, 
Equation: 


maAy 
d y) 


sin by = 


where m = 1 for first order and Ay = 380 nm = 3.80 x 10°/ m. 
Substituting these values gives 


Equation: 

3.80 x 107 

Sti 

1.00 x 10°-&m 
Thus the angle Oy is 
Equation: 

6y = sin * 0.380 = 22.33°. 

Similarly, 
Equation: 


Thus the angle Op is 
Equation: 


6x = sin ! 0.760 = 49.46°. 


Notice that in both equations, we reported the results of these intermediate 
calculations to four significant figures to use with the calculation in part 
(b). 

Solution for (b) 

The distances on the screen are labeled yy and yp in [link]. Noting that 
tan 0 = y/zx, we can solve for yy and yp. That is, 

Equation: 


yy = x tan Oy = (2.00 m)(tan 22.33°) = 0.815 m 


and 
Equation: 


yp = z tan 6p = (2.00 m)(tan 49.46°) = 2.338 m. 


The distance between them is therefore 
Equation: 


yR — yv = 1.52 m. 


Discussion 

The large distance between the red and violet ends of the rainbow 
produced from the white light indicates the potential this diffraction grating 
has as a spectroscopic tool. The more it can spread out the wavelengths 
(greater dispersion), the more detail can be seen in a spectrum. This 
depends on the quality of the diffraction grating—it must be very precisely 
made in addition to having closely spaced lines. 


Section Summary 


¢ A diffraction grating is a large collection of evenly spaced parallel slits 
that produces an interference pattern similar to but sharper than that of 
a double slit. 

e There is constructive interference for a diffraction grating when 
d sin @ = mA (for m = 0, 1, -1, 2, -2, ...), where d is the distance 
between slits in the grating, A is the wavelength of light, and m is the 
order of the maximum. 


Conceptual Questions 


Exercise: 
Problem: 
What is the advantage of a diffraction grating over a double slit in 
dispersing light into a spectrum? 
Exercise: 
Problem: 
What are the advantages of a diffraction grating over a prism in 
dispersing light for spectral analysis? 
Exercise: 
Problem: 
Can the lines in a diffraction grating be too close together to be useful 


as a spectroscopic tool for visible light? If so, what type of EM 
radiation would the grating be suitable for? Explain. 


Exercise: 


Problem: 


If a beam of white light passes through a diffraction grating with 
vertical lines, the light is dispersed into rainbow colors on the right and 
left. If a glass prism disperses white light to the right into a rainbow, 
how does the sequence of colors compare with that produced on the 
right by a diffraction grating? 


Exercise: 
Problem: 
Suppose pure-wavelength light falls on a diffraction grating. What 
happens to the interference pattern if the same light falls on a grating 
that has more lines per centimeter? What happens to the interference 
pattern if a longer-wavelength light falls on the same grating? Explain 


how these two effects are consistent in terms of the relationship of 
wavelength to the distance between slits. 


Exercise: 
Problem: 
Suppose a feather appears green but has no green pigment. Explain in 
terms of diffraction. 
Exercise: 
Problem: 
It is possible that there is no minimum in the interference pattern of a 


single slit. Explain why. Is the same true of double slits and diffraction 
gratings? 


Problems & Exercises 


Exercise: 


Problem: 


A diffraction grating has 2000 lines per centimeter. At what angle will 
the first-order maximum be for 520-nm-wavelength green light? 


Solution: 


Doe 


Exercise: 


Problem: 


Find the angle for the third-order maximum for 580-nm-wavelength 
yellow light falling on a diffraction grating having 1500 lines per 
centimeter. 


Exercise: 
Problem: 


How many lines per centimeter are there on a diffraction grating that 


gives a first-order maximum for 470-nm blue light at an angle of 25.0° 
ig 


Solution: 


8.99 x 10° 
Exercise: 
Problem: 
What is the distance between lines on a diffraction grating that 


produces a second-order maximum for 760-nm red light at an angle of 
60.0°? 


Exercise: 


Problem: 


Calculate the wavelength of light that has its second-order maximum 
at 45.0° when falling on a diffraction grating that has 5000 lines per 
centimeter. 


Solution: 


707 nm 


Exercise: 


Problem: 


An electric current through hydrogen gas produces several distinct 
wavelengths of visible light. What are the wavelengths of the hydrogen 
spectrum, if they form first-order maxima at angles of 24.2°, 25.7°, 
29.1°, and 41.0° when projected on a diffraction grating having 10,000 
lines per centimeter? Explicitly show how you follow the steps in 
Problem-Solving Strategies for Wave Optics 


Exercise: 
Problem: 
(a) What do the four angles in the above problem become if a 5000- 
line-per-centimeter diffraction grating is used? (b) Using this grating, 
what would the angles be for the second-order maxima? (c) Discuss 


the relationship between integral reductions in lines per centimeter and 
the new angles of various order maxima. 


Solution: 
(a) 11.8°, 12.5°, 14.1°, 19.2° 
(b) 24.2", 25.7°, 29.1°, 41.0" 


(c) Decreasing the number of lines per centimeter by a factor of x 
means that the angle for the x-order maximum is the same as the 
original angle for the first- order maximum. 


Exercise: 
Problem: 
What is the maximum number of lines per centimeter a diffraction 


grating can have and produce a complete first-order spectrum for 
visible light? 


Exercise: 


Problem: 


The yellow light from a sodium vapor lamp seems to be of pure 
wavelength, but it produces two first-order maxima at 36.093° and 
36.129° when projected on a 10,000 line per centimeter diffraction 
grating. What are the two wavelengths to an accuracy of 0.1 nm? 


Solution: 


589.1 nm and 589.6 nm 
Exercise: 
Problem: 
What is the spacing between structures in a feather that acts as a 


reflection grating, given that they produce a first-order maximum for 
525-nm light at a 30.0° angle? 


Exercise: 
Problem: 
Structures on a bird feather act like a reflection grating having 8000 


lines per centimeter. What is the angle of the first-order maximum for 
600-nm light? 


Solution: 


28.7° 
Exercise: 
Problem: 
An opal such as that shown in [Link] acts like a reflection grating with 
rows separated by about 8 pm. If the opal is illuminated normally, (a) 


at what angle will red light be seen and (b) at what angle will blue light 
be seen? 


Exercise: 


Problem: 


At what angle does a diffraction grating produces a second-order 
maximum for light having a first-order maximum at 20.0°? 


Solution: 


43.2° 
Exercise: 
Problem: 
Show that a diffraction grating cannot produce a second-order 


maximum for a given wavelength of light unless the first-order 
maximum is at an angle less than 30.0°. 


Exercise: 
Problem: 
If a diffraction grating produces a first-order maximum for the shortest 


wavelength of visible light at 30.0°, at what angle will the first-order 
maximum be for the longest wavelength of visible light? 


Solution: 


90.0° 
Exercise: 
Problem: 
(a) Find the maximum number of lines per centimeter a diffraction 
grating can have and produce a maximum for the smallest wavelength 


of visible light. (b) Would such a grating be useful for ultraviolet 
spectra? (c) For infrared spectra? 


Exercise: 


Problem: 


(a) Show that a 30,000-line-per-centimeter grating will not produce a 
maximum for visible light. (b) What is the longest wavelength for 
which it does produce a first-order maximum? (c) What is the greatest 
number of lines per centimeter a diffraction grating can have and 
produce a complete second-order spectrum for visible light? 


Solution: 
(a) The longest wavelength is 333.3 nm, which is not visible. 
(b) 333 nm (UV) 


(c) 6.58 x 10° cm 
Exercise: 


Problem: 


A He-Ne laser beam is reflected from the surface of a CD onto a wall. 
The brightest spot is the reflected beam at an angle equal to the angle 
of incidence. However, fringes are also observed. If the wall is 1.50 m 
from the CD, and the first fringe is 0.600 m from the central 
maximum, what is the spacing of grooves on the CD? 


Exercise: 
Problem: 
The analysis shown in the figure below also applies to diffraction 
gratings with lines separated by a distance d. What is the distance 


between fringes produced by a diffraction grating having 125 lines per 
centimeter for 600-nm light, if the screen is 1.50 m away? 


The distance between adjacent 
fringes is Ay = xX/d, 
assuming the slit separation d is 
large compared with 4. 


Solution: 


1.13 x 10°? m 


Exercise: 


Problem: Unreasonable Results 


Red light of wavelength of 700 nm falls on a double slit separated by 
A400 nm. (a) At what angle is the first-order maximum in the diffraction 
pattern? (b) What is unreasonable about this result? (c) Which 
assumptions are unreasonable or inconsistent? 


Exercise: 
Problem: Unreasonable Results 


(a) What visible wavelength has its fourth-order maximum at an angle 
of 25.0° when projected on a 25,000-line-per-centimeter diffraction 


grating? (b) What is unreasonable about this result? (c) Which 
assumptions are unreasonable or inconsistent? 


Solution: 
(a) 42.3 nm 
(b) Not a visible wavelength 


The number of slits in this diffraction grating is too large. Etching in 
integrated circuits can be done to a resolution of 50 nm, so slit 
separations of 400 nm are at the limit of what we can do today. This 
line spacing is too small to produce diffraction of light. 


Exercise: 


Problem: Construct Your Own Problem 


Consider a spectrometer based on a diffraction grating. Construct a 
problem in which you calculate the distance between two wavelengths 
of electromagnetic radiation in your spectrometer. Among the things to 
be considered are the wavelengths you wish to be able to distinguish, 
the number of lines per meter on the diffraction grating, and the 
distance from the grating to the screen or detector. Discuss the 
practicality of the device in terms of being able to discern between 
wavelengths of interest. 


Glossary 


constructive interference for a diffraction grating 
occurs when the condition 
d sin @ = md (for m = 0, 1, -1, 2, -2, ...) is satisfied, where d is 
the distance between slits in the grating, A is the wavelength of light, 
and m is the order of the maximum 


diffraction grating 
a large number of evenly spaced parallel slits 


Single Slit Diffraction 
e Discuss the single slit diffraction pattern. 


Light passing through a single slit forms a diffraction pattern somewhat 
different from those formed by double slits or diffraction gratings. [link] 
shows a single slit diffraction pattern. Note that the central maximum is 
larger than those on either side, and that the intensity decreases rapidly on 
either side. In contrast, a diffraction grating produces evenly spaced lines 
that dim slowly on either side of center. 
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(a) Single slit 
diffraction 
pattern. 
Monochromati 
c light passing 
through a 
single slit has a 
central 
maximum and 
many smaller 
and dimmer 
maxima on 
either side. The 
central 
maximum is 
Six times 
higher than 
shown. (b) The 
drawing shows 


= 
lox 
— 


the bright 
central 
maximum and 
dimmer and 
thinner maxima 
on either side. 


The analysis of single slit diffraction is illustrated in [link]. Here we 
consider light coming from different parts of the same slit. According to 
Huygens’s principle, every part of the wavefront in the slit emits wavelets. 
These are like rays that start out in phase and head in all directions. (Each 
ray is perpendicular to the wavefront of a wavelet.) Assuming the screen is 
very far away compared with the size of the slit, rays heading toward a 
common destination are nearly parallel. When they travel straight ahead, as 
in [link](a), they remain in phase, and a central maximum is obtained. 
However, when rays travel at an angle @ relative to the original direction of 
the beam, each travels a different distance to a common location, and they 
can arrive in or out of phase. In [link](b), the ray from the bottom travels a 
distance of one wavelength A farther than the ray from the top. Thus a ray 
from the center travels a distance /2 farther than the one on the left, 
arrives out of phase, and interferes destructively. A ray from slightly above 
the center and one from slightly above the bottom will also cancel one 
another. In fact, each ray from the slit will have another to interfere 
destructively, and a minimum in intensity will occur at this angle. There 
will be another minimum at the same angle to the right of the incident 
direction of the light. 


| y 
——_bd 
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Bright 


sind = 34 
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Bright Dark 


(c) (d) 


Light passing through a single slit is 
diffracted in all directions and may interfere 
constructively or destructively, depending 
on the angle. The difference in path length 
for rays from either side of the slit is seen to 
be D sin 0. 


At the larger angle shown in [link](c), the path lengths differ by 3A/2 for 
rays from the top and bottom of the slit. One ray travels a distance 
different from the ray from the bottom and arrives in phase, interfering 
constructively. Two rays, each from slightly above those two, will also add 
constructively. Most rays from the slit will have another to interfere with 
constructively, and a maximum in intensity will occur at this angle. 
However, all rays do not interfere constructively for this situation, and so 
the maximum is not as intense as the central maximum. Finally, in [link](d), 
the angle shown is large enough to produce a second minimum. As seen in 
the figure, the difference in path length for rays from either side of the slit is 
D sin 6, and we see that a destructive minimum is obtained when this 
distance is an integral multiple of the wavelength. 


Intensity 


3A sing 
D 
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A graph of 
single slit 
diffraction 
intensity 
showing the 
central 
maximum to 
be wider and 
much more 
intense than 
those to the 
sides. In fact 
the central 
maximum is 
six times 
higher than 
shown here. 


Thus, to obtain destructive interference for a single slit, 
Equation: 


D sin 0 = mX, for m = 1, -1, 2, -2, 3, ... (destructive), 


where D is the slit width, A is the light’s wavelength, 0 is the angle relative 
to the original direction of the light, and m is the order of the minimum. 
[link] shows a graph of intensity for single slit interference, and it is 
apparent that the maxima on either side of the central maximum are much 
less intense and not as wide. This is consistent with the illustration in [link] 


(b). 


Example: 

Calculating Single Slit Diffraction 

Visible light of wavelength 550 nm falls on a single slit and produces its 
second diffraction minimum at an angle of 45.0° relative to the incident 
direction of the light. (a) What is the width of the slit? (b) At what angle is 
the first minimum produced? 


Intensity 
on screen 


A graph of the 


single slit 
diffraction pattern 
is analyzed in this 

example. 


Strategy 

From the given information, and assuming the screen is far away from the 
slit, we can use the equation D sin 8 = mA first to find D, and again to 
find the angle for the first minimum 9}. 

Solution for (a) 

We are given that A = 550 nm, m = 2, and 02 = 45.0°. Solving the 
equation D sin 8 = md for D and substituting known values gives 
Equation: 


es md __ 2(550 nm) 
sinf. ~—_—_ sin 45.0° 


1100x1072 
0.707 


— 1.56~x 10°. 


Solution for (b) 

Solving the equation D sin 8? = m4) for sin 6, and substituting the known 
values gives 

Equation: 


mX — 1(550 x 10-° m) 


sin 6, = TS 
D 1.56 x 10m 
Thus the angle 0, is 
Equation: 
6, =sin ' 0.354 = 20.7°. 
Discussion 


We see that the slit is narrow (it is only a few times greater than the 
wavelength of light). This is consistent with the fact that light must interact 
with an object comparable in size to its wavelength in order to exhibit 


significant wave effects such as this single slit diffraction pattern. We also 
see that the central maximum extends 20.7° on either side of the original 
beam, for a width of about 41° . The angle between the first and second 
minima is only about 24° (45.0° — 20.7°). Thus the second maximum is 
only about half as wide as the central maximum. 


Section Summary 


e A single slit produces an interference pattern characterized by a broad 
central maximum with narrower and dimmer maxima to the sides. 

e There is destructive interference for a single slit when 
D sin@ = mA, (for m = 1, -1, 2, -2, 3, ...), where D is the slit 
width, A is the light’s wavelength, @ is the angle relative to the original 
direction of the light, and ™ is the order of the minimum. Note that 
there is no m = O minimum. 


Conceptual Questions 


Exercise: 


Problem: 

As the width of the slit producing a single-slit diffraction pattern is 

reduced, how will the diffraction pattern produced change? 
Problems & Exercises 


Exercise: 


Problem: 


(a) At what angle is the first minimum for 550-nm light falling on a 
single slit of width 1.00 um? (b) Will there be a second minimum? 


Solution: 


(a) 33.4° 


(b) No 
Exercise: 
Problem: 
(a) Calculate the angle at which a 2.00-,1m-wide slit produces its first 


minimum for 410-nm violet light. (b) Where is the first minimum for 
700-nm red light? 


Exercise: 
Problem: 
(a) How wide is a single slit that produces its first minimum for 633- 


nm light at an angle of 28.0°? (b) At what angle will the second 
minimum be? 


Solution: 
(a) 1.35 x10 °m 


(b) 69.9° 
Exercise: 
Problem: 
(a) What is the width of a single slit that produces its first minimum at 


60.0° for 600-nm light? (b) Find the wavelength of light that has its 
first minimum at 62.0°. 


Exercise: 


Problem: 


Find the wavelength of light that has its third minimum at an angle of 
48.6° when it falls on a single slit of width 3.00 pm. 


Solution: 


750 nm 
Exercise: 

Problem: 

Calculate the wavelength of light that produces its first minimum at an 

angle of 36.9° when falling on a single slit of width 1.00 pm. 
Exercise: 

Problem: 

(a) Sodium vapor light averaging 589 nm in wavelength falls on a 


single slit of width 7.50 tm. At what angle does it produces its second 
minimum? (b) What is the highest-order minimum produced? 


Solution: 
(a) 9.04° 
(b) 12 


Exercise: 


Problem: 


(a) Find the angle of the third diffraction minimum for 633-nm light 
falling on a slit of width 20.0 pm. (b) What slit width would place this 
minimum at 85.0°? Explicitly show how you follow the steps in 
Problem-Solving Strategies for Wave Optics 


Exercise: 


Problem: 


(a) Find the angle between the first minima for the two sodium vapor 
lines, which have wavelengths of 589.1 and 589.6 nm, when they fall 
upon a single slit of width 2.00 tm. (b) What is the distance between 
these minima if the diffraction pattern falls on a screen 1.00 m from 
the slit? (c) Discuss the ease or difficulty of measuring such a distance. 


Solution: 

(a) 0.0150° 

(b) 0.262 mm 

(c) This distance is not easily measured by human eye, but under a 

microscope or magnifying glass it is quite easily measurable. 
Exercise: 

Problem: 

(a) What is the minimum width of a single slit (in multiples of A) that 


will produce a first minimum for a wavelength A? (b) What is its 
minimum width if it produces 50 minima? (c) 1000 minima? 


Exercise: 
Problem: 
(a) If a single slit produces a first minimum at 14.5°, at what angle is 
the second-order minimum? (b) What is the angle of the third-order 
minimum? (c) Is there a fourth-order minimum? (d) Use your answers 
to illustrate how the angular width of the central maximum is about 


twice the angular width of the next maximum (which is the angle 
between the first and second minima). 


Solution: 
(a) 30.1° 

(b) 48.7° 

(c) No 


(d) 20, = (2)(14.5°) = 29°, 2 — 0, = 30.05° — 14.5°=15.56°. 
Thus, 29° = (2)(15.56°) = 31.1°. 


Exercise: 


Problem: 


A double slit produces a diffraction pattern that is a combination of 
single and double slit interference. Find the ratio of the width of the 
slits to the separation between them, if the first minimum of the single 
slit pattern falls on the fifth maximum of the double slit pattern. (This 
will greatly reduce the intensity of the fifth maximum.) 


Exercise: 


Problem: Integrated Concepts 


A water break at the entrance to a harbor consists of a rock barrier with 
a 50.0-m-wide opening. Ocean waves of 20.0-m wavelength approach 
the opening straight on. At what angle to the incident direction are the 
boats inside the harbor most protected against wave action? 


Solution: 


23.6° and 53.1° 


Exercise: 


Problem: Integrated Concepts 


An aircraft maintenance technician walks past a tall hangar door that 
acts like a single slit for sound entering the hangar. Outside the door, 
on a line perpendicular to the opening in the door, a jet engine makes a 
600-Hz sound. At what angle with the door will the technician observe 
the first minimum in sound intensity if the vertical opening is 0.800 m 
wide and the speed of sound is 340 m/s? 


Glossary 
destructive interference for a single slit 


occurs when D sin 0 = md, (for m = 1, -1, 2, -2, 3, ...), where 
D is the slit width, A is the light’s wavelength, 0 is the angle relative to 


the original direction of the light, and m is the order of the minimum 


Limits of Resolution: The Rayleigh Criterion 
e Discuss the Rayleigh criterion. 


Light diffracts as it moves through space, bending around obstacles, 
interfering constructively and destructively. While this can be used as a 
spectroscopic tool—a diffraction grating disperses light according to 
wavelength, for example, and is used to produce spectra—diffraction also 
limits the detail we can obtain in images. [link](a) shows the effect of 
passing light through a small circular aperture. Instead of a bright spot with 
sharp edges, a spot with a fuzzy edge surrounded by circles of light is 
obtained. This pattern is caused by diffraction similar to that produced by a 
single slit. Light from different parts of the circular aperture interferes 
constructively and destructively. The effect is most noticeable when the 
aperture is small, but the effect is there for large apertures, too. 


(a) (b) (c) 


(a) Monochromatic light passed 
through a small circular aperture 
produces this diffraction pattern. (b) 
Two point light sources that are close 
to one another produce overlapping 
images because of diffraction. (c) If 
they are closer together, they cannot 
be resolved or distinguished. 


How does diffraction affect the detail that can be observed when light 
passes through an aperture? [link](b) shows the diffraction pattern produced 
by two point light sources that are close to one another. The pattern is 
similar to that for a single point source, and it is just barely possible to tell 
that there are two light sources rather than one. If they were closer together, 


as in [link](c), we could not distinguish them, thus limiting the detail or 
resolution we can obtain. This limit is an inescapable consequence of the 
wave nature of light. 


There are many situations in which diffraction limits the resolution. The 
acuity of our vision is limited because light passes through the pupil, the 
circular aperture of our eye. Be aware that the diffraction-like spreading of 
light is due to the limited diameter of a light beam, not the interaction with 
an aperture. Thus light passing through a lens with a diameter D shows this 
effect and spreads, blurring the image, just as light passing through an 
aperture of diameter D does. So diffraction limits the resolution of any 
system having a lens or mirror. Telescopes are also limited by diffraction, 
because of the finite diameter D of their primary mirror. 


Note: 

Take-Home Experiment: Resolution of the Eye 

Draw two lines on a white sheet of paper (several mm apart). How far 
away can you be and still distinguish the two lines? What does this tell you 
about the size of the eye’s pupil? Can you be quantitative? (The size of an 
adult’s pupil is discussed in Physics of the Eye.) 


Just what is the limit? To answer that question, consider the diffraction 
pattern for a circular aperture, which has a central maximum that is wider 
and brighter than the maxima surrounding it (similar to a slit) [see [link] 
(a)]. It can be shown that, for a circular aperture of diameter D, the first 
minimum in the diffraction pattern occurs at 8 = 1.22 A/D (providing the 
aperture is large compared with the wavelength of light, which is the case 
for most optical instruments). The accepted criterion for determining the 
diffraction limit to resolution based on this angle was developed by Lord 
Rayleigh in the 19th century. The Rayleigh criterion for the diffraction 
limit to resolution states that two images are just resolvable when the center 
of the diffraction pattern of one is directly over the first minimum of the 
diffraction pattern of the other. See [link](b). The first minimum is at an 


angle of 9 = 1.22 /D, so that two point objects are just resolvable if they 
are separated by the angle 
Equation: 


0 = 129 
D 


where A is the wavelength of light (or other electromagnetic radiation) and 
D is the diameter of the aperture, lens, mirror, etc., with which the two 


objects are observed. In this expression, @ has units of radians. 
Intensities 


min 


) Object 1 


r) 
(@) 
0 Object 2 (f9 ) y 
-1.22A 1.222 WV 
(a) (b) 


(a) Graph of intensity of the diffraction 
pattern for a circular aperture. Note that, 
similar to a single slit, the central maximum 
is wider and brighter than those to the sides. 
(b) Two point objects produce overlapping 
diffraction patterns. Shown here is the 
Rayleigh criterion for being just resolvable. 
The central maximum of one pattern lies on 
the first minimum of the other. 


Note: 

Connections: Limits to Knowledge 

All attempts to observe the size and shape of objects are limited by the 
wavelength of the probe. Even the small wavelength of light prohibits 
exact precision. When extremely small wavelength probes as with an 
electron microscope are used, the system is disturbed, still limiting our 
knowledge, much as making an electrical measurement alters a circuit. 
Heisenberg’s uncertainty principle asserts that this limit is fundamental and 
inescapable, as we shall see in quantum mechanics. 


Example: 

Calculating Diffraction Limits of the Hubble Space Telescope 

The primary mirror of the orbiting Hubble Space Telescope has a diameter 
of 2.40 m. Being in orbit, this telescope avoids the degrading effects of 
atmospheric distortion on its resolution. (a) What is the angle between two 
just-resolvable point light sources (perhaps two stars)? Assume an average 
light wavelength of 550 nm. (b) If these two stars are at the 2 million light 
year distance of the Andromeda galaxy, how close together can they be and 
still be resolved? (A light year, or ly, is the distance light travels in 1 year.) 
Strategy 

The Rayleigh criterion stated in the equation 0 = 1.22 +“ gives the 
smallest possible angle 8 between point sources, or the best obtainable 
resolution. Once this angle is found, the distance between stars can be 
calculated, since we are given how far away they are. 

Solution for (a) 

The Rayleigh criterion for the minimum resolvable angle is 

Equation: 


X 
d= Pe 


Entering known values gives 
Equation: 


= 550x107? m 
0 = 1.22 2.40 m 


— 2.80 x 10~’ rad. 


Solution for (b) 

The distance s between two objects a distance r away and separated by an 
angle 0 is s = rf. 

Substituting known values gives 

Equation: 


s = (2.0 x 10° ly)(2.80 x 107% rad) 
0.56 ly. 


Discussion 

The angle found in part (a) is extraordinarily small (less than 1/50,000 of a 
degree), because the primary mirror is so large compared with the 
wavelength of light. As noticed, diffraction effects are most noticeable 
when light interacts with objects having sizes on the order of the 
wavelength of light. However, the effect is still there, and there is a 
diffraction limit to what is observable. The actual resolution of the Hubble 
Telescope is not quite as good as that found here. As with all instruments, 
there are other effects, such as non-uniformities in mirrors or aberrations in 
lenses that further limit resolution. However, [link] gives an indication of 
the extent of the detail observable with the Hubble because of its size and 
quality and especially because it is above the Earth’s atmosphere. 


(a) 


These two photographs of the 
M82 galaxy give an idea of the 
observable detail using the 
Hubble Space Telescope 
compared with that using a 
ground-based telescope. (a) On 


the left is a ground-based 
image. (credit: Ricnun, 
Wikimedia Commons) (b) The 
photo on the right was captured 
by Hubble. (credit: NASA, 
ESA, and the Hubble Heritage 
Team (STScI/AURA)) 


The answer in part (b) indicates that two stars separated by about half a 
light year can be resolved. The average distance between stars in a galaxy 
is on the order of 5 light years in the outer parts and about 1 light year near 
the galactic center. Therefore, the Hubble can resolve most of the 
individual stars in Andromeda galaxy, even though it lies at such a huge 
distance that its light takes 2 million years for its light to reach us. [link] 
shows another mirror used to observe radio waves from outer space. 


A 305-m-diameter 
natural bowl at 


Arecibo in Puerto 
Rico is lined with 
reflective material, 
making it into a 
radio telescope. It 
is the largest 
curved focusing 
dish in the world. 
Although D for 
Arecibo is much 
larger than for the 


Hubble Telescope, 
it detects much 
longer wavelength 
radiation and its 
diffraction limit is 
significantly poorer 
than Hubble’s. 
Arecibo is still very 
useful, because 
important 
information is 
carried by radio 
waves that is not 
carried by visible 
light. (credit: 
Tatyana 
Temirbulatova, 
Flickr) 


Diffraction is not only a problem for optical instruments but also for the 
electromagnetic radiation itself. Any beam of light having a finite diameter 
D and a wavelength A exhibits diffraction spreading. The beam spreads out 
with an angle 0 given by the equation 0 = 1.22 +. Take, for example, a 
laser beam made of rays as parallel as possible (angles between rays as 
close to 8 = 0° as possible) instead spreads out at an angle 0 = 1.22 \/D, 
where D is the diameter of the beam and / is its wavelength. This 
spreading is impossible to observe for a flashlight, because its beam is not 
very parallel to start with. However, for long-distance transmission of laser 
beams or microwave signals, diffraction spreading can be significant (see 
[link]). To avoid this, we can increase D. This is done for laser light sent to 
the Moon to measure its distance from the Earth. The laser beam is 
expanded through a telescope to make D much larger and @ smaller. 


The beam 
produced by 
this 
microwave 
transmission 
antenna will 
spread out at a 
minimum 
angle 
6=1,22 A/D 
due to 
diffraction. It 
is impossible 
to produce a 
near-parallel 
beam, because 
the beam has a 
limited 
diameter. 


In most biology laboratories, resolution is presented when the use of the 
microscope is introduced. The ability of a lens to produce sharp images of 
two closely spaced point objects is called resolution. The smaller the 
distance x by which two objects can be separated and still be seen as 
distinct, the greater the resolution. The resolving power of a lens is defined 
as that distance zx. An expression for resolving power is obtained from the 


Rayleigh criterion. In [link](a) we have two point objects separated by a 
distance x. According to the Rayleigh criterion, resolution is possible when 
the minimum angular separation is 

Equation: 


a es 
D d 


where d is the distance between the specimen and the objective lens, and we 
have used the small angle approximation (i.e., we have assumed that z is 
much smaller than d), so that tan 0 © sin 6 & @. 


Therefore, the resolving power is 
Equation: 


C= 1.2904, 
D 


Another way to look at this is by re-examining the concept of Numerical 
Aperture (NA) discussed in Microscopes. There, NA is a measure of the 
maximum acceptance angle at which the fiber will take light and still 
contain it within the fiber. [link](b) shows a lens and an object at point P. 
The NA here is a measure of the ability of the lens to gather light and 
resolve fine detail. The angle subtended by the lens at its focus is defined to 
be 6 = 2a. From the figure and again using the small angle approximation, 
we can write 

Equation: 


D/2 =D 
sn a = — 
d 


The NVA for a lens is NA = n sin a, where n is the index of refraction of 
the medium between the objective lens and the object at point P. 


From this definition for NA, we can see that 
Equation: 


Ad A An 
= 2 12? = 0.61 —_-.. 
. D 2 sin a NA 


In a microscope, NA is important because it relates to the resolving power 
of a lens. A lens with a large NA will be able to resolve finer details. 
Lenses with larger NA will also be able to collect more light and so give a 
brighter image. Another way to describe this situation is that the larger the 
NA, the larger the cone of light that can be brought into the lens, and so 
more of the diffraction modes will be collected. Thus the microscope has 
more information to form a clear image, and so its resolving power will be 
higher. 
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(a) Two points 
separated by at 
distance x anda 
positioned a 
distance d away 
from the objective. 
(credit: Infopro, 


Wikimedia 
Commons) (b) 
Terms and symbols 
used in discussion 
of resolving power 
for a lens and an 
object at point P. 
(credit: Infopro, 
Wikimedia 
Commons) 


One of the consequences of diffraction is that the focal point of a beam has 
a finite width and intensity distribution. Consider focusing when only 
considering geometric optics, shown in [link](a). The focal point is 
infinitely small with a huge intensity and the capacity to incinerate most 
samples irrespective of the NA of the objective lens. For wave optics, due 
to diffraction, the focal point spreads to become a focal spot (see [link](b)) 
with the size of the spot decreasing with increasing NA. Consequently, the 
intensity in the focal spot increases with increasing NA. The higher the NA 
, the greater the chances of photodegrading the specimen. However, the spot 
never becomes a true point. 


—_> 


Focal 
point 


———_ Geometric optics focus 


(a) 
—_——— 
Focal 
region 


(b) 
(a) In geometric optics, the 
focus is a point, but it is not 


physically possible to produce 
such a point because it implies 
infinite intensity. (b) In wave 
optics, the focus is an extended 
region. 


Section Summary 


e Diffraction limits resolution. 

e For a circular aperture, lens, or mirror, the Rayleigh criterion states 
that two images are just resolvable when the center of the diffraction 
pattern of one is directly over the first minimum of the diffraction 
pattern of the other. 

¢ This occurs for two point objects separated by the angle 0 = 1.224, 
where A is the wavelength of light (or other electromagnetic radiation) 
and D is the diameter of the aperture, lens, mirror, etc. This equation 


also gives the angular spreading of a source of light having a diameter 
LD; 


Conceptual Questions 


Exercise: 


Problem: 

A beam of light always spreads out. Why can a beam not be created 
with parallel rays to prevent spreading? Why can lenses, mirrors, or 
apertures not be used to correct the spreading? 


Problems & Exercises 


Exercise: 


Problem: 


The 300-m-diameter Arecibo radio telescope pictured in [link] detects 
radio waves with a 4.00 cm average wavelength. 


(a) What is the angle between two just-resolvable point sources for this 
telescope? 


(b) How close together could these point sources be at the 2 million 
light year distance of the Andromeda galaxy? 


Solution: 
(a) 1.63 x 10-4 rad 
(b) 326 ly 


Exercise: 
Problem: 
Assuming the angular resolution found for the Hubble Telescope in 
[link], what is the smallest detail that could be observed on the Moon? 
Exercise: 
Problem: 
Diffraction spreading for a flashlight is insignificant compared with 
other limitations in its optics, such as spherical aberrations in its 
mirror. To show this, calculate the minimum angular spreading of a 


flashlight beam that is originally 5.00 cm in diameter with an average 
wavelength of 600 nm. 


Solution: 


1.46 x 10-° rad 


Exercise: 


Problem: 


(a) What is the minimum angular spread of a 633-nm wavelength He- 
Ne laser beam that is originally 1.00 mm in diameter? 


(b) If this laser is aimed at a mountain cliff 15.0 km away, how big will 
the illuminated spot be? 


(c) How big a spot would be illuminated on the Moon, neglecting 
atmospheric effects? (This might be done to hit a corner reflector to 
measure the round-trip time and, hence, distance.) Explicitly show 
how you follow the steps in Problem-Solving Strategies for Wave 
Optics. 


Exercise: 
Problem: 
A telescope can be used to enlarge the diameter of a laser beam and 
limit diffraction spreading. The laser beam is sent through the 


telescope in opposite the normal direction and can then be projected 
onto a satellite or the Moon. 


(a) If this is done with the Mount Wilson telescope, producing a 2.54- 
m-diameter beam of 633-nm light, what is the minimum angular 
spread of the beam? 


(b) Neglecting atmospheric effects, what is the size of the spot this 
beam would make on the Moon, assuming a lunar distance of 
3.84 x 10° m? 


Solution: 
(a) 3.04 x 10~" rad 


(b) Diameter of 235 m 


Exercise: 


Problem: 


The limit to the eye’s acuity is actually related to diffraction by the 
pupil. 


(a) What is the angle between two just-resolvable points of light for a 
3.00-mm-diameter pupil, assuming an average wavelength of 550 nm? 


(b) Take your result to be the practical limit for the eye. What is the 
greatest possible distance a car can be from you if you can resolve its 
two headlights, given they are 1.30 m apart? 


(c) What is the distance between two just-resolvable points held at an 
arm’s length (0.800 m) from your eye? 


(d) How does your answer to (c) compare to details you normally 
observe in everyday circumstances? 

Exercise: 
Problem: 
What is the minimum diameter mirror on a telescope that would allow 
you to see details as small as 5.00 km on the Moon some 384,000 km 


away? Assume an average wavelength of 550 nm for the light 
received. 


Solution: 


oo cm 
Exercise: 
Problem: 
You are told not to shoot until you see the whites of their eyes. If the 
eyes are separated by 6.5 cm and the diameter of your pupil is 5.0 mm, 


at what distance can you resolve the two eyes using light of 
wavelength 555 nm? 


Exercise: 


Problem: 


(a) The planet Pluto and its Moon Charon are separated by 19,600 km. 
Neglecting atmospheric effects, should the 5.08-m-diameter Mount 
Palomar telescope be able to resolve these bodies when they are 

4.50 x 10° km from Earth? Assume an average wavelength of 550 
nm. 


(b) In actuality, it is just barely possible to discern that Pluto and 
Charon are separate bodies using an Earth-based telescope. What are 
the reasons for this? 


Solution: 

(a) Yes. Should easily be able to discern. 

(b) The fact that it is just barely possible to discern that these are 

separate bodies indicates the severity of atmospheric aberrations. 
Exercise: 

Problem: 

The headlights of a car are 1.3 m apart. What is the maximum distance 


at which the eye can resolve these two headlights? Take the pupil 
diameter to be 0.40 cm. 


Exercise: 


Problem: 


When dots are placed on a page from a laser printer, they must be close 
enough so that you do not see the individual dots of ink. To do this, the 
separation of the dots must be less than Raleigh’s criterion. Take the 
pupil of the eye to be 3.0 mm and the distance from the paper to the 
eye of 35 cm; find the minimum separation of two dots such that they 
cannot be resolved. How many dots per inch (dpi) does this correspond 
to? 


Exercise: 


Problem: Unreasonable Results 


An amateur astronomer wants to build a telescope with a diffraction 
limit that will allow him to see if there are people on the moons of 
Jupiter. 


(a) What diameter mirror is needed to be able to see 1.00 m detail on a 
Jovian Moon at a distance of 7.50 x 10° km from Earth? The 
wavelength of light averages 600 nm. 


(b) What is unreasonable about this result? 


(c) Which assumptions are unreasonable or inconsistent? 


Exercise: 


Problem: Construct Your Own Problem 


Consider diffraction limits for an electromagnetic wave interacting 
with a circular object. Construct a problem in which you calculate the 
limit of angular resolution with a device, using this circular object 
(such as a lens, mirror, or antenna) to make observations. Also 
calculate the limit to spatial resolution (such as the size of features 
observable on the Moon) for observations at a specific distance from 
the device. Among the things to be considered are the wavelength of 
electromagnetic radiation used, the size of the circular object, and the 
distance to the system or phenomenon being observed. 


Glossary 


Rayleigh criterion 
two images are just resolvable when the center of the diffraction 
pattern of one is directly over the first minimum of the diffraction 
pattern of the other 


Thin Film Interference 
e Discuss the rainbow formation by thin films. 


The bright colors seen in an oil slick floating on water or in a sunlit soap 
bubble are caused by interference. The brightest colors are those that 
interfere constructively. This interference is between light reflected from 
different surfaces of a thin film; thus, the effect is known as thin film 
interference. As noticed before, interference effects are most prominent 
when light interacts with something having a size similar to its wavelength. 
A thin film is one having a thickness ¢ smaller than a few times the 
wavelength of light, A. Since color is associated indirectly with A and since 
all interference depends in some way on the ratio of A to the size of the 
object involved, we should expect to see different colors for different 
thicknesses of a film, as in [link]. 


These soap bubbles exhibit 
brilliant colors when exposed to 
sunlight. (credit: Scott 
Robinson, Flickr) 


What causes thin film interference? [link] shows how light reflected from 
the top and bottom surfaces of a film can interfere. Incident light is only 
partially reflected from the top surface of the film (ray 1). The remainder 
enters the film and is itself partially reflected from the bottom surface. Part 


of the light reflected from the bottom surface can emerge from the top of 
the film (ray 2) and interfere with light reflected from the top (ray 1). Since 
the ray that enters the film travels a greater distance, it may be in or out of 
phase with the ray reflected from the top. However, consider for a moment, 
again, the bubbles in [link]. The bubbles are darkest where they are 
thinnest. Furthermore, if you observe a soap bubble carefully, you will note 
it gets dark at the point where it breaks. For very thin films, the difference 
in path lengths of ray 1 and ray 2 in [link] is negligible; so why should they 
interfere destructively and not constructively? The answer is that a phase 
change can occur upon reflection. The rule is as follows: 


When light reflects from a medium having an index of refraction 
greater than that of the medium in which it is traveling, a 180° phase 
change (or a \/2 shift) occurs. 


Incident Si 


light 


n; 


l 
| 


Ng 


Light striking a thin 
film is partially 
reflected (ray 1) and 
partially refracted at 
the top surface. The 
refracted ray is 
partially reflected at 
the bottom surface 


and emerges as ray 2. 
These rays will 
interfere in a way that 
depends on the 
thickness of the film 
and the indices of 
refraction of the 
various media. 


If the film in [link] is a soap bubble (essentially water with air on both 
sides), then there is a \/2 shift for ray 1 and none for ray 2. Thus, when the 
film is very thin, the path length difference between the two rays is 
negligible, they are exactly out of phase, and destructive interference will 
occur at all wavelengths and so the soap bubble will be dark here. 


The thickness of the film relative to the wavelength of light is the other 
crucial factor in thin film interference. Ray 2 in [link] travels a greater 
distance than ray 1. For light incident perpendicular to the surface, ray 2 
travels a distance approximately 2¢ farther than ray 1. When this distance is 
an integral or half-integral multiple of the wavelength in the medium ( 

An = A/n, where 2 is the wavelength in vacuum and n is the index of 
refraction), constructive or destructive interference occurs, depending also 
on whether there is a phase change in either ray. 


Example: 

Calculating Non-reflective Lens Coating Using Thin Film Interference 
Sophisticated cameras use a series of several lenses. Light can reflect from 
the surfaces of these various lenses and degrade image clarity. To limit 
these reflections, lenses are coated with a thin layer of magnesium fluoride 
that causes destructive thin film interference. What is the thinnest this film 
can be, if its index of refraction is 1.38 and it is designed to limit the 
reflection of 550-nm light, normally the most intense visible wavelength? 
The index of refraction of glass is 1.52. 


Strategy 

Refer to [link] and use n; = 100 for air, ny = 1.38, and n3 = 1.52. Both 
ray 1 and ray 2 will have a \/2 shift upon reflection. Thus, to obtain 
destructive interference, ray 2 will need to travel a half wavelength farther 
than ray 1. For rays incident perpendicularly, the path length difference is 
Dab 


Solution 
To obtain destructive interference here, 
Equation: 
a 
7 7 
where A,,, is the wavelength in the film and is given by A, = a, 
Thus, 
Equation: 
A/n 
pe ale 
2 
Solving for ¢ and entering known values yields 
Equation: 
fae A/nz __ (550 nm)/1.38 
= es 4 
99.6 nm. 
Discussion 


Films such as the one in this example are most effective in producing 
destructive interference when the thinnest layer is used, since light over a 
broader range of incident angles will be reduced in intensity. These films 
are called non-reflective coatings; this is only an approximately correct 
description, though, since other wavelengths will only be partially 
cancelled. Non-reflective coatings are used in car windows and sunglasses. 


Thin film interference is most constructive or most destructive when the 
path length difference for the two rays is an integral or half-integral 
wavelength, respectively. That is, for rays incident perpendicularly, 

2t = An, 2A, 3A,,--- Or 2E = A_/2, 3A,,/2, 5A,,/2,.... To know whether 
interference is constructive or destructive, you must also determine if there 
is a phase change upon reflection. Thin film interference thus depends on 
film thickness, the wavelength of light, and the refractive indices. For white 
light incident on a film that varies in thickness, you will observe rainbow 
colors of constructive interference for various wavelengths as the thickness 
varies. 


Example: 

Soap Bubbles: More Than One Thickness can be Constructive 

(a) What are the three smallest thicknesses of a soap bubble that produce 
constructive interference for red light with a wavelength of 650 nm? The 
index of refraction of soap is taken to be the same as that of water. (b) 
What three smallest thicknesses will give destructive interference? 
Strategy and Concept 

Use [link] to visualize the bubble. Note that 2; = n3 = 1.00 for air, and 
N2 = 1.333 for soap (equivalent to water). There is a A /2 shift for ray 1 
reflected from the top surface of the bubble, and no shift for ray 2 reflected 
from the bottom surface. To get constructive interference, then, the path 
length difference (2¢) must be a half-integral multiple of the wavelength— 
the first three being A,,/2, 3A,,/2, and 5A,,/2. To get destructive 
interference, the path length difference must be an integral multiple of the 
wavelength—the first three being 0, A,,, and 2A,,. 

Solution for (a) 

Constructive interference occurs here when 

Equation: 


The smallest constructive thickness ¢, thus is 
Equation: 


— Ayn _. A/n __ (650 nm) /1.333 
COS foie ante at arene aes 


4 
22 nm. 


| 
— 


The next thickness that gives constructive interference is t/, = 3A,,/4, so 
that 
Equation: 


t/, = 366 nm. 


Finally, the third thickness producing constructive interference is 
ti. < 5A,,/4, so that 
Equation: 


tv. = 610 nm. 


Solution for (b) 

For destructive interference, the path length difference here is an integral 
multiple of the wavelength. The first occurs for zero thickness, since there 
is a phase change at the top surface. That is, 

Equation: 


s= 1. 


The first non-zero thickness producing destructive interference is 
Equation: 


2a Ne 
Substituting known values gives 
Equation: 
A/n 650 nm) /1.333 
try = Ae — Alm _ (650.nm/1.398 
= 244 nm. 


Finally, the third destructive thickness is 2¢//g = 2A,,, so that 
Equation: 


A 650 nm 
UU) = = ean 


488 nm. 


Discussion 

If the bubble was illuminated with pure red light, we would see bright and 
dark bands at very uniform increases in thickness. First would be a dark 
band at 0 thickness, then bright at 122 nm thickness, then dark at 244 nm, 
bright at 366 nm, dark at 488 nm, and bright at 610 nm. If the bubble 
varied smoothly in thickness, like a smooth wedge, then the bands would 
be evenly spaced. 


Another example of thin film interference can be seen when microscope 
slides are separated (see [link]). The slides are very flat, so that the wedge 
of air between them increases in thickness very uniformly. A phase change 
occurs at the second surface but not the first, and so there is a dark band 
where the slides touch. The rainbow colors of constructive interference 
repeat, going from violet to red again and again as the distance between the 
slides increases. As the layer of air increases, the bands become more 
difficult to see, because slight changes in incident angle have greater effects 
on path length differences. If pure-wavelength light instead of white light is 
used, then bright and dark bands are obtained rather than repeating rainbow 


colors. 


Angle shown 1’ 
larger than 2’ 
actual 


(a) The rainbow color bands are produced 
by thin film interference in the air between 
the two glass slides. (b) Schematic of the 


paths taken by rays in the wedge of air 
between the slides. 


An important application of thin film interference is found in the 
manufacturing of optical instruments. A lens or mirror can be compared 
with a master as it is being ground, allowing it to be shaped to an accuracy 
of less than a wavelength over its entire surface. [link] illustrates the 
phenomenon called Newton’s rings, which occurs when the plane surfaces 
of two lenses are placed together. (The circular bands are called Newton’s 
rings because Isaac Newton described them and their use in detail. Newton 
did not discover them; Robert Hooke did, and Newton did not believe they 
were due to the wave character of light.) Each successive ring of a given 
color indicates an increase of only one wavelength in the distance between 
the lens and the blank, so that great precision can be obtained. Once the lens 
is perfect, there will be no rings. 


“Newton's rings” 
interference fringes 
are produced when 
two plano-convex 

lenses are placed 
together with their 

plane surfaces in 
contact. The rings 
are created by 
interference 
between the light 
reflected off the 

two surfaces as a 

result of a slight 


gap between them, 
indicating that 
these surfaces are 
not precisely plane 
but are slightly 
convex. (credit: Ulf 
Seifert, Wikimedia 
Commons) 


The wings of certain moths and butterflies have nearly iridescent colors due 
to thin film interference. In addition to pigmentation, the wing’s color is 
affected greatly by constructive interference of certain wavelengths 
reflected from its film-coated surface. Car manufacturers are offering 
special paint jobs that use thin film interference to produce colors that 
change with angle. This expensive option is based on variation of thin film 
path length differences with angle. Security features on credit cards, 
banknotes, driving licenses and similar items prone to forgery use thin film 
interference, diffraction gratings, or holograms. Australia led the way with 
dollar bills printed on polymer with a diffraction grating security feature 
making the currency difficult to forge. Other countries such as New Zealand 
and Taiwan are using similar technologies, while the United States currency 
includes a thin film interference effect. 


Note: 

Making Connections: Take-Home Experiment—Thin Film Interference 
One feature of thin film interference and diffraction gratings is that the 
pattern shifts as you change the angle at which you look or move your 
head. Find examples of thin film interference and gratings around you. 
Explain how the patterns change for each specific example. Find examples 
where the thickness changes giving rise to changing colors. If you can find 
two microscope slides, then try observing the effect shown in [link]. Try 
separating one end of the two slides with a hair or maybe a thin piece of 
paper and observe the effect. 


Problem-Solving Strategies for Wave Optics 


Step 1. Examine the situation to determine that interference is involved. 
Identify whether slits or thin film interference are considered in the 
problem. 


Step 2. If slits are involved, note that diffraction gratings and double slits 
produce very similar interference patterns, but that gratings have narrower 
(sharper) maxima. Single slit patterns are characterized by a large central 
maximum and smaller maxima to the sides. 


Step 3. If thin film interference is involved, take note of the path length 
difference between the two rays that interfere. Be certain to use the 
wavelength in the medium involved, since it differs from the wavelength in 
vacuum. Note also that there is an additional X/2 phase shift when light 
reflects from a medium with a greater index of refraction. 


Step 4. Identify exactly what needs to be determined in the problem 
(identify the unknowns). A written list is useful. Draw a diagram of the 
situation. Labeling the diagram is useful. 


Step 5. Make a list of what is given or can be inferred from the problem as 
stated (identify the knowns). 


Step 6. Solve the appropriate equation for the quantity to be determined 
(the unknown), and enter the knowns. Slits, gratings, and the Rayleigh limit 
involve equations. 


Step 7. For thin film interference, you will have constructive interference 
for a total shift that is an integral number of wavelengths. You will have 
destructive interference for a total shift of a half-integral number of 
wavelengths. Always keep in mind that crest to crest is constructive 
whereas crest to trough is destructive. 


Step 8. Check to see if the answer is reasonable: Does it make sense? 
Angles in interference patterns cannot be greater than 90°, for example. 


Section Summary 


e Thin film interference occurs between the light reflected from the top 
and bottom surfaces of a film. In addition to the path length difference, 
there can be a phase change. 

¢ When light reflects from a medium having an index of refraction 
greater than that of the medium in which it is traveling, a 180° phase 
change (or a A/2 shift) occurs. 


Conceptual Questions 


Exercise: 
Problem: 
What effect does increasing the wedge angle have on the spacing of 


interference fringes? If the wedge angle is too large, fringes are not 
observed. Why? 


Exercise: 
Problem: 
How is the difference in paths taken by two originally in-phase light 


waves related to whether they interfere constructively or destructively? 
How can this be affected by reflection? By refraction? 


Exercise: 
Problem: 
Is there a phase change in the light reflected from either surface of a 


contact lens floating on a person’s tear layer? The index of refraction 
of the lens is about 1.5, and its top surface is dry. 


Exercise: 


Problem: 


In placing a sample on a microscope slide, a glass cover is placed over 
a water drop on the glass slide. Light incident from above can reflect 
from the top and bottom of the glass cover and from the glass slide 
below the water drop. At which surfaces will there be a phase change 
in the reflected light? 


Exercise: 
Problem: 
Answer the above question if the fluid between the two pieces of 
crown glass is carbon disulfide. 
Exercise: 
Problem: 
While contemplating the food value of a slice of ham, you notice a 
rainbow of color reflected from its moist surface. Explain its origin. 
Exercise: 
Problem: 
An inventor notices that a soap bubble is dark at its thinnest and 
realizes that destructive interference is taking place for all 
wavelengths. How could she use this knowledge to make a non- 
reflective coating for lenses that is effective at all wavelengths? That 


is, what limits would there be on the index of refraction and thickness 
of the coating? How might this be impractical? 


Exercise: 
Problem: 
A non-reflective coating like the one described in [link] works ideally 


for a single wavelength and for perpendicular incidence. What happens 
for other wavelengths and other incident directions? Be specific. 


Exercise: 


Problem: 


Why is it much more difficult to see interference fringes for light 
reflected from a thick piece of glass than from a thin film? Would it be 
easier if monochromatic light were used? 


Problems & Exercises 


Exercise: 
Problem: 
A soap bubble is 100 nm thick and illuminated by white light incident 
perpendicular to its surface. What wavelength and color of visible light 


is most constructively reflected, assuming the same index of refraction 
as water? 


Solution: 


532 nm (green) 
Exercise: 
Problem: 
An oil slick on water is 120 nm thick and illuminated by white light 
incident perpendicular to its surface. What color does the oil appear 


(what is the most constructively reflected wavelength), given its index 
of refraction is 1.40? 


Exercise: 
Problem: 
Calculate the minimum thickness of an oil slick on water that appears 
blue when illuminated by white light perpendicular to its surface. Take 


the blue wavelength to be 470 nm and the index of refraction of oil to 
be 1.40. 


Solution: 


83.9 nm 
Exercise: 
Problem: 
Find the minimum thickness of a soap bubble that appears red when 
illuminated by white light perpendicular to its surface. Take the 


wavelength to be 680 nm, and assume the same index of refraction as 
water. 


Exercise: 
Problem: 
A film of soapy water (n = 1.33) on top of a plastic cutting board has 


a thickness of 233 nm. What color is most strongly reflected if it is 
illuminated perpendicular to its surface? 


Solution: 


620 nm (orange) 

Exercise: 
Problem: 
What are the three smallest non-zero thicknesses of soapy water ( 
nm = 1.33) on Plexiglas if it appears green (constructively reflecting 
520-nm light) when illuminated perpendicularly by white light? 


Explicitly show how you follow the steps in Problem Solving 
Strategies for Wave Optics. 


Exercise: 


Problem: 


Suppose you have a lens system that is to be used primarily for 700- 
nm red light. What is the second thinnest coating of fluorite 
(magnesium fluoride) that would be non-reflective for this 
wavelength? 


Solution: 


380 nm 
Exercise: 


Problem: 


(a) As a soap bubble thins it becomes dark, because the path length 
difference becomes small compared with the wavelength of light and 
there is a phase shift at the top surface. If it becomes dark when the 
path length difference is less than one-fourth the wavelength, what is 
the thickest the bubble can be and appear dark at all visible 
wavelengths? Assume the same index of refraction as water. (b) 
Discuss the fragility of the film considering the thickness found. 


Exercise: 
Problem: 
A film of oil on water will appear dark when it is very thin, because 
the path length difference becomes small compared with the 
wavelength of light and there is a phase shift at the top surface. If it 
becomes dark when the path length difference is less than one-fourth 


the wavelength, what is the thickest the oil can be and appear dark at 
all visible wavelengths? Oil has an index of refraction of 1.40. 


Solution: 


33.9 nm 


Exercise: 


Problem: 


[link] shows two glass slides illuminated by pure-wavelength light 
incident perpendicularly. The top slide touches the bottom slide at one 
end and rests on a 0.100-mm-diameter hair at the other end, forming a 
wedge of air. (a) How far apart are the dark bands, if the slides are 7.50 
cm long and 589-nm light is used? (b) Is there any difference if the 
slides are made from crown or flint glass? Explain. 


Exercise: 


Problem: 


[link] shows two 7.50-cm-long glass slides illuminated by pure 589- 
nm wavelength light incident perpendicularly. The top slide touches 

the bottom slide at one end and rests on some debris at the other end, 
forming a wedge of air. How thick is the debris, if the dark bands are 
1.00 mm apart? 


Solution: 
4.42x10-°m 


Exercise: 


Problem: Repeat [link], but take the light to be incident at a 45° angle. 


Exercise: 


Problem: Repeat [link], but take the light to be incident at a 45° angle. 
Solution: 


The oil film will appear black, since the reflected light is not in the 
visible part of the spectrum. 


Exercise: 


Problem: Unreasonable Results 


To save money on making military aircraft invisible to radar, an 
inventor decides to coat them with a non-reflective material having an 
index of refraction of 1.20, which is between that of air and the surface 
of the plane. This, he reasons, should be much cheaper than designing 
Stealth bombers. (a) What thickness should the coating be to inhibit 
the reflection of 4.00-cm wavelength radar? (b) What is unreasonable 
about this result? (c) Which assumptions are unreasonable or 
inconsistent? 


Glossary 
thin film interference 


interference between light reflected from different surfaces of a thin 
film 


Polarization 


e Discuss the meaning of polarization. 
e Discuss the property of optical activity of certain materials. 


Polaroid sunglasses are familiar to most of us. They have a special ability to 
cut the glare of light reflected from water or glass (see [link]). Polaroids 
have this ability because of a wave characteristic of light called 
polarization. What is polarization? How is it produced? What are some of 
its uses? The answers to these questions are related to the wave character of 
light. 


(a) |  (b) 


These two photographs of a river show the 
effect of a polarizing filter in reducing glare 
in light reflected from the surface of water. 
Part (b) of this figure was taken with a 
polarizing filter and part (a) was not. As a 
result, the reflection of clouds and sky 
observed in part (a) is not observed in part 
(b). Polarizing sunglasses are particularly 
useful on snow and water. (credit: Amithshs, 
Wikimedia Commons) 


Light is one type of electromagnetic (EM) wave. As noted earlier, EM 
waves are transverse waves consisting of varying electric and magnetic 
fields that oscillate perpendicular to the direction of propagation (see 
[link]). There are specific directions for the oscillations of the electric and 
magnetic fields. Polarization is the attribute that a wave’s oscillations have 


a definite direction relative to the direction of propagation of the wave. 
(This is not the same type of polarization as that discussed for the 
separation of charges.) Waves having such a direction are said to be 
polarized. For an EM wave, we define the direction of polarization to be 
the direction parallel to the electric field. Thus we can think of the electric 
field arrows as showing the direction of polarization, as in [link]. 


polarization 


Ga: 


An EM wave, such as 
light, is a transverse 
wave. The electric and 
magnetic fields are 
perpendicular to the 
direction of propagation. 


To examine this further, consider the transverse waves in the ropes shown in 
[link]. The oscillations in one rope are in a vertical plane and are said to be 
vertically polarized. Those in the other rope are in a horizontal plane and 
are horizontally polarized. If a vertical slit is placed on the first rope, the 
waves pass through. However, a vertical slit blocks the horizontally 
polarized waves. For EM waves, the direction of the electric field is 
analogous to the disturbances on the ropes. 


— 


Direction of polarization 5 Direction of polarization 


(a) (b) 


The transverse oscillations in one rope 
are in a vertical plane, and those in the 
other rope are in a horizontal plane. 
The first is said to be vertically 
polarized, and the other is said to be 
horizontally polarized. Vertical slits 
pass vertically polarized waves and 
block horizontally polarized waves. 


The Sun and many other light sources produce waves that are randomly 
polarized (see [link]). Such light is said to be unpolarized because it is 
composed of many waves with all possible directions of polarization. 
Polaroid materials, invented by the founder of Polaroid Corporation, Edwin 
Land, act as a polarizing slit for light, allowing only polarization in one 
direction to pass through. Polarizing filters are composed of long molecules 
aligned in one direction. Thinking of the molecules as many slits, analogous 
to those for the oscillating ropes, we can understand why only light with a 
specific polarization can get through. The axis of a polarizing filter is the 
direction along which the filter passes the electric field of an EM wave (see 
[link]). 


Random polarization 


E 
Direction of ray 
(of propagation) 


The slender arrow 
represents a ray of 
unpolarized light. 
The bold arrows 
represent the 
direction of 
polarization of the 
individual waves 
composing the ray. 
Since the light is 
unpolarized, the 
arrows point in all 
directions. 


Polarizing filter 


Polarization 
se direction 


E Direction 


S of ray 


A polarizing filter has a polarization 
axis that acts as a slit passing through 
electric fields parallel to its direction. 
The direction of polarization of an EM 


wave is defined to be the direction of 
its electric field. 


[link] shows the effect of two polarizing filters on originally unpolarized 
light. The first filter polarizes the light along its axis. When the axes of the 
first and second filters are aligned (parallel), then all of the polarized light 
passed by the first filter is also passed by the second. If the second 
polarizing filter is rotated, only the component of the light parallel to the 
second filter’s axis is passed. When the axes are perpendicular, no light is 
passed by the second. 


Only the component of the EM wave parallel to the axis of a filter is passed. 
Let us call the angle between the direction of polarization and the axis of a 
filter 0. If the electric field has an amplitude E, then the transmitted part of 
the wave has an amplitude EF cos 6 (see [link]). Since the intensity of a 
wave is proportional to its amplitude squared, the intensity I of the 
transmitted wave is related to the incident wave by 

Equation: 


I = Ip cos? 8, 


where Jo is the intensity of the polarized wave before passing through the 
filter. (The above equation is known as Malus’s law.) 


Polarizing filter E Polarizing filter 


Polarizing filter Axis Polarizing filter 


Axis 


(a) (b) 


E Polarizing filter 


AXIS Polarizing filter 


Axis 


(c) (d) 


The effect of rotating two polarizing filters, where the 
first polarizes the light. (a) All of the polarized light is 
passed by the second polarizing filter, because its axis is 
parallel to the first. (b) As the second is rotated, only part 
of the light is passed. (c) When the second is 
perpendicular to the first, no light is passed. (d) In this 
photograph, a polarizing filter is placed above two 
others. Its axis is perpendicular to the filter on the right 
(dark area) and parallel to the filter on the left (lighter 
area). (credit: P.P. Urone) 


@ ce Polarizing filter 


A polarizing filter 
transmits only the 
component of the 
wave parallel to its 


axis, F cos 0, 
reducing the intensity 
of any light not 
polarized parallel to 
its axis. 


Example: 

Calculating Intensity Reduction by a Polarizing Filter 

What angle is needed between the direction of polarized light and the axis 
of a polarizing filter to reduce its intensity by 90.0%? 

Strategy 

When the intensity is reduced by 90.0%, it is 10.0% or 0.100 times its 
original value. That is, J = 0.100Jo. Using this information, the equation 
I = Ip cos? 6 can be used to solve for the needed angle. 

Solution 

Solving the equation J = Ip cos? 6 for cos @ and substituting with the 
relationship between J and Jo gives 


Equation: 
cos 6 = jz ~ ee = 0.3162. 
Solving for 8 yields 
Equation: 
9 = cos ' 0.3162 = 71.6’. 
Discussion 


A fairly large angle between the direction of polarization and the filter axis 
is needed to reduce the intensity to 10.0% of its original value. This seems 
reasonable based on experimenting with polarizing films. It is interesting 
that, at an angle of 45°, the intensity is reduced to 50% of its original value 
(as you will show in this section’s Problems & Exercises). Note that 71.6° 


is 18.4° from reducing the intensity to zero, and that at an angle of 18.4° 
the intensity is reduced to 90.0% of its original value (as you will also 
show in Problems & Exercises), giving evidence of symmetry. 


Polarization by Reflection 


By now you can probably guess that Polaroid sunglasses cut the glare in 
reflected light because that light is polarized. You can check this for 
yourself by holding Polaroid sunglasses in front of you and rotating them 
while looking at light reflected from water or glass. As you rotate the 
sunglasses, you will notice the light gets bright and dim, but not completely 
black. This implies the reflected light is partially polarized and cannot be 
completely blocked by a polarizing filter. 


[link] illustrates what happens when unpolarized light is reflected from a 
surface. Vertically polarized light is preferentially refracted at the surface, 
so that the reflected light is left more horizontally polarized. The reasons for 
this phenomenon are beyond the scope of this text, but a convenient 
mnemonic for remembering this is to imagine the polarization direction to 
be like an arrow. Vertical polarization would be like an arrow perpendicular 
to the surface and would be more likely to stick and not be reflected. 
Horizontal polarization is like an arrow bouncing on its side and would be 
more likely to be reflected. Sunglasses with vertical axes would then block 
more reflected light than unpolarized light from other sources. 


Unpolarized light Partially polarized light 


et Perpendicular to 
plane of paper 


Polarization by reflection. 
Unpolarized light has equal 
amounts of vertical and 
horizontal polarization. After 
interaction with a surface, the 
vertical components are 
preferentially absorbed or 
refracted, leaving the reflected 
light more horizontally 
polarized. This is akin to arrows 
striking on their sides bouncing 
off, whereas arrows striking on 
their tips go into the surface. 


Since the part of the light that is not reflected is refracted, the amount of 
polarization depends on the indices of refraction of the media involved. It 
can be shown that reflected light is completely polarized at a angle of 
reflection 0}, given by 

Equation: 


na 
tan & = —, 
ny 


where 7; is the medium in which the incident and reflected light travel and 
ny is the index of refraction of the medium that forms the interface that 
reflects the light. This equation is known as Brewster’s law, and 4, is 
known as Brewster’s angle, named after the 19th-century Scottish physicist 
who discovered them. 


Note: 

Things Great and Small: Atomic Explanation of Polarizing Filters 
Polarizing filters have a polarization axis that acts as a slit. This slit passes 
electromagnetic waves (often visible light) that have an electric field 
parallel to the axis. This is accomplished with long molecules aligned 
perpendicular to the axis as shown in [link]. 


D609 2@o°P DQG? 2O2A0C 
_,| Ax! 
DGQ092DO 2°®@oPDOOPF POD APC Long 
; ; a molecule 
D6IGPFDOP@oPDIWOLRPO 20 
D60G9PBWOSP2@o°P DGVOP 2O APG 
OGO?BO 2®oPVOO?P? PO APC 
DEODPOO)P@aVOOQOk? FO) 20C 
DGO0POO2@oP DBO? FOC) APC 
WGOPPBO P@?°-DGO# PO APE 
DEOOH?PBWO2@o?P DOP FPO APE 
DGOGPOOP@oP DAO? 2QC) 2G 


Ci 
DQEOGPOO23oP DOO? FO 2-00 


Long molecules are aligned 
perpendicular to the axis of a 
polarizing filter. The component 
of the electric field in an EM 
wave perpendicular to these 
molecules passes through the 
filter, while the component 
parallel to the molecules is 
absorbed. 


[link] illustrates how the component of the electric field parallel to the long 
molecules is absorbed. An electromagnetic wave is composed of 
oscillating electric and magnetic fields. The electric field is strong 
compared with the magnetic field and is more effective in exerting force on 
charges in the molecules. The most affected charged particles are the 
electrons in the molecules, since electron masses are small. If the electron 
is forced to oscillate, it can absorb energy from the EM wave. This reduces 
the fields in the wave and, hence, reduces its intensity. In long molecules, 
electrons can more easily oscillate parallel to the molecule than in the 
perpendicular direction. The electrons are bound to the molecule and are 
more restricted in their movement perpendicular to the molecule. Thus, the 
electrons can absorb EM waves that have a component of their electric 
field parallel to the molecule. The electrons are much less responsive to 
electric fields perpendicular to the molecule and will allow those fields to 
pass. Thus the axis of the polarizing filter is perpendicular to the length of 
the molecule. 


Enid = ga Long molecule 


= 
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to length of molecule 2° 
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i 


Artist’s conception of an 
electron in a long molecule 
oscillating parallel to the 
molecule. The oscillation of the 
electron absorbs energy and 


reduces the intensity of the 
component of the EM wave that 
is parallel to the molecule. 


Example: 

Calculating Polarization by Reflection 

(a) At what angle will light traveling in air be completely polarized 
horizontally when reflected from water? (b) From glass? 

Strategy 

All we need to solve these problems are the indices of refraction. Air has 
nm, = 1.00, water has no = 1.333, and crown glass has n/p = 1.520. The 
equation tan 0) = aa can be directly applied to find , in each case. 


Solution for (a) 
Putting the known quantities into the equation 


Equation: 
tan 04 = aes 
ny 
gives 
Equation: 
1.333 
fonts = = SE Se 
Ny 1.00 


Solving for the angle 6, yields 
Equation: 


6, = tan + 1.333 = 53.1°. 


Solution for (b) 
Similarly, for crown glass and air, 
Equation: 


NI5 1.520 


tan 0h, = 7 = SL = Ds 
‘Thus, 
Equation: 
Op —tane 152 — 56.7 
Discussion 


Light reflected at these angles could be completely blocked by a good 
polarizing filter held with its axis vertical. Brewster’s angle for water and 
air are similar to those for glass and air, so that sunglasses are equally 
effective for light reflected from either water or glass under similar 
circumstances. Light not reflected is refracted into these media. So at an 
incident angle equal to Brewster’s angle, the refracted light will be slightly 
polarized vertically. It will not be completely polarized vertically, because 
only a small fraction of the incident light is reflected, and so a significant 
amount of horizontally polarized light is refracted. 


Polarization by Scattering 


If you hold your Polaroid sunglasses in front of you and rotate them while 
looking at blue sky, you will see the sky get bright and dim. This is a clear 
indication that light scattered by air is partially polarized. [link] helps 
illustrate how this happens. Since light is a transverse EM wave, it vibrates 
the electrons of air molecules perpendicular to the direction it is traveling. 
The electrons then radiate like small antennae. Since they are oscillating 
perpendicular to the direction of the light ray, they produce EM radiation 
that is polarized perpendicular to the direction of the ray. When viewing the 
light along a line perpendicular to the original ray, as in [link], there can be 
no polarization in the scattered light parallel to the original ray, because that 
would require the original ray to be a longitudinal wave. Along other 
directions, a component of the other polarization can be projected along the 
line of sight, and the scattered light will only be partially polarized. 
Furthermore, multiple scattering can bring light to your eyes from other 
directions and can contain different polarizations. 


Unpolarized Molecule 


sunlight Unpolarized 
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polarized 
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Polarization by scattering. 
Unpolarized light scattering from air 
molecules shakes their electrons 
perpendicular to the direction of the 
original ray. The scattered light 
therefore has a polarization 
perpendicular to the original direction 
and none parallel to the original 
direction. 


Photographs of the sky can be darkened by polarizing filters, a trick used by 
many photographers to make clouds brighter by contrast. Scattering from 
other particles, such as smoke or dust, can also polarize light. Detecting 
polarization in scattered EM waves can be a useful analytical tool in 
determining the scattering source. 


There is a range of optical effects used in sunglasses. Besides being 
Polaroid, other sunglasses have colored pigments embedded in them, while 
others use non-reflective or even reflective coatings. A recent development 
is photochromic lenses, which darken in the sunlight and become clear 
indoors. Photochromic lenses are embedded with organic microcrystalline 
molecules that change their properties when exposed to UV in sunlight, but 
become clear in artificial lighting with no UV. 


Note: 

Take-Home Experiment: Polarization 

Find Polaroid sunglasses and rotate one while holding the other still and 
look at different surfaces and objects. Explain your observations. What is 
the difference in angle from when you see a maximum intensity to when 
you see a minimum intensity? Find a reflective glass surface and do the 
same. At what angle does the glass need to be oriented to give minimum 
glare? 


Liquid Crystals and Other Polarization Effects in Materials 


While you are undoubtedly aware of liquid crystal displays (LCDs) found 
in watches, calculators, computer screens, cellphones, flat screen 
televisions, and other myriad places, you may not be aware that they are 
based on polarization. Liquid crystals are so named because their molecules 
can be aligned even though they are in a liquid. Liquid crystals have the 
property that they can rotate the polarization of light passing through them 
by 90°. Furthermore, this property can be turned off by the application of a 
voltage, as illustrated in [link]. It is possible to manipulate this 
characteristic quickly and in small well-defined regions to create the 
contrast patterns we see in so many LCD devices. 


In flat screen LCD televisions, there is a large light at the back of the TV. 
The light travels to the front screen through millions of tiny units called 
pixels (picture elements). One of these is shown in [link] (a) and (b). Each 
unit has three cells, with red, blue, or green filters, each controlled 
independently. When the voltage across a liquid crystal is switched off, the 
liquid crystal passes the light through the particular filter. One can vary the 
picture contrast by varying the strength of the voltage applied to the liquid 
crystal. 
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(a) Polarized light is 
rotated 90° by a liquid 
crystal and then passed by 
a polarizing filter that has 
its axis perpendicular to 
the original polarization 
direction. (b) When a 
voltage is applied to the 
liquid crystal, the polarized 
light is not rotated and is 
blocked by the filter, 
making the region dark in 
comparison with its 
surroundings. (c) LCDs 


can be made color specific, 
small, and fast enough to 
use in laptop computers 
and TVs. (credit: Jon 
Sullivan) 


Many crystals and solutions rotate the plane of polarization of light passing 
through them. Such substances are said to be optically active. Examples 
include sugar water, insulin, and collagen (see [link]). In addition to 
depending on the type of substance, the amount and direction of rotation 
depends on a number of factors. Among these is the concentration of the 
substance, the distance the light travels through it, and the wavelength of 
light. Optical activity is due to the asymmetric shape of molecules in the 
substance, such as being helical. Measurements of the rotation of polarized 
light passing through substances can thus be used to measure 
concentrations, a standard technique for sugars. It can also give information 
on the shapes of molecules, such as proteins, and factors that affect their 
shapes, such as temperature and pH. 


E__ Polarizing filter 


Optical activity is the ability of some 
substances to rotate the plane of 
polarization of light passing through 
them. The rotation is detected with a 
polarizing filter or analyzer. 


Glass and plastic become optically active when stressed; the greater the 
stress, the greater the effect. Optical stress analysis on complicated shapes 
can be performed by making plastic models of them and observing them 
through crossed filters, as seen in [link]. It is apparent that the effect 
depends on wavelength as well as stress. The wavelength dependence is 
sometimes also used for artistic purposes. 


Optical stress 
analysis of a plastic 
lens placed 
between crossed 
polarizers. (credit: 
Infopro, Wikimedia 
Commons) 


Another interesting phenomenon associated with polarized light is the 
ability of some crystals to split an unpolarized beam of light into two. Such 
crystals are said to be birefringent (see [link]). Each of the separated rays 
has a specific polarization. One behaves normally and is called the ordinary 
ray, whereas the other does not obey Snell’s law and is called the 


extraordinary ray. Birefringent crystals can be used to produce polarized 
beams from unpolarized light. Some birefringent materials preferentially 
absorb one of the polarizations. These materials are called dichroic and can 
produce polarization by this preferential absorption. This is fundamentally 
how polarizing filters and other polarizers work. The interested reader is 
invited to further pursue the numerous properties of materials related to 
polarization. 


Birefringent materials, such as 
the common mineral calcite, 
split unpolarized beams of light 
into two. The ordinary ray 
behaves as expected, but the 
extraordinary ray does not obey 
Snell’s law. 


Section Summary 


e Polarization is the attribute that wave oscillations have a definite 
direction relative to the direction of propagation of the wave. 

e EM waves are transverse waves that may be polarized. 

e The direction of polarization is defined to be the direction parallel to 
the electric field of the EM wave. 

e Unpolarized light is composed of many rays having random 
polarization directions. 


e Light can be polarized by passing it through a polarizing filter or other 
polarizing material. The intensity J of polarized light after passing 
through a polarizing filter is J = Ip cos? 8, where Jp is the original 
intensity and @ is the angle between the direction of polarization and 
the axis of the filter. 

e Polarization is also produced by reflection. 

e Brewster’s law states that reflected light will be completely polarized 
at the angle of reflection 6, known as Brewster’s angle, given by a 
statement known as Brewster’s law: tan 0, = re where 7 is the 


medium in which the incident and reflected light travel and nz is the 
index of refraction of the medium that forms the interface that reflects 
the light. 

e Polarization can also be produced by scattering. 

e There are a number of types of optically active substances that rotate 
the direction of polarization of light passing through them. 


Conceptual Questions 


Exercise: 
Problem: 
Under what circumstances is the phase of light changed by reflection? 
Is the phase related to polarization? 


Exercise: 


Problem: Can a sound wave in air be polarized? Explain. 
Exercise: 

Problem: 

No light passes through two perfect polarizing filters with 

perpendicular axes. However, if a third polarizing filter is placed 


between the original two, some light can pass. Why is this? Under 
what circumstances does most of the light pass? 


Exercise: 


Problem: 
Explain what happens to the energy carried by light that it is dimmed 
by passing it through two crossed polarizing filters. 

Exercise: 
Problem: 
When particles scattering light are much smaller than its wavelength, 
the amount of scattering is proportional to 1/A*. Does this mean there 


is more scattering for small A than large A? How does this relate to the 
fact that the sky is blue? 


Exercise: 
Problem: 
Using the information given in the preceding question, explain why 
sunsets are red. 

Exercise: 
Problem: 
When light is reflected at Brewster’s angle from a smooth surface, it is 
100% polarized parallel to the surface. Part of the light will be 
refracted into the surface. Describe how you would do an experiment 
to determine the polarization of the refracted light. What direction 


would you expect the polarization to have and would you expect it to 
be 100%? 


Problems & Exercises 


Exercise: 


Problem: 


What angle is needed between the direction of polarized light and the 
axis of a polarizing filter to cut its intensity in half? 


Solution: 


45.0° 
Exercise: 
Problem: 
The angle between the axes of two polarizing filters is 45.0°. By how 


much does the second filter reduce the intensity of the light coming 
through the first? 


Exercise: 
Problem: 
If you have completely polarized light of intensity 150 W/m/2, what 


will its intensity be after passing through a polarizing filter with its 
axis at an 89.0° angle to the light’s polarization direction? 


Solution: 


45.7 mW /m? 
Exercise: 
Problem: 
What angle would the axis of a polarizing filter need to make with the 


direction of polarized light of intensity 1.00 kW/ m? to reduce the 
intensity to 10.0 W/m”? 


Exercise: 
Problem: 
At the end of [link], it was stated that the intensity of polarized light is 
reduced to 90.0% of its original value by passing through a polarizing 


filter with its axis at an angle of 18.4° to the direction of polarization. 
Verify this statement. 


Solution: 


90.0% 

Exercise: 
Problem: 
Show that if you have three polarizing filters, with the second at an 
angle of 45° to the first and the third at an angle of 90.0° to the first, 
the intensity of light passed by the first will be reduced to 25.0% of its 
value. (This is in contrast to having only the first and third, which 


reduces the intensity to zero, so that placing the second between them 
increases the intensity of the transmitted light.) 


Exercise: 
Problem: 
Prove that, if I is the intensity of light transmitted by two polarizing 
filters with axes at an angle 6 and J7 is the intensity when the axes are 
at an angle 90.0° — 0, then J + I/= Ip, the original intensity. (Hint: 


Use the trigonometric identities cos (90.0° — 0) = sin 6 and 
cos? § + sin? 9 = 1.) 


Solution: 


Ip 
Exercise: 
Problem: 
At what angle will light reflected from diamond be completely 
polarized? 
Exercise: 
Problem: 


What is Brewster’s angle for light traveling in water that is reflected 
from crown glass? 


Solution: 


48.8° 
Exercise: 
Problem: 
A scuba diver sees light reflected from the water’s surface. At what 
angle will this light be completely polarized? 
Exercise: 
Problem: 


At what angle is light inside crown glass completely polarized when 
reflected from water, as in a fish tank? 


Solution: 


41.2° 
Exercise: 
Problem: 
Light reflected at 55.6° from a window is completely polarized. What 


is the window’s index of refraction and the likely substance of which it 
is made? 


Exercise: 


Problem: 


(a) Light reflected at 62.5° from a gemstone in a ring is completely 
polarized. Can the gem be a diamond? (b) At what angle would the 
light be completely polarized if the gem was in water? 


Solution: 
(a) 1.92, not diamond (Zircon) 


(b) 55.2° 


Exercise: 


Problem: 


If #, is Brewster’s angle for light reflected from the top of an interface 
between two substances, and @/, is Brewster’s angle for light reflected 
from below, prove that # + 04, = 90.0°. 


Exercise: 


Problem: Integrated Concepts 


If a polarizing filter reduces the intensity of polarized light to 50.0% of 
its original value, by how much are the electric and magnetic fields 
reduced? 


Solution: 


By = 0.707 By, 


Exercise: 


Problem: Integrated Concepts 


Suppose you put on two pairs of Polaroid sunglasses with their axes at 
an angle of 15.0°. How much longer will it take the light to deposit a 
given amount of energy in your eye compared with a single pair of 
sunglasses? Assume the lenses are clear except for their polarizing 
characteristics. 


Exercise: 


Problem: Integrated Concepts 


(a) On a day when the intensity of sunlight is 1.00 kW/ m2, a circular 
lens 0.200 m in diameter focuses light onto water in a black beaker. 
Two polarizing sheets of plastic are placed in front of the lens with 
their axes at an angle of 20.0°. Assuming the sunlight is unpolarized 
and the polarizers are 100% efficient, what is the initial rate of heating 
of the water in °C/s, assuming it is 80.0% absorbed? The aluminum 


beaker has a mass of 30.0 grams and contains 250 grams of water. (b) 
Do the polarizing filters get hot? Explain. 


Solution: 
(a) 2.07 x 10-2 °C/s 


(b) Yes, the polarizing filters get hot because they absorb some of the 
lost energy from the sunlight. 


Glossary 


axis of a polarizing filter 
the direction along which the filter passes the electric field of an EM 
wave 


birefringent 
crystals that split an unpolarized beam of light into two beams 


Brewster’s angle 


6, = tan! a where 7 is the index of refraction of the medium 


from which the light is reflected and 7, is the index of refraction of the 
medium in which the reflected light travels 


Brewster’s law 
tan 6, = a where 7, is the medium in which the incident and 


reflected light travel and nz is the index of refraction of the medium 
that forms the interface that reflects the light 


direction of polarization 
the direction parallel to the electric field for EM waves 


horizontally polarized 
the oscillations are in a horizontal plane 


optically active 


substances that rotate the plane of polarization of light passing through 
them 


polarization 
the attribute that wave oscillations have a definite direction relative to 
the direction of propagation of the wave 


polarized 
waves having the electric and magnetic field oscillations in a definite 
direction 


reflected light that is completely polarized 
light reflected at the angle of reflection 64, known as Brewster’s angle 


unpolarized 
waves that are randomly polarized 


vertically polarized 
the oscillations are in a vertical plane 


Introduction to Special Relativity 
class="introduction" 


Special 
relativity 
explains 

why 
traveling to 
other star 
systems, 
such as these 
in the Orion 
Nebula, is 
unreasonabl 
e using our 
current level 
of 
technology. 
(credit: s58y, 
Flickr) 


Have you ever looked up at the night sky and dreamed of traveling to other 
planets in faraway star systems? Would there be other life forms? What 
would other worlds look like? You might imagine that such an amazing trip 


would be possible if we could just travel fast enough, but you will read in 
this chapter why this is not true. In 1905 Albert Einstein developed the 
theory of special relativity. This theory explains the limit on an object’s 
speed and describes the consequences. 


Relativity. The word relativity might conjure an image of Einstein, but the 
idea did not begin with him. People have been exploring relativity for many 
centuries. Relativity is the study of how different observers measure the 
same event. Galileo and Newton developed the first correct version of 
classical relativity. Einstein developed the modern theory of relativity. 
Modern relativity is divided into two parts. Special relativity deals with 
observers who are moving at constant velocity. General relativity deals with 
observers who are undergoing acceleration. Einstein is famous because his 
theories of relativity made revolutionary predictions. Most importantly, his 
theories have been verified to great precision in a vast range of experiments, 
altering forever our concept of space and time. 


Many people think 
that Albert Einstein 
(1879-1955) was 
the greatest 
physicist of the 
20th century. Not 
only did he develop 
modern relativity, 
thus 


revolutionizing our 
concept of the 
universe, he also 
made fundamental 
contributions to the 
foundations of 
quantum 
mechanics. (credit: 
The Library of 
Congress) 


It is important to note that although classical mechanics, in general, and 
classical relativity, in particular, are limited, they are extremely good 
approximations for large, slow-moving objects. Otherwise, we could not 
use classical physics to launch satellites or build bridges. In the classical 
limit (objects larger than submicroscopic and moving slower than about 1% 
of the speed of light), relativistic mechanics becomes the same as classical 
mechanics. This fact will be noted at appropriate places throughout this 
chapter. 


Einstein’s Postulates 


e State and explain both of Einstein’s postulates. 
e Explain what an inertial frame of reference is. 
e Describe one way the speed of light can be changed. 


Special relativity 
resembles trigonometry in 
that both are reliable 
because they are based on 
postulates that flow one 
from another in a logical 
way. (credit: Jon Oakley, 
Flickr) 


Have you ever used the Pythagorean Theorem and gotten a wrong answer? 
Probably not, unless you made a mistake in either your algebra or your 
arithmetic. Each time you perform the same calculation, you know that the 
answer will be the same. Trigonometry is reliable because of the certainty 
that one part always flows from another in a logical way. Each part is based 
on a set of postulates, and you can always connect the parts by applying 
those postulates. Physics is the same way with the exception that all parts 
must describe nature. If we are careful to choose the correct postulates, then 
our theory will follow and will be verified by experiment. 


Einstein essentially did the theoretical aspect of this method for relativity. 
With two deceptively simple postulates and a careful consideration of how 
measurements are made, he produced the theory of special relativity. 


Einstein’s First Postulate 


The first postulate upon which Einstein based the theory of special relativity 
relates to reference frames. All velocities are measured relative to some 
frame of reference. For example, a car’s motion is measured relative to its 
Starting point or the road it is moving over, a projectile’s motion is 
measured relative to the surface it was launched from, and a planet’s orbit is 
measured relative to the star it is orbiting around. The simplest frames of 
reference are those that are not accelerated and are not rotating. Newton’s 
first law, the law of inertia, holds exactly in such a frame. 


Note: 

Inertial Reference Frame 

An inertial frame of reference is a reference frame in which a body at rest 
remains at rest and a body in motion moves at a constant speed in a straight 
line unless acted on by an outside force. 


The laws of physics seem to be simplest in inertial frames. For example, 
when you are in a plane flying at a constant altitude and speed, physics 
seems to work exactly the same as if you were standing on the surface of 
the Earth. However, in a plane that is taking off, matters are somewhat more 
complicated. In these cases, the net force on an object, F’, is not equal to the 
product of mass and acceleration, ma. Instead, F' is equal to ma plus a 
fictitious force. This situation is not as simple as in an inertial frame. Not 
only are laws of physics simplest in inertial frames, but they should be the 
same in all inertial frames, since there is no preferred frame and no absolute 
motion. Einstein incorporated these ideas into his first postulate of special 
relativity. 


Note: 

First Postulate of Special Relativity 

The laws of physics are the same and can be stated in their simplest form 
in all inertial frames of reference. 


As with many fundamental statements, there is more to this postulate than 
meets the eye. The laws of physics include only those that satisfy this 
postulate. We shall find that the definitions of relativistic momentum and 
energy must be altered to fit. Another outcome of this postulate is the 
famous equation EF = mc?. 


Einstein’s Second Postulate 


The second postulate upon which Einstein based his theory of special 
relativity deals with the speed of light. Late in the 19th century, the major 
tenets of classical physics were well established. Two of the most important 
were the laws of electricity and magnetism and Newton’s laws. In 
particular, the laws of electricity and magnetism predict that light travels at 
c = 3.00 x 10°m /s in a vacuum, but they do not specify the frame of 
reference in which light has this speed. 


There was a contradiction between this prediction and Newton’s laws, in 
which velocities add like simple vectors. If the latter were true, then two 
observers moving at different speeds would see light traveling at different 
speeds. Imagine what a light wave would look like to a person traveling 
along with it at a speed c. If such a motion were possible then the wave 
would be stationary relative to the observer. It would have electric and 
magnetic fields that varied in strength at various distances from the 
observer but were constant in time. This is not allowed by Maxwell’s 
equations. So either Maxwell’s equations are wrong, or an object with mass 
cannot travel at speed c. Einstein concluded that the latter is true. An object 
with mass cannot travel at speed c. This conclusion implies that light in a 
vacuum must always travel at speed c relative to any observer. Maxwell’s 
equations are correct, and Newton’s addition of velocities is not correct for 
light. 


Investigations such as Young’s double slit experiment in the early-1800s 
had convincingly demonstrated that light is a wave. Many types of waves 
were known, and all travelled in some medium. Scientists therefore 
assumed that a medium carried light, even in a vacuum, and light travelled 
at a speed c relative to that medium. Starting in the mid-1880s, the 
American physicist A. A. Michelson, later aided by E. W. Morley, made a 
series of direct measurements of the speed of light. The results of their 
measurements were Startling. 


Note: 

Michelson-Morley Experiment 

The Michelson-Morley experiment demonstrated that the speed of light 
in a vacuum is independent of the motion of the Earth about the Sun. 


The eventual conclusion derived from this result is that light, unlike 
mechanical waves such as sound, does not need a medium to carry it. 
Furthermore, the Michelson-Morley results implied that the speed of light c 
is independent of the motion of the source relative to the observer. That is, 
everyone observes light to move at speed c regardless of how they move 
relative to the source or one another. For a number of years, many scientists 
tried unsuccessfully to explain these results and still retain the general 
applicability of Newton’s laws. 


It was not until 1905, when Einstein published his first paper on special 
relativity, that the currently accepted conclusion was reached. Based mostly 
on his analysis that the laws of electricity and magnetism would not allow 
another speed for light, and only slightly aware of the Michelson-Morley 
experiment, Einstein detailed his second postulate of special relativity. 


Note: 
Second Postulate of Special Relativity 


The speed of light c is a constant, independent of the relative motion of the 
source. 


Deceptively simple and counterintuitive, this and the first postulate leave all 
else open for change. Some fundamental concepts do change. Among the 
changes are the loss of agreement on the elapsed time for an event, the 
variation of distance with speed, and the realization that matter and energy 
can be converted into one another. You will read about these concepts in the 
following sections. 


Note: 

Misconception Alert: Constancy of the Speed of Light 

The speed of light is a constant c = 3.00 x 10° m/s ina vacuum. If you 
remember the effect of the index of refraction from The Law of Refraction, 
the speed of light is lower in matter. 


Exercise: 
Check Your Understanding 


Problem: Explain how special relativity differs from general relativity. 


Solution: 
Answer 


Special relativity applies only to unaccelerated motion, but general 
relativity applies to accelerated motion. 
Section Summary 


¢ Relativity is the study of how different observers measure the same 
event. 


e Modern relativity is divided into two parts. Special relativity deals 
with observers who are in uniform (unaccelerated) motion, whereas 
general relativity includes accelerated relative motion and gravity. 
Modern relativity is correct in all circumstances and, in the limit of 
low velocity and weak gravitation, gives the same predictions as 
classical relativity. 

e An inertial frame of reference is a reference frame in which a body at 
rest remains at rest and a body in motion moves at a constant speed in 
a straight line unless acted on by an outside force. 

¢ Modern relativity is based on Ejinstein’s two postulates. The first 
postulate of special relativity is the idea that the laws of physics are the 
same and can be stated in their simplest form in all inertial frames of 
reference. The second postulate of special relativity is the idea that the 
speed of light c is a constant, independent of the relative motion of the 
source. 

e The Michelson-Morley experiment demonstrated that the speed of 
light in a vacuum is independent of the motion of the Earth about the 
Sun. 


Conceptual Questions 


Exercise: 
Problem: 
Which of Einstein’s postulates of special relativity includes a concept 
that does not fit with the ideas of classical physics? Explain. 
Exercise: 
Problem: 
Is Earth an inertial frame of reference? Is the Sun? Justify your 
response. 


Exercise: 


Problem: 


When you are flying in a commercial jet, it may appear to you that the 
airplane is stationary and the Earth is moving beneath you. Is this point 
of view valid? Discuss briefly. 


Glossary 


relativity 
the study of how different observers measure the same event 


special relativity 
the theory that, in an inertial frame of reference, the motion of an 
object is relative to the frame from which it is viewed or measured 


inertial frame of reference 
a reference frame in which a body at rest remains at rest and a body in 
motion moves at a constant speed in a straight line unless acted on by 
an outside force 


first postulate of special relativity 
the idea that the laws of physics are the same and can be stated in their 
simplest form in all inertial frames of reference 


second postulate of special relativity 
the idea that the speed of light c is a constant, independent of the 
source 


Michelson-Morley experiment 
an investigation performed in 1887 that proved that the speed of light 
in a vacuum is the same in all frames of reference from which it is 
viewed 


Simultaneity And Time Dilation 


e Describe simultaneity. 

e Describe time dilation. 

e Calculate y. 

e Compare proper time and the observer’s measured time. 
e Explain why the twin paradox is a false paradox. 


Elapsed time for a foot 
race is the same for all 
observers, but at 
relativistic speeds, 
elapsed time depends on 
the relative motion of the 
observer and the event 
that is observed. (credit: 
Jason Edward Scott Bain, 
Flickr) 


Do time intervals depend on who observes them? Intuitively, we expect the 
time for a process, such as the elapsed time for a foot race, to be the same 
for all observers. Our experience has been that disagreements over elapsed 
time have to do with the accuracy of measuring time. When we carefully 
consider just how time is measured, however, we will find that elapsed time 
depends on the relative motion of an observer with respect to the process 
being measured. 


Simultaneity 


Consider how we measure elapsed time. If we use a stopwatch, for 
example, how do we know when to start and stop the watch? One method is 
to use the arrival of light from the event, such as observing a light turning 
green to start a drag race. The timing will be more accurate if some sort of 
electronic detection is used, avoiding human reaction times and other 
complications. 


Now suppose we use this method to measure the time interval between two 
flashes of light produced by flash lamps. (See [link].) Two flash lamps with 
observer A midway between them are on a rail car that moves to the right 
relative to observer B. Observer B arranges for the light flashes to be 
emitted just as A passes B, so that both A and B are equidistant from the 
lamps when the light is emitted. Observer B measures the time interval 
between the arrival of the light flashes. According to postulate 2, the speed 
of light is not affected by the motion of the lamps relative to B. Therefore, 
light travels equal distances to him at equal speeds. Thus observer B 
measures the flashes to be simultaneous. 


Observer B measures the elapsed time 
between the arrival of light flashes as 
described in the text. Observer A 
moves with the lamps on a rail car. 
Observer B perceives that the light 
flashes occurred simultaneously. 
Observer A perceives that the light on 
the right flashes before the light on the 
left. 


Now consider what observer B sees happen to observer A. Observer B 
perceives light from the right reaching observer A before light from the left, 
because she has moved towards that flash lamp, lessening the distance the 
light must travel and reducing the time it takes to get to her. Light travels at 
speed c relative to both observers, but observer B remains equidistant 
between the points where the flashes were emitted, while A gets closer to 
the emission point on the right. From observer B’s point of view, then, there 
is a time interval between the arrival of the flashes to observer A. In 
observer A's frame of reference, the flashes occur at different times. 
Observer B measures the flashes to arrive simultaneously relative to him 
but not relative to A. 


Now consider what observer A sees happening. She sees the light from the 
right arriving before light from the left. Since both lamps are the same 
distance from her in her reference frame, from her perspective, the right 
flash occurred before the left flash. Here a relative velocity between 
observers affects whether two events are observed to be simultaneous. 
Simultaneity is not absolute 


This illustrates the power of clear thinking. We might have guessed 
incorrectly that if light is emitted simultaneously, then two observers 
halfway between the sources would see the flashes simultaneously. But 
careful analysis shows this not to be the case. Einstein was brilliant at this 
type of thought experiment (in German, “Gedankenexperiment”). He very 
carefully considered how an observation is made and disregarded what 


might seem obvious. The validity of thought experiments, of course, is 
determined by actual observation. The genius of Einstein is evidenced by 
the fact that experiments have repeatedly confirmed his theory of relativity. 


In summary: Two events are defined to be simultaneous if an observer 
measures them as occurring at the same time (such as by receiving light 
from the events). Two events are not necessarily simultaneous to all 
observers. 


Time Dilation 


The consideration of the measurement of elapsed time and simultaneity 
leads to an important relativistic effect. 


Note: 

Time dilation 

Time dilation is the phenomenon of time passing slower for an observer 
who is moving relative to another observer. 


Suppose, for example, an astronaut measures the time it takes for light to 
cross her ship, bounce off a mirror, and return. (See [link].) How does the 
elapsed time the astronaut measures compare with the elapsed time 
measured for the same event by a person on the Earth? Asking this question 
(another thought experiment) produces a profound result. We find that the 
elapsed time for a process depends on who is measuring it. In this case, the 
time measured by the astronaut is smaller than the time measured by the 
Earth-bound observer. The passage of time is different for the observers 
because the distance the light travels in the astronaut’s frame is smaller than 
in the Earth-bound frame. Light travels at the same speed in each frame, 
and so it will take longer to travel the greater distance in the Earth-bound 
frame. 
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(c) 


(a) An astronaut measures the time Atg for light to 
cross her ship using an electronic timer. Light 
travels a distance 2D in the astronaut’s frame. (b) 
A person on the Earth sees the light follow the 
longer path 2s and take a longer time At. (c) 
These triangles are used to find the relationship 
between the two distances 2D and 2s. 


To quantitatively verify that time depends on the observer, consider the 
paths followed by light as seen by each observer. (See [link](c).) The 
astronaut sees the light travel straight across and back for a total distance of 
2D, twice the width of her ship. The Earth-bound observer sees the light 
travel a total distance 2s. Since the ship is moving at speed v to the right 
relative to the Earth, light moving to the right hits the mirror in this frame. 
Light travels at a speed c in both frames, and because time is the distance 
divided by speed, the time measured by the astronaut is 

Equation: 


2D 
Atty = —. 
Cc 


This time has a separate name to distinguish it from the time measured by 
the Earth-bound observer. 


Note: 

Proper Time 

Proper time Ato is the time measured by an observer at rest relative to the 
event being observed. 


In the case of the astronaut observe the reflecting light, the astronaut 
measures proper time. The time measured by the Earth-bound observer is 
Equation: 


9 
Ma =. 
Cc 


To find the relationship between Atg and At, consider the triangles formed 
by D and s. (See [link](c).) The third side of these similar triangles is L, the 
distance the astronaut moves as the light goes across her ship. In the frame 
of the Earth-bound observer, 

Equation: 


Using the Pythagorean Theorem, the distance s is found to be 
Equation: 


At\2 
$= p+ (2) : 


Substituting s into the expression for the time interval At gives 
Equation: 


We square this equation, which yields 
Equation: 


4(p24 =") 2 9 
AD 
(At)? = ~—__—_* = = 4 * (au)? 


C2 Cc 


Note that if we square the first expression we had for Ato, we get 

2 
(Ato)? = se. This term appears in the preceding equation, giving us a 
means to relate the two time intervals. Thus, 


Equation: 
2 

(At)? = (Ato)? + = (At), 
Gathering terms, we solve for At: 
Equation: 

2 ao 2 

(Az)*| 1— oo i (Ato) 

Thus, 


Equation: 


Ato)? 
(az)? = ‘Ato 0) ; 
to 


Taking the square root yields an important relationship between elapsed 
times: 


Equation: 
At 
At = : = yAt, 
2 
yi-5 
where 
Equation: 


This equation for At is truly remarkable. First, as contended, elapsed time 
is not the same for different observers moving relative to one another, even 
though both are in inertial frames. Proper time Atg measured by an 
observer, like the astronaut moving with the apparatus, is smaller than time 
measured by other observers. Since those other observers measure a longer 
time At, the effect is called time dilation. The Earth-bound observer sees 
time dilate (get longer) for a system moving relative to the Earth. 
Alternatively, according to the Earth-bound observer, time slows in the 
moving frame, since less time passes there. All clocks moving relative to an 
observer, including biological clocks such as aging, are observed to run 
slow compared with a clock stationary relative to the observer. 


Note that if the relative velocity is much less than the speed of light (v<<c 
), then a is extremely small, and the elapsed times At and Ato are nearly 


equal. At low velocities, modern relativity approaches classical physics— 
our everyday experiences have very small relativistic effects. 


The equation At = yAt, also implies that relative velocity cannot exceed 
the speed of light. As v approaches c, At approaches infinity. This would 
imply that time in the astronaut’s frame stops at the speed of light. If v 
exceeded c, then we would be taking the square root of a negative number, 
producing an imaginary value for At. 


There is considerable experimental evidence that the equation At = yAt, 
is correct. One example is found in cosmic ray particles that continuously 
rain down on the Earth from deep space. Some collisions of these particles 
with nuclei in the upper atmosphere result in short-lived particles called 
muons. The half-life (amount of time for half of a material to decay) of a 
muon is 1.52 ys when it is at rest relative to the observer who measures the 
half-life. This is the proper time At y. Muons produced by cosmic ray 
particles have a range of velocities, with some moving near the speed of 
light. It has been found that the muon’s half-life as measured by an Earth- 
bound observer (At) varies with velocity exactly as predicted by the 
equation At = yAt,. The faster the muon moves, the longer it lives. We on 
the Earth see the muon’s half-life time dilated—as viewed from our frame, 
the muon decays more slowly than it does when at rest relative to us. 


Example: 

Calculating Az for a Relativistic Event: How Long Does a Speedy 
Muon Live? 

Suppose a cosmic ray colliding with a nucleus in the Earth’s upper 
atmosphere produces a muon that has a velocity v = 0.950c. The muon 
then travels at constant velocity and lives 1.52 pus as measured in the 
muon’s frame of reference. (You can imagine this as the muon’s internal 
clock.) How long does the muon live as measured by an Earth-bound 
observer? (See [link].) 


A muon in the 
Earth’s atmosphere 
lives longer as 
measured by an 
Earth-bound 
observer than 
measured by the 
muon’s internal 
clock. 


Strategy 

A clock moving with the system being measured observes the proper time, 
so the time we are given is Atp = 1.52 ws. The Earth-bound observer 
measures At as given by the equation At = yAt,). Since we know the 
velocity, the calculation is straightforward. 

Solution 

1) Identify the knowns. v = 0.950c, Atp = 1.52 ps 

2) Identify the unknown. At 

3) Choose the appropriate equation. 

Use, 

Equation: 


At = yAt, 


where 
Equation: 


———<$$=— 


yi-8 


4) Plug the knowns into the equation. 


ih 


First find y. 
Equation: 
1 
10 = aa 
= i 
1 (0 a 
= 1 
a 1—(0.950)? 
ee: 
Use the calculated value of y to determine At. 
Equation: 
Nia NG, 
= (3.20)(1.52 us) 
= 4.87 us 
Discussion 


One implication of this example is that since 7 = 3.20 at 95.0% of the 
speed of light (v = 0.950c), the relativistic effects are significant. The two 
time intervals differ by this factor of 3.20, where classically they would be 
the same. Something moving at 0.950c is said to be highly relativistic. 


Another implication of the preceding example is that everything an 
astronaut does when moving at 95.0% of the speed of light relative to the 
Earth takes 3.20 times longer when observed from the Earth. Does the 


astronaut sense this? Only if she looks outside her spaceship. All methods 
of measuring time in her frame will be affected by the same factor of 3.20. 
This includes her wristwatch, heart rate, cell metabolism rate, nerve impulse 
rate, and so on. She will have no way of telling, since all of her clocks will 
agree with one another because their relative velocities are zero. Motion is 
relative, not absolute. But what if she does look out the window? 


Note: 

Real-World Connections 

It may seem that special relativity has little effect on your life, but it is 
probably more important than you realize. One of the most common effects 
is through the Global Positioning System (GPS). Emergency vehicles, 
package delivery services, electronic maps, and communications devices 
are just a few of the common uses of GPS, and the GPS system could not 
work without taking into account relativistic effects. GPS satellites rely on 
precise time measurements to communicate. The signals travel at 
relativistic speeds. Without corrections for time dilation, the satellites 
could not communicate, and the GPS system would fail within minutes. 


The Twin Paradox 


An intriguing consequence of time dilation is that a space traveler moving 
at a high velocity relative to the Earth would age less than her Earth-bound 
twin. Imagine the astronaut moving at such a velocity that ~ = 30.0, as in 
[link]. A trip that takes 2.00 years in her frame would take 60.0 years in her 
Earth-bound twin’s frame. Suppose the astronaut traveled 1.00 year to 
another star system. She briefly explored the area, and then traveled 1.00 
year back. If the astronaut was 40 years old when she left, she would be 42 
upon her return. Everything on the Earth, however, would have aged 60.0 
years. Her twin, if still alive, would be 100 years old. 


The situation would seem different to the astronaut. Because motion is 
relative, the spaceship would seem to be stationary and the Earth would 
appear to move. (This is the sensation you have when flying in a jet.) If the 


astronaut looks out the window of the spaceship, she will see time slow 
down on the Earth by a factor of y = 30.0. To her, the Earth-bound sister 
will have aged only 2/30 (1/15) of a year, while she aged 2.00 years. The 
two sisters cannot both be correct. 


The twin paradox asks 
why the traveling twin 
ages less than the Earth- 
bound twin. That is the 
prediction we obtain if we 
consider the Earth-bound 
twin’s frame. In the 
astronaut’s frame, 
however, the Earth is 
moving and time runs 
slower there. Who is 
correct? 


As with all paradoxes, the premise is faulty and leads to contradictory 
conclusions. In fact, the astronaut’s motion is significantly different from 
that of the Earth-bound twin. The astronaut accelerates to a high velocity 


and then decelerates to view the star system. To return to the Earth, she 
again accelerates and decelerates. The Earth-bound twin does not 
experience these accelerations. So the situation is not symmetric, and it is 
not correct to claim that the astronaut will observe the same effects as her 
Earth-bound twin. If you use special relativity to examine the twin paradox, 
you must keep in mind that the theory is expressly based on inertial frames, 
which by definition are not accelerated or rotating. Einstein developed 
general relativity to deal with accelerated frames and with gravity, a prime 
source of acceleration. You can also use general relativity to address the 
twin paradox and, according to general relativity, the astronaut will age less. 
Some important conceptual aspects of general relativity are discussed in 
General Relativity and Quantum Gravity of this course. 


In 1971, American physicists Joseph Hafele and Richard Keating verified 
time dilation at low relative velocities by flying extremely accurate atomic 
clocks around the Earth on commercial aircraft. They measured elapsed 
time to an accuracy of a few nanoseconds and compared it with the time 
measured by clocks left behind. Hafele and Keating’s results were within 
experimental uncertainties of the predictions of relativity. Both special and 
general relativity had to be taken into account, since gravity and 
accelerations were involved as well as relative motion. 

Exercise: 

Check Your Understanding 


Problem:1. What is y if v = 0.650c? 


Solution 
1 1 
q i 1 (0.650c)2 


C2 


2. A particle travels at 1.90 x 10° m/s and lives 2.10 x 10~° s when 
at rest relative to an observer. How long does the particle live as 
viewed in the laboratory? 


Solution: 


-8 
At = = 210s _ = 2.71 x10 8s 
ie (190x108 m/s)? 
& (3.00x108 m/s)2 


Section Summary 


¢ Two events are defined to be simultaneous if an observer measures 
them as occurring at the same time. They are not necessarily 
simultaneous to all observers—simultaneity is not absolute. 

e Time dilation is the phenomenon of time passing slower for an 
observer who is moving relative to another observer. 

e Observers moving at a relative velocity v do not measure the same 
elapsed time for an event. Proper time Ato is the time measured by an 
observer at rest relative to the event being observed. Proper time is 
related to the time At measured by an Earth-bound observer by the 


equation 
Equation: 
At 
At = : = yAto, 
ye 
l-@ 
where 
Equation: 
1 


e The equation relating proper time and time measured by an Earth- 
bound observer implies that relative velocity cannot exceed the speed 
of light. 

e The twin paradox asks why a twin traveling at a relativistic speed 
away and then back towards the Earth ages less than the Earth-bound 
twin. The premise to the paradox is faulty because the traveling twin is 
accelerating. Special relativity does not apply to accelerating frames of 
reference. 


e Time dilation is usually negligible at low relative velocities, but it does 
occur, and it has been verified by experiment. 


Conceptual Questions 


Exercise: 


Problem: 


Does motion affect the rate of a clock as measured by an observer 
moving with it? Does motion affect how an observer moving relative 
to a clock measures its rate? 


Exercise: 


Problem: 


To whom does the elapsed time for a process seem to be longer, an 
observer moving relative to the process or an observer moving with the 
process? Which observer measures proper time? 


Exercise: 


Problem: 
How could you travel far into the future without aging significantly? 
Could this method also allow you to travel into the past? 

Problems & Exercises 


Exercise: 
Problem: (a) What is y if v = 0.250c? (b) If v = 0.500c? 
Solution: 


(a) 1.0328 


(b) 1.15 


Exercise: 


Problem: (a) What is y if v = 0.100c? (b) If v = 0.900c? 
Exercise: 

Problem: 

Particles called 7-mesons are produced by accelerator beams. If these 

particles travel at 2.70 x 10° m/s and live 2.60 x 10~° s when at rest 


relative to an observer, how long do they live as viewed in the 
laboratory? 


Solution: 


5.96 x 108s 
Exercise: 


Problem: 


Suppose a particle called a kaon is created by cosmic radiation striking 


the atmosphere. It moves by you at 0.980c, and it lives 1.24 x 10°° s 
when at rest relative to an observer. How long does it live as you 
observe it? 


Exercise: 
Problem: 
A neutral 7-meson is a particle that can be created by accelerator 
beams. If one such particle lives 1.40 x 10~*° s as measured in the 


laboratory, and 0.840 x 10~*° s when at rest relative to an observer, 
what is its velocity relative to the laboratory? 


Solution: 


0.800c 


Exercise: 


Problem: 


A neutron lives 900 s when at rest relative to an observer. How fast is 
the neutron moving relative to an observer who measures its life span 
to be 2065 s? 


Exercise: 


Problem: 


If relativistic effects are to be less than 1%, then y must be less than 
1.01. At what relative velocity is y = 1.01? 


Solution: 


0.140c 
Exercise: 


Problem: 


If relativistic effects are to be less than 3%, then y must be less than 
1.03. At what relative velocity is y = 1.03? 


Exercise: 


Problem: 


(a) At what relative velocity is ~ = 1.50? (b) At what relative velocity 
is y = 100? 


Solution: 
(a) 0.745c 


(b) 0.99995c (to five digits to show effect) 


Exercise: 


Problem: 


(a) At what relative velocity is ~ = 2.00? (b) At what relative velocity 
is y = 10.0? 


Exercise: 


Problem: Unreasonable Results 


(a) Find the value of for the following situation. An Earth-bound 

observer measures 23.9 h to have passed while signals from a high- 
velocity space probe indicate that 24.0 h have passed on board. (b) 
What is unreasonable about this result? (c) Which assumptions are 

unreasonable or inconsistent? 


Solution: 
(a) 0.996 
(b) y cannot be less than 1. 


(c) Assumption that time is longer in moving ship is unreasonable. 


Glossary 


time dilation 
the phenomenon of time passing slower to an observer who is moving 
relative to another observer 


proper time 
Ato. the time measured by an observer at rest relative to the event 
being observed: At = aul = = yAto, where y = 


Vira 


twin paradox 
this asks why a twin traveling at a relativistic speed away and then 
back towards the Earth ages less than the Earth-bound twin. The 


premise to the paradox is faulty because the traveling twin is 
accelerating, and special relativity does not apply to accelerating 
frames of reference 


Length Contraction 


e Describe proper length. 
e Calculate length contraction. 
e Explain why we don’t notice these effects at everyday scales. 


People might describe 
distances differently, but 
at relativistic speeds, the 

distances really are 
different. (credit: Corey 
Leopold, Flickr) 


Have you ever driven on a road that seems like it goes on forever? If you 
look ahead, you might say you have about 10 km left to go. Another 
traveler might say the road ahead looks like it’s about 15 km long. If you 
both measured the road, however, you would agree. Traveling at everyday 
speeds, the distance you both measure would be the same. You will read in 
this section, however, that this is not true at relativistic speeds. Close to the 
speed of light, distances measured are not the same when measured by 
different observers. 


Proper Length 


One thing all observers agree upon is relative speed. Even though clocks 
measure different elapsed times for the same process, they still agree that 
relative speed, which is distance divided by elapsed time, is the same. This 


implies that distance, too, depends on the observer’s relative motion. If two 
observers see different times, then they must also see different distances for 
relative speed to be the same to each of them. 


The muon discussed in [link] illustrates this concept. To an observer on the 
Earth, the muon travels at 0.950c for 7.05 ys from the time it is produced 
until it decays. Thus it travels a distance 

Equation: 


Lo = vAt = (0.950)(3.00 x 10° m/s)(7.05 x 10° s) = 2.01 km 


relative to the Earth. In the muon’s frame of reference, its lifetime is only 
2.20 ps. It has enough time to travel only 
Equation: 


L = vAto = (0.950)(3.00 x 10° m/s)(2.20 x 10~° s) = 0.627 km. 


The distance between the same two events (production and decay of a 
muon) depends on who measures it and how they are moving relative to it. 


Note: 

Proper Length 

Proper length Lo is the distance between two points measured by an 
observer who is at rest relative to both of the points. 


The Earth-bound observer measures the proper length Lg, because the 
points at which the muon is produced and decays are stationary relative to 
the Earth. To the muon, the Earth, air, and clouds are moving, and so the 
distance L it sees is not the proper length. 


0.627 km 42 


§ Ea 2.01 km ————-ff » § Poser ing. 
“~ ad ee ae ay 
—— 
v 
(a) (b) 


(a) The Earth-bound observer sees the muon travel 2.01 
km between clouds. (b) The muon sees itself travel the 
same path, but only a distance of 0.627 km. The Earth, 
air, and clouds are moving relative to the muon in its 
frame, and all appear to have smaller lengths along the 
direction of travel. 


Length Contraction 


To develop an equation relating distances measured by different observers, 
we note that the velocity relative to the Earth-bound observer in our muon 
example is given by 

Equation: 


Lo 
v= — 
At 
The time relative to the Earth-bound observer is At, since the object being 
timed is moving relative to this observer. The velocity relative to the 


moving observer is given by 
Equation: 


The moving observer travels with the muon and therefore observes the 
proper time Ato. The two velocities are identical; thus, 


Equation: 


Lo L 


At Rig. 


We know that At = yAto. Substituting this equation into the relationship 
above gives 
Equation: 


Substituting for y gives an equation relating the distances measured by 
different observers. 


Note: 

Length Contraction 

Length contraction L is the shortening of the measured length of an 
object moving relative to the observer’s frame. 

Equation: 


If we measure the length of anything moving relative to our frame, we find 
its length LD to be smaller than the proper length Lo that would be measured 
if the object were stationary. For example, in the muon’s reference frame, 
the distance between the points where it was produced and where it decayed 
is shorter. Those points are fixed relative to the Earth but moving relative to 
the muon. Clouds and other objects are also contracted along the direction 
of motion in the muon’s reference frame. 


Example: 

Calculating Length Contraction: The Distance between Stars 
Contracts when You Travel at High Velocity 

Suppose an astronaut, such as the twin discussed in Simultaneity and Time 
Dilation, travels so fast that ~ = 30.00. (a) She travels from the Earth to 
the nearest star system, Alpha Centauri, 4.300 light years (ly) away as 
measured by an Earth-bound observer. How far apart are the Earth and 
Alpha Centauri as measured by the astronaut? (b) In terms of c, what is her 
velocity relative to the Earth? You may neglect the motion of the Earth 
relative to the Sun. (See [link].) 


iy 
a Pr Alpha 
Earth * 
-_-____—- ., ——__________+ 
— bo 
ane; (a) 
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rad ° Santen 
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(b) 


(a) The Earth-bound observer measures the 
proper distance between the Earth and the 
Alpha Centauri. (b) The astronaut observes a 
length contraction, since the Earth and the 
Alpha Centauri move relative to her ship. She 
can travel this shorter distance in a smaller time 
(her proper time) without exceeding the speed 
of light. 


Strategy 

First note that a light year (ly) is a convenient unit of distance on an 
astronomical scale—it is the distance light travels in a year. For part (a), 
note that the 4.300 ly distance between the Alpha Centauri and the Earth is 


the proper distance Lg, because it is measured by an Earth-bound observer 
to whom both stars are (approximately) stationary. To the astronaut, the 
Earth and the Alpha Centauri are moving by at the same velocity, and so 
the distance between them is the contracted length L. In part (b), we are 
given ‘y, and so we can find v by rearranging the definition of 7 to express 
v in terms of c. 

Solution for (a) 


1. Identify the knowns. Ly, — 4.300 ly; y = 30.00 
2. Identify the unknown. L 
3. Choose the appropriate equation. L = = 


4. Rearrange the equation to solve for the unknown. 
Equation: 


age 
7 


4.300 ly 
30.00 


= 0.1433 ly 


Solution for (b) 


1. Identify the known. y = 30.00 

2. Identify the unknown. v in terms of c 

3. Choose the appropriate equation. y = Tess 
ee 


4. Rearrange the equation to solve for the unknown. 
Equation: 


30.00 = 


Squaring both sides of the equation and rearranging terms gives 
Equation: 


so that 
Equation: 
2 
Al 
“ey eee 
C2 900.0 
and 
Equation: 
Se ee Sane 
cz 900.0 om 


Taking the square root, we find 
Equation: 


” — 0.99944, 
(G 


which is rearranged to produce a value for the velocity 
Equation: 


v= 0.9994c. 


Discussion 

First, remember that you should not round off calculations until the final 
result is obtained, or you could get erroneous results. This is especially true 
for special relativity calculations, where the differences might only be 
revealed after several decimal places. The relativistic effect is large here ( 
y=30.00), and we see that v is approaching (not equaling) the speed of 
light. Since the distance as measured by the astronaut is so much smaller, 
the astronaut can travel it in much less time in her frame. 


People could be sent very large distances (thousands or even millions of 
light years) and age only a few years on the way if they traveled at 
extremely high velocities. But, like emigrants of centuries past, they would 
leave the Earth they know forever. Even if they returned, thousands to 
millions of years would have passed on the Earth, obliterating most of what 
now exists. There is also a more serious practical obstacle to traveling at 
such velocities; immensely greater energies than classical physics predicts 
would be needed to achieve such high velocities. This will be discussed in 
Relatavistic Energy. 


Why don’t we notice length contraction in everyday life? The distance to 
the grocery shop does not seem to depend on whether we are moving or not. 


Examining the equation LZ = Lo,/1— ¥, we see that at low velocities ( 


v<<c) the lengths are nearly equal, the classical expectation. But length 
contraction is real, if not commonly experienced. For example, a charged 
particle, like an electron, traveling at relativistic velocity has electric field 
lines that are compressed along the direction of motion as seen by a 
stationary observer. (See [link].) As the electron passes a detector, such as a 
coil of wire, its field interacts much more briefly, an effect observed at 
particle accelerators such as the 3 km long Stanford Linear Accelerator 
(SLAC). In fact, to an electron traveling down the beam pipe at SLAC, the 
accelerator and the Earth are all moving by and are length contracted. The 
relativistic effect is so great than the accelerator is only 0.5 m long to the 
electron. It is actually easier to get the electron beam down the pipe, since 
the beam does not have to be as precisely aimed to get down a short pipe as 
it would down one 3 km long. This, again, is an experimental verification of 
the Special Theory of Relativity. 


The electric field lines of a 
high-velocity charged 
particle are compressed 
along the direction of motion 
by length contraction. This 
produces a different signal 
when the particle goes 
through a coil, an 
experimentally verified 
effect of length contraction. 


Exercise: 
Check Your Understanding 


Problem: 


A particle is traveling through the Earth’s atmosphere at a speed of 
0.750c. To an Earth-bound observer, the distance it travels is 2.50 km. 
How far does the particle travel in the particle’s frame of reference? 


Solution: 
Answer 
Equation: 


Summary 


e All observers agree upon relative speed. 

e Distance depends on an observer’s motion. Proper length Lg is the 
distance between two points measured by an observer who is at rest 
relative to both of the points. Earth-bound observers measure proper 
length when measuring the distance between two points that are 
Stationary relative to the Earth. 

¢ Length contraction L is the shortening of the measured length of an 
object moving relative to the observer’s frame: 

Equation: 


Conceptual Questions 


Exercise: 
Problem: 
To whom does an object seem greater in length, an observer moving 


with the object or an observer moving relative to the object? Which 
observer measures the object’s proper length? 


Exercise: 
Problem: 
Relativistic effects such as time dilation and length contraction are 


present for cars and airplanes. Why do these effects seem strange to 
us? 


Exercise: 


Problem: 


Suppose an astronaut is moving relative to the Earth at a significant 
fraction of the speed of light. (a) Does he observe the rate of his clocks 
to have slowed? (b) What change in the rate of Earth-bound clocks 
does he see? (c) Does his ship seem to him to shorten? (d) What about 
the distance between stars that lie on lines parallel to his motion? (e) 
Do he and an Earth-bound observer agree on his velocity relative to 
the Earth? 


Problems & Exercises 


Exercise: 
Problem: 


A spaceship, 200 m long as seen on board, moves by the Earth at 
0.970c. What is its length as measured by an Earth-bound observer? 


Solution: 


48.6 m 

Exercise: 
Problem: 
How fast would a 6.0 m-long sports car have to be going past you in 
order for it to appear only 5.5 m long? 

Exercise: 
Problem: 
(a) How far does the muon in [link] travel according to the Earth- 
bound observer? (b) How far does it travel as viewed by an observer 
moving with it? Base your calculation on its velocity relative to the 


Earth and the time it lives (proper time). (c) Verify that these two 
distances are related through length contraction y=3.20. 


Solution: 


(a) 1.387 km = 1.39 km 


(b) 0.433 km 
_ Lo —  1.387x10* m 
(c) Y 3.20 


= 433.4m =] 0.433 km 


Thus, the distances in parts (a) and (b) are related when y = 3.20. 
Exercise: 

Problem: 

(a) How long would the muon in [link] have lived as observed on the 

Earth if its velocity was 0.0500c? (b) How far would it have traveled 


as observed on the Earth? (c) What distance is this in the muon’s 
frame? 


Exercise: 
Problem: 
(a) How long does it take the astronaut in [link] to travel 4.30 ly at 
0.99944c (as measured by the Earth-bound observer)? (b) How long 
does it take according to the astronaut? (c) Verify that these two times 
are related through time dilation with y=30.00 as given. 
Solution: 


(a) 4.303 y (to four digits to show any effect) 


(b) 0.1434 y 


4. 
(c) At = yAty > y= RE = gine = 30.0 


Thus, the two times are related when y=30.00. 


Exercise: 


Problem: 


(a) How fast would an athlete need to be running for a 100-m race to 
look 100 yd long? (b) Is the answer consistent with the fact that 
relativistic effects are difficult to observe in ordinary circumstances? 
Explain. 


Exercise: 


Problem: Unreasonable Results 


(a) Find the value of - for the following situation. An astronaut 
measures the length of her spaceship to be 25.0 m, while an Earth- 
bound observer measures it to be 100 m. (b) What is unreasonable 
about this result? (c) Which assumptions are unreasonable or 
inconsistent? 


Solution: 

(a) 0.250 

(b) y must be >1 

(c) The Earth-bound observer must measure a shorter length, so it is 


unreasonable to assume a longer length. 


Exercise: 


Problem: Unreasonable Results 


A spaceship is heading directly toward the Earth at a velocity of 
0.800c. The astronaut on board claims that he can send a canister 
toward the Earth at 1.20c relative to the Earth. (a) Calculate the 
velocity the canister must have relative to the spaceship. (b) What is 
unreasonable about this result? (c) Which assumptions are 
unreasonable or inconsistent? 


Glossary 


proper length 
Lo; the distance between two points measured by an observer who is 
at rest relative to both of the points; Earth-bound observers measure 
proper length when measuring the distance between two points that are 
stationary relative to the Earth 


length contraction 
L, the shortening of the measured length of an object moving relative 


to the observer’s frame: L=Ly 4/1 — vw _ Lo 


Relativistic Addition of Velocities 


¢ Calculate relativistic velocity addition. 

e Explain when relativistic velocity addition should be used instead of 
classical addition of velocities. 

e Calculate relativistic Doppler shift. 


The total velocity of a 
kayak, like this one on the 
Deerfield River in 


Massachusetts, is its 
velocity relative to the 
water as well as the 
water’s velocity relative 
to the riverbank. (credit: 
abkfenris, Flickr) 


If you’ve ever seen a kayak move down a fast-moving river, you know that 
remaining in the same place would be hard. The river current pulls the 
kayak along. Pushing the oars back against the water can move the kayak 
forward in the water, but that only accounts for part of the velocity. The 
kayak’s motion is an example of classical addition of velocities. In classical 
physics, velocities add as vectors. The kayak’s velocity is the vector sum of 
its velocity relative to the water and the water’s velocity relative to the 
riverbank. 


Classical Velocity Addition 


For simplicity, we restrict our consideration of velocity addition to one- 
dimensional motion. Classically, velocities add like regular numbers in one- 
dimensional motion. (See [link].) Suppose, for example, a girl is riding in a 
sled at a speed 1.0 m/s relative to an observer. She throws a snowball first 
forward, then backward at a speed of 1.5 m/s relative to the sled. We denote 
direction with plus and minus signs in one dimension; in this example, 
forward is positive. Let v be the velocity of the sled relative to the Earth, u 
the velocity of the snowball relative to the Earth-bound observer, and wu/ the 
velocity of the snowball relative to the sled. 


Observer 


u'’=-15mls 
a 


u=-—0.5 mis 


Classically, velocities add like ordinary numbers in 
one-dimensional motion. Here the girl throws a 
snowball forward and then backward from a sled. 
The velocity of the sled relative to the Earth is 
v=1.0 m/s. The velocity of the snowball relative 


to the sled is u/, while its velocity relative to the 
Earth is wu. Classically, u=v-+ul. 


Note: 
Classical Velocity Addition 
Equation: 


u=v-+tul 


Thus, when the girl throws the snowball forward, 

u=1.0m/s+1.5 m/s = 2.5 m/s. It makes good intuitive sense that the 
snowball will head towards the Earth-bound observer faster, because it is 
thrown forward from a moving vehicle. When the girl throws the snowball 
backward, u = 1.0 m/s + (—1.5 m/s) = —0.5 m/s. The minus sign 
means the snowball moves away from the Earth-bound observer. 


Relativistic Velocity Addition 


The second postulate of relativity (verified by extensive experimental 
observation) says that classical velocity addition does not apply to light. 
Imagine a car traveling at night along a straight road, as in [link]. If 
classical velocity addition applied to light, then the light from the car’s 
headlights would approach the observer on the sidewalk at a speed u=v-++c. 
But we know that light will move away from the car at speed c relative to 
the driver of the car, and light will move towards the observer on the 
sidewalk at speed c, too. 


According to experiment and the second postulate of 
relativity, light from the car’s headlights moves away 
from the car at speed c and towards the observer on 
the sidewalk at speed c. Classical velocity addition is 
not valid. 


Note: 

Relativistic Velocity Addition 

Either light is an exception, or the classical velocity addition formula only 
works at low velocities. The latter is the case. The correct formula for one- 
dimensional relativistic velocity addition is 

Equation: 


v+tul 
US 7 


C2 
where v is the relative velocity between two observers, w is the velocity of 
an object relative to one observer, and w/ is the velocity relative to the 
other observer. (For ease of visualization, we often choose to measure wu in 
our reference frame, while someone moving at v relative to us measures u/ 


.) Note that the term ““ becomes very small at low velocities, and 


C2 
om gives a result very close to classical velocity addition. As 
lp 
Cc 


Uh = 


before, we see that classical velocity addition is an excellent approximation 
to the correct relativistic formula for small velocities. No wonder that it 
seems correct in our experience. 


Example: 

Showing that the Speed of Light towards an Observer is Constant (in a 
Vacuum): The Speed of Light is the Speed of Light 

Suppose a spaceship heading directly towards the Earth at half the speed of 
light sends a signal to us on a laser-produced beam of light. Given that the 
light leaves the ship at speed c as observed from the ship, calculate the 
speed at which it approaches the Earth. 


Ww —_ laser light as hig 
al —_> 


v = 0.500c 


ree laserlight = sa 


v = 0.500c¢ 
Strategy 
Because the light and the spaceship are moving at relativistic speeds, we 
cannot use simple velocity addition. Instead, we can determine the speed at 
which the light approaches the Earth using relativistic velocity addition. 
Solution 


1. Identify the knowns. v=0.500c; u/= c 
2. Identify the unknown. wu 
3. Choose the appropriate equation. uw = —+- 


4. Plug the knowns into the equation. 
Equation: 


v+ul 
14+ 
_0.500c+c _ 
14 a ( 


vi = 


(0.500+1)c 
fle 0.5000? 
1.500c 

= 1+0.500 


1.500c 
1.500 


= C 


Discussion 

Relativistic velocity addition gives the correct result. Light leaves the ship 
at speed c and approaches the Earth at speed c. The speed of light is 
independent of the relative motion of source and observer, whether the 
observer is on the ship or Earth-bound. 


Velocities cannot add to greater than the speed of light, provided that v is 
less than c and u/ does not exceed c. The following example illustrates that 
relativistic velocity addition is not as symmetric as classical velocity 
addition. 


Example: 

Comparing the Speed of Light towards and away from an Observer: 
Relativistic Package Delivery 

Suppose the spaceship in the previous example is approaching the Earth at 
half the speed of light and shoots a canister at a speed of 0.750c. (a) At 
what velocity will an Earth-bound observer see the canister if it is shot 
directly towards the Earth? (b) If it is shot directly away from the Earth? 
(See [link].) 


u | u' 
a—_—_—_—_©>_ :: <=" 
4 a ie ———— 
ay v = 0.50 ay ~—=—v=0.50e 
Canister toward Earth Canister away from Earth 
Strategy 


Because the canister and the spaceship are moving at relativistic speeds, 
we must determine the speed of the canister by an Earth-bound observer 
using relativistic velocity addition instead of simple velocity addition. 
Solution for (a) 


1. Identify the knowns. v=0.500c; w/= 0.750c 
2. Identify the unknown. wu 


3. Choose the appropriate equation. u=44 


14+ 


c 


4. Plug the knowns into the equation. 
Equation: 


vtul 
14+ 
= 0.500c +0.750c 
14 ERE 
1.250c 
140.375 


O7909e 


Solution for (b) 


1. Identify the knowns. v = 0.500c; w/= —0.750c 
2. Identify the unknown. wu 


3. Choose the appropriate equation. u = 242 


14+ 


& 


4. Plug the knowns into the equation. 
Equation: 


v+ul 

14+ 
0.500c +(—0.750c) 
14 we 


—0.250c 
1—0.375 


= —0.400c 


Discussion 

The minus sign indicates velocity away from the Earth (in the opposite 
direction from v), which means the canister is heading towards the Earth in 
part (a) and away in part (b), as expected. But relativistic velocities do not 
add as simply as they do classically. In part (a), the canister does approach 
the Earth faster, but not at the simple sum of 1.250c. The total velocity is 
less than you would get classically. And in part (b), the canister moves 
away from the Earth at a velocity of —0.400c, which is faster than the 
—Q.250c you would expect classically. The velocities are not even 
symmetric. In part (a) the canister moves 0.409c faster than the ship 
relative to the Earth, whereas in part (b) it moves 0.900c slower than the 
ship. 


Doppler Shift 


Although the speed of light does not change with relative velocity, the 
frequencies and wavelengths of light do. First discussed for sound waves, a 
Doppler shift occurs in any wave when there is relative motion between 
source and observer. 


Note: 

Relativistic Doppler Effects 

The observed wavelength of electromagnetic radiation is longer (called a 
red shift) than that emitted by the source when the source moves away 


from the observer and shorter (called a blue shift) when the source moves 
towards the observer. 
Equation: 


In the Doppler equation, A,p; is the observed wavelength, A, is the source 
wavelength, and w is the relative velocity of the source to the observer. The 
velocity u is positive for motion away from an observer and negative for 
motion toward an observer. In terms of source frequency and observed 
frequency, this equation can be written 

Equation: 


fobs =f, 


—_ 
+ 
ole 


Notice that the — and + signs are different than in the wavelength equation. 


Note: 

Career Connection: Astronomer 

If you are interested in a career that requires a knowledge of special 
relativity, there’s probably no better connection than astronomy. 
Astronomers must take into account relativistic effects when they calculate 
distances, times, and speeds of black holes, galaxies, quasars, and all other 
astronomical objects. To have a career in astronomy, you need at least an 
undergraduate degree in either physics or astronomy, but a Master’s or 
doctoral degree is often required. You also need a good background in 
high-level mathematics. 


Example: 

Calculating a Doppler Shift: Radio Waves from a Receding Galaxy 
Suppose a galaxy is moving away from the Earth at a speed 0.825c . It 
emits radio waves with a wavelength of 0.525 m. What wavelength would 
we detect on the Earth? 

Strategy 

Because the galaxy is moving at a relativistic speed, we must determine the 
Doppler shift of the radio waves using the relativistic Doppler shift instead 
of the classical Doppler shift. 

Solution 


1. Identify the knowns. u=0.825c ; A, = 0.525 m 
2. Identify the unknown. Aobs 


fee 


iG 


3. Choose the appropriate equation. Aow=Aea/ ine 


& 


4. Plug the knowns into the equation. 


Equation: 
Acbs = As i“ 
= (0.525 m) iste 
== Adank, 
Discussion 


Because the galaxy is moving away from the Earth, we expect the 
wavelengths of radiation it emits to be redshifted. The wavelength we 
calculated is 1.70 m, which is redshifted from the original wavelength of 
0.525 m. 


The relativistic Doppler shift is easy to observe. This equation has everyday 
applications ranging from Doppler-shifted radar velocity measurements of 
transportation to Doppler-radar storm monitoring. In astronomical 


observations, the relativistic Doppler shift provides velocity information 
such as the motion and distance of stars. 

Exercise: 

Check Your Understanding 


Problem: 
Suppose a space probe moves away from the Earth at a speed 0.350c. 


It sends a radio wave message back to the Earth at a frequency of 1.50 
GHz. At what frequency is the message received on the Earth? 


Solution: 
Answer 
Equation: 
La _ 0.350¢ 
fots=fo4/ 7a = (1.50 GHz) 1) baste = 1.04 GHz 
c Cc 


Section Summary 


e With classical velocity addition, velocities add like regular numbers in 
one-dimensional motion: u=v+u/, where v is the velocity between 
two observers, wu is the velocity of an object relative to one observer, 
and wu/ is the velocity relative to the other observer. 

e Velocities cannot add to be greater than the speed of light. Relativistic 
velocity addition describes the velocities of an object moving at a 
relativistic speed: 

Equation: 


utul 


Lae 


C2 


e An observer of electromagnetic radiation sees relativistic Doppler 
effects if the source of the radiation is moving relative to the observer. 
The wavelength of the radiation is longer (called a red shift) than that 


emitted by the source when the source moves away from the observer 
and shorter (called a blue shift) when the source moves toward the 
observer. The shifted wavelength is described by the equation 
Equation: 


Aobs is the observed wavelength, A, is the source wavelength, and wu is 
the relative velocity of the source to the observer. 


Conceptual Questions 


Exercise: 
Problem: 
Explain the meaning of the terms “red shift” and “blue shift” as they 
relate to the relativistic Doppler effect. 

Exercise: 
Problem: 
What happens to the relativistic Doppler effect when relative velocity 
is zero? Is this the expected result? 

Exercise: 
Problem: 
Is the relativistic Doppler effect consistent with the classical Doppler 
effect in the respect that Ajps is larger for motion away? 


Exercise: 


Problem: 


All galaxies farther away than about 50 x 10° ly exhibit a red shift in 
their emitted light that is proportional to distance, with those farther 
and farther away having progressively greater red shifts. What does 
this imply, assuming that the only source of red shift is relative 
motion? (Hint: At these large distances, it is space itself that is 
expanding, but the effect on light is the same.) 


Problems & Exercises 


Exercise: 
Problem: 
Suppose a spaceship heading straight towards the Earth at 0.750c can 
shoot a canister at 0.500c relative to the ship. (a) What is the velocity 


of the canister relative to the Earth, if it is shot directly at the Earth? 
(b) If it is shot directly away from the Earth? 


Solution: 
(a) 0.909c 
(b) 0.400c 


Exercise: 


Problem: 


Repeat the previous problem with the ship heading directly away from 
the Earth. 


Exercise: 


Problem: 


If a spaceship is approaching the Earth at 0.100c and a message 
capsule is sent toward it at 0.100c relative to the Earth, what is the 
speed of the capsule relative to the ship? 


Solution: 


0.198c 

Exercise: 
Problem: 
(a) Suppose the speed of light were only 3000 m/s. A jet fighter 
moving toward a target on the ground at 800 m/s shoots bullets, each 
having a muzzle velocity of 1000 m/s. What are the bullets’ velocity 


relative to the target? (b) If the speed of light was this small, would 
you observe relativistic effects in everyday life? Discuss. 


Exercise: 


Problem: 


If a galaxy moving away from the Earth has a speed of 1000 km/s and 
emits 656 nm light characteristic of hydrogen (the most common 
element in the universe). (a) What wavelength would we observe on 
the Earth? (b) What type of electromagnetic radiation is this? (c) Why 
is the speed of the Earth in its orbit negligible here? 


Solution: 
a) 658 nm 
b) red 


c) v/c = 9.92 x 10~° (negligible) 


Exercise: 


Problem: 


A space probe speeding towards the nearest star moves at 0.250c and 
sends radio information at a broadcast frequency of 1.00 GHz. What 
frequency is received on the Earth? 


Exercise: 
Problem: 
If two spaceships are heading directly towards each other at 0.800c, at 


what speed must a canister be shot from the first ship to approach the 
other at 0.999c as seen by the second ship? 


Solution: 


0.991c 
Exercise: 
Problem: 
Two planets are on a collision course, heading directly towards each 
other at 0.250c. A spaceship sent from one planet approaches the 


second at 0.750c as seen by the second planet. What is the velocity of 
the ship relative to the first planet? 


Exercise: 


Problem: 


When a missile is shot from one spaceship towards another, it leaves 
the first at 0.950c and approaches the other at 0.750c. What is the 
relative velocity of the two ships? 


Solution: 


—0.696c 


Exercise: 


Problem: 

What is the relative velocity of two spaceships if one fires a missile at 

the other at 0.750c and the other observes it to approach at 0.950c? 
Exercise: 

Problem: 

Near the center of our galaxy, hydrogen gas is moving directly away 

from us in its orbit about a black hole. We receive 1900 nm 


electromagnetic radiation and know that it was 1875 nm when emitted 
by the hydrogen gas. What is the speed of the gas? 


Solution: 


0.01324c 

Exercise: 
Problem: 
A highway patrol officer uses a device that measures the speed of 
vehicles by bouncing radar off them and measuring the Doppler shift. 
The outgoing radar has a frequency of 100 GHz and the returning echo 
has a frequency 15.0 kHz higher. What is the velocity of the vehicle? 


Note that there are two Doppler shifts in echoes. Be certain not to 
round off until the end of the problem, because the effect is small. 


Exercise: 


Problem: 
Prove that for any relative velocity v between two observers, a beam of 
light sent from one to the other will approach at speed c (provided that 


v is less than c, of course). 


Solution: 


ul = c, SO 


“us v+ul =e vtec — vtec 
~ 14+(vur/c?) 1+(ve/c?) 1+(v/c) 
a: SNE) 
SS ey 
Exercise: 
Problem: 


Show that for any relative velocity v between two observers, a beam of 
light projected by one directly away from the other will move away at 
the speed of light (provided that v is less than c, of course). 


Exercise: 


Problem: 


(a) All but the closest galaxies are receding from our own Milky Way 
Galaxy. If a galaxy 12.0 x 10° ly ly away is receding from us at 0. 
0.900c, at what velocity relative to us must we send an exploratory 
probe to approach the other galaxy at 0.990c, as measured from that 
galaxy? (b) How long will it take the probe to reach the other galaxy as 
measured from the Earth? You may assume that the velocity of the 
other galaxy remains constant. (c) How long will it then take for a 
radio signal to be beamed back? (All of this is possible in principle, 
but not practical.) 


Solution: 
a) 0.99947c 
b) 1.2064 x 101 y 


c) 1.2058 x 101 y (all to sufficient digits to show effects) 


Glossary 


classical velocity addition 
the method of adding velocities when v<<c; velocities add like 
regular numbers in one-dimensional motion: wu = v+u/, where v is the 


velocity between two observers, w is the velocity of an object relative 
to one observer, and w/ is the velocity relative to the other observer 


relativistic velocity addition 
the method of adding velocities of an object moving at a relativistic 


speed: u= ae , where v is the relative velocity between two 
ce 


observers, u is the velocity of an object relative to one observer, and u/ 
is the velocity relative to the other observer 


relativistic Doppler effects 
a change in wavelength of radiation that is moving relative to the 
observer; the wavelength of the radiation is longer (called a red shift) 
than that emitted by the source when the source moves away from the 
observer and shorter (called a blue shift) when the source moves 
toward the observer; the shifted wavelength is described by the 
equation 
Equation: 


where Aops is the observed wavelength, A, is the source wavelength, 
and wu is the velocity of the source to the observer 


Relativistic Momentum 


¢ Calculate relativistic momentum. 
e Explain why the only mass it makes sense to talk about is rest mass. 


Momentum is an 
important concept for 
these football players 
from the University of 
California at Berkeley 
and the University of 

California at Davis. 
Players with more mass 
often have a larger impact 
because their momentum 
is larger. For objects 
moving at relativistic 
speeds, the effect is even 
greater. (credit: John 

Martinez Pavliga) 


In classical physics, momentum is a simple product of mass and velocity. However, we saw in the last 
section that when special relativity is taken into account, massive objects have a speed limit. What 
effect do you think mass and velocity have on the momentum of objects moving at relativistic speeds? 


Momentum is one of the most important concepts in physics. The broadest form of Newton’s second 
law is stated in terms of momentum. Momentum is conserved whenever the net external force on a 
system is zero. This makes momentum conservation a fundamental tool for analyzing collisions. All of 
Work, Energy, and Energy Resources is devoted to momentum, and momentum has been important for 
many other topics as well, particularly where collisions were involved. We will see that momentum has 
the same importance in modern physics. Relativistic momentum is conserved, and much of what we 
know about subatomic structure comes from the analysis of collisions of accelerator-produced 
relativistic particles. 


The first postulate of relativity states that the laws of physics are the same in all inertial frames. Does 
the law of conservation of momentum survive this requirement at high velocities? The answer is yes, 
provided that the momentum is defined as follows. 


Note: 


Relativistic Momentum 
Relativistic momentum p is classical momentum multiplied by the relativistic factor ~¥. 
Equation: 


p— ymu, 


where m is the rest mass of the object, u is its velocity relative to an observer, and the relativistic 
factor 
Equation: 


Note that we use u for velocity here to distinguish it from relative velocity v between observers. Only 
one observer is being considered here. With p defined in this way, total momentum p;,; is conserved 
whenever the net external force is zero, just as in classical physics. Again we see that the relativistic 
quantity becomes virtually the same as the classical at low velocities. That is, relativistic momentum 
ymu becomes the classical mu at low velocities, because 7 is very nearly equal to 1 at low velocities. 


Relativistic momentum has the same intuitive feel as classical momentum. It is greatest for large 
masses moving at high velocities, but, because of the factor y, relativistic momentum approaches 
infinity as w approaches c. (See [link].) This is another indication that an object with mass cannot reach 
the speed of light. If it did, its momentum would become infinite, an unreasonable value. 


ed iad > 
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momentum P,.; (kg m/s) 
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0 0.2c 0.4c 0.6c 0.8c 1.0c 
speed u (m/s) 


Relativistic momentum 
approaches infinity as the 
velocity of an object 
approaches the speed of 
light. 


Note: 

Misconception Alert: Relativistic Mass and Momentum 

The relativistically correct definition of momentum as p = ymu is sometimes taken to imply that mass 
varies with velocity: myar = ym, particularly in older textbooks. However, note that m is the mass of 


the object as measured by a person at rest relative to the object. Thus, m is defined to be the rest mass, 
which could be measured at rest, perhaps using gravity. When a mass is moving relative to an 
observer, the only way that its mass can be determined is through collisions or other means in which 
momentum is involved. Since the mass of a moving object cannot be determined independently of 
momentum, the only meaningful mass is rest mass. Thus, when we use the term mass, assume it to be 
identical to rest mass. 


Relativistic momentum is defined in such a way that the conservation of momentum will hold in all 
inertial frames. Whenever the net external force on a system is zero, relativistic momentum is 
conserved, just as is the case for classical momentum. This has been verified in numerous experiments. 


In Relativistic Energy, the relationship of relativistic momentum to energy is explored. That subject 
will produce our first inkling that objects without mass may also have momentum. 

Exercise: 

Check Your Understanding 


Problem: 


What is the momentum of an electron traveling at a speed 0.985c? The rest mass of the electron is 
9.11 x 10°-*! kg. 


Solution: 
Answer 
Equation: 


= = 1.56 x 10-7! kg - m/s 


u2 0.985c)? 
yl =e yl =e ce “ 


mu (9.11 x 10-*! kg)(0.985)(3.00 x 108 m/s) 
p=ymu = 
ce 


Section Summary 


¢ The law of conservation of momentum is valid whenever the net external force is zero and for 
relativistic momentum. Relativistic momentum p is classical momentum multiplied by the 
relativistic factor y. 

¢ p=ymu, where m™ is the rest mass of the object, u is its velocity relative to an observer, and the 
relativistic factor y = J L = 
aa 

e At low velocities, relativistic momentum is equivalent to classical momentum. 

¢ Relativistic momentum approaches infinity as u approaches c. This implies that an object with 
mass cannot reach the speed of light. 

e Relativistic momentum is conserved, just as classical momentum is conserved. 


Conceptual Questions 


Exercise: 


Problem: How does modern relativity modify the law of conservation of momentum? 


Exercise: 
Problem: 


Is it possible for an external force to be acting on a system and relativistic momentum to be 
conserved? Explain. 


Problem Exercises 


Exercise: 
Problem: 


Find the momentum of a helium nucleus having a mass of 6.68 x 10°?” kg that is moving at 
0.200c. 


Solution: 


4.09 x 10°! kg- m/s 


Exercise: 


Problem: What is the momentum of an electron traveling at 0.980c? 
Exercise: 
Problem: 
(a) Find the momentum of a 1.00 x 10° kg asteroid heading towards the Earth at 30.0 km /s. (b) 


Find the ratio of this momentum to the classical momentum. (Hint: Use the approximation that 
y =1+4 (1/2)v?/c? at low velocities.) 


Solution: 
(a) 3.000000015 x 101% kg - m/s. 
(b) Ratio of relativistic to classical momenta equals 1.000000005 (extra digits to show small 
effects) 
Exercise: 
Problem: 
(a) What is the momentum of a 2000 kg satellite orbiting at 4.00 km/s? (b) Find the ratio of this 


momentum to the classical momentum. (Hint: Use the approximation that y = 1 + (1/2)v?/c? at 
low velocities.) 


Exercise: 


Problem: 


What is the velocity of an electron that has a momentum of 3.04 x 10°! kg-m /s? Note that you 
must calculate the velocity to at least four digits to see the difference from c. 


Solution: 


2.9957 x 10° m/s 


Exercise: 


Problem: Find the velocity of a proton that has a momentum of 4.48 x -10-! kg-m/s. 
Exercise: 

Problem: 

(a) Calculate the speed of a 1.00-yg particle of dust that has the same momentum as a proton 


moving at 0.999c. (b) What does the small speed tell us about the mass of a proton compared to 
even a tiny amount of macroscopic matter? 


Solution: 
(a) 1.121 x 10° m/s 


(b) The small speed tells us that the mass of a proton is substantially smaller than that of even a 
tiny amount of macroscopic matter! 

Exercise: 
Problem: 


(a) Calculate -+y for a proton that has a momentum of 1.00 kg-m/s. (b) What is its speed? Such 
protons form a rare component of cosmic radiation with uncertain origins. 


Glossary 


relativistic momentum 
p, the momentum of an object moving at relativistic velocity; p = ymu, where m is the rest mass 
of the object, u is its velocity relative to an observer, and the relativistic factor y = 


2 
jy“ 
my) 


= 


rest mass 
the mass of an object as measured by a person at rest relative to the object 


Relativistic Energy 


¢ Compute total energy of a relativistic object. 

¢ Compute the kinetic energy of a relativistic object. 

e Describe rest energy, and explain how it can be converted to other forms. 
e Explain why massive particles cannot reach C. 


The National Spherical 
Torus Experiment 
(NSTX) has a fusion 
reactor in which 
hydrogen isotopes 
undergo fusion to 
produce helium. In this 
process, a relatively small 
mass of fuel is converted 
into a large amount of 
energy. (credit: Princeton 
Plasma Physics 
Laboratory) 


A tokamak is a form of experimental fusion reactor, which can change mass to energy. 
Accomplishing this requires an understanding of relativistic energy. Nuclear reactors are 
proof of the conservation of relativistic energy. 


Conservation of energy is one of the most important laws in physics. Not only does energy 
have many important forms, but each form can be converted to any other. We know that 
classically the total amount of energy in a system remains constant. Relativistically, energy 
is still conserved, provided its definition is altered to include the possibility of mass 
changing to energy, as in the reactions that occur within a nuclear reactor. Relativistic 
energy is intentionally defined so that it will be conserved in all inertial frames, just as is the 
case for relativistic momentum. As a consequence, we learn that several fundamental 
quantities are related in ways not known in classical physics. All of these relationships are 
verified by experiment and have fundamental consequences. The altered definition of energy 


contains some of the most fundamental and spectacular new insights into nature found in 
recent history. 


Total Energy and Rest Energy 


The first postulate of relativity states that the laws of physics are the same in all inertial 
frames. Einstein showed that the law of conservation of energy is valid relativistically, if we 
define energy to include a relativistic factor. 


Note: 

Total Energy 

Total energy £ is defined to be 
Equation: 


E =ymce’, 


1 
relative to an observer. There are many aspects of the total energy F that we will discuss— 
among them are how kinetic and potential energies are included in FE, and how EF is related 
to relativistic momentum. But first, note that at rest, total energy is not zero. Rather, when 
v = 0, we have y = 1, and an object has rest energy. 


where m is mass, c is the speed of light, y = and v is the velocity of the mass 


Note: 

Rest Energy 
Rest energy is 
Equation: 


Eo = me’. 


This is the correct form of Einstein’s most famous equation, which for the first time showed 
that energy is related to the mass of an object at rest. For example, if energy is stored in the 
object, its rest mass increases. This also implies that mass can be destroyed to release 
energy. The implications of these first two equations regarding relativistic energy are so 
broad that they were not completely recognized for some years after Einstein published 
them in 1907, nor was the experimental proof that they are correct widely recognized at 
first. Einstein, it should be noted, did understand and describe the meanings and 
implications of his theory. 


Example: 

Calculating Rest Energy: Rest Energy is Very Large 

Calculate the rest energy of a 1.00-g mass. 

Strategy 

One gram is a small mass—less than half the mass of a penny. We can multiply this mass, 
in SI units, by the speed of light squared to find the equivalent rest energy. 

Solution 


. Identify the knowns. m = 1.00 x 10-2 kg; c = 3.00 x 108 m/s 

. Identify the unknown. Eo 

. Choose the appropriate equation. Hy = mc 

. Plug the knowns into the equation. 
Equation: 


2 
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Bo = me? = (1, 00210 ke) (3.005 10° ams) 
= 9.00 x 10! kg - m?/s” 


5. Convert units. 


Noting that 1 kg - m?/ s” = 1 J, we see the rest mass energy is 
Equation: 


Eo = 9.00 x 10'° J. 


Discussion 

This is an enormous amount of energy for a 1.00-g mass. We do not notice this energy, 
because it is generally not available. Rest energy is large because the speed of light c is a 
large number and c? is a very large number, so that mc? is huge for any macroscopic mass. 
The 9.00 x 10!° J rest mass energy for 1.00 g is about twice the energy released by the 
Hiroshima atomic bomb and about 10,000 times the kinetic energy of a large aircraft 
carrier. If a way can be found to convert rest mass energy into some other form (and all 
forms of energy can be converted into one another), then huge amounts of energy can be 
obtained from the destruction of mass. 


Today, the practical applications of the conversion of mass into another form of energy, such 
as in nuclear weapons and nuclear power plants, are well known. But examples also existed 
when Einstein first proposed the correct form of relativistic energy, and he did describe 
some of them. Nuclear radiation had been discovered in the previous decade, and it had been 
a mystery as to where its energy originated. The explanation was that, in certain nuclear 
processes, a small amount of mass is destroyed and energy is released and carried by nuclear 
radiation. But the amount of mass destroyed is so small that it is difficult to detect that any is 
missing. Although Einstein proposed this as the source of energy in the radioactive salts 


then being studied, it was many years before there was broad recognition that mass could be 
and, in fact, commonly is converted to energy. (See [link].) 


The Sun (a) and the 
Susquehanna Steam 
Electric Station (b) both 
convert mass into energy 
—the Sun via nuclear 
fusion, the electric station 
via nuclear fission. 
(credits: (a) 
NASA/Goddard Space 
Flight Center, Scientific 
Visualization Studio; (b) 
U.S. government) 


Because of the relationship of rest energy to mass, we now consider mass to be a form of 
energy rather than something separate. There had not even been a hint of this prior to 
Einstein’s work. Such conversion is now known to be the source of the Sun’s energy, the 
energy of nuclear decay, and even the source of energy keeping Earth’s interior hot. 


Stored Energy and Potential Energy 


What happens to energy stored in an object at rest, such as the energy put into a battery by 
charging it, or the energy stored in a toy gun’s compressed spring? The energy input 
becomes part of the total energy of the object and, thus, increases its rest mass. All stored 
and potential energy becomes mass in a system. Why is it we don’t ordinarily notice this? In 
fact, conservation of mass (meaning total mass is constant) was one of the great laws 
verified by 19th-century science. Why was it not noticed to be incorrect? The following 
example helps answer these questions. 


Example: 

Calculating Rest Mass: A Small Mass Increase due to Energy Input 

A car battery is rated to be able to move 600 ampere-hours (A-h) of charge at 12.0 V. (a) 
Calculate the increase in rest mass of such a battery when it is taken from being fully 
depleted to being fully charged. (b) What percent increase is this, given the battery’s mass 
is 20.0 kg? 

Strategy 

In part (a), we first must find the energy stored in the battery, which equals what the battery 
can supply in the form of electrical potential energy. Since PE jee = qV, we have to 
calculate the charge q in 600 A-h, which is the product of the current J and the time t. We 
then multiply the result by 12.0 V. We can then calculate the battery’s increase in mass 
using AE = PEg¢jec = (Am)c?. Part (b) is a simple ratio converted to a percentage. 
Solution for (a) 


1. Identify the knowns. J -¢ = 600 A - h; V = 12.0 V; c = 3.00 x 10° m/s 
2. Identify the unknown. Am 
3. Choose the appropriate equation. PEgtee = (Am)c? 
4, Rearrange the equation to solve for the unknown. Am = 
5. Plug the knowns into the equation. 

Equation: 


PEelec 
C2 


PE elec 
Am = = 


(600 A-h)(12.0 V) 
(3.00 108)? 


Write amperes A as coulombs per second (C/s), and convert hours to seconds. 
Equation: 


(600 C/s-h( 28%) (12.0 J/C) 
(3.00 x 108 m/s)? 
(2.16 10° C)(12.0 J/C) 
(3.00x10° m/s)? 


Using the conversion 1 kg - m?/ s” = 1 J, we can write the mass as 
Aim = 2.885 10e ke 


Solution for (b) 


1. Identify the knowns. Am = 2.88 x 107-1? kg; m = 20.0 kg 
2. Identify the unknown. % change 

3. Choose the appropriate equation. % increase = Am. x 100% 
4. Plug the knowns into the equation. 


Equation: 
% increase = Am x 100% 
_—-2.88x107 kg 
= oe 100% 
= 1.44 x 109%. 
Discussion 


Both the actual increase in mass and the percent increase are very small, since energy is 
divided by c?, a very large number. We would have to be able to measure the mass of the 
battery to a precision of a billionth of a percent, or 1 part in 101!, to notice this increase. It 
is no wonder that the mass variation is not readily observed. In fact, this change in mass is 
so small that we may question how you could verify it is real. The answer is found in 
nuclear processes in which the percentage of mass destroyed is large enough to be 
measured. The mass of the fuel of a nuclear reactor, for example, is measurably smaller 
when its energy has been used. In that case, stored energy has been released (converted 
mostly to heat and electricity) and the rest mass has decreased. This is also the case when 
you use the energy stored in a battery, except that the stored energy is much greater in 
nuclear processes, making the change in mass measurable in practice as well as in theory. 


Kinetic Energy and the Ultimate Speed Limit 


Kinetic energy is energy of motion. Classically, kinetic energy has the familiar expression 
smu". The relativistic expression for kinetic energy is obtained from the work-energy 


theorem. This theorem states that the net work on a system goes into kinetic energy. If our 
system starts from rest, then the work-energy theorem is 
Equation: 


Wret = KE. 


Relativistically, at rest we have rest energy Ey = mc?. The work increases this to the total 
energy E = ymc?. Thus, 
Equation: 


Ware = E— Ey = yme? — mc? = (y — 1) mc’. 


Relativistically, we have Wyet = KE ye). 


Note: 

Relativistic Kinetic Energy 
Relativistic kinetic energy is 
Equation: 


KE, = (y — 1) me’. 


When motionless, we have v = 0 and 
Equation: 


so that KE.) = 0 at rest, as expected. But the expression for relativistic kinetic energy 
(such as total energy and rest energy) does not look much like the classical smv*. To show 
that the classical expression for kinetic energy is obtained at low velocities, we note that the 
binomial expansion for yy at low velocities gives 

Equation: 


saat 
LS 2 2° 


A binomial expansion is a way of expressing an algebraic quantity as a sum of an infinite 
series of terms. In some cases, as in the limit of small velocity here, most terms are very 
small. Thus the expression derived for 7 here is not exact, but it is a very accurate 
approximation. Thus, at low velocities, 

Equation: 


Entering this into the expression for relativistic kinetic energy gives 
Equation: 
1 v? 


KE vel a 5 


1 
5 >| mc? = ym = K Bigs. 
Cc 


So, in fact, relativistic kinetic energy does become the same as classical kinetic energy when 
V<<e. 


It is even more interesting to investigate what happens to kinetic energy when the velocity 
of an object approaches the speed of light. We know that -y becomes infinite as v approaches 
c, so that KE;e) also becomes infinite as the velocity approaches the speed of light. (See 
[link].) An infinite amount of work (and, hence, an infinite amount of energy input) is 
required to accelerate a mass to the speed of light. 


Note: 
The Speed of Light 
No object with mass can attain the speed of light. 


So the speed of light is the ultimate speed limit for any particle having mass. All of this is 
consistent with the fact that velocities less than c always add to less than c. Both the 
relativistic form for kinetic energy and the ultimate speed limit being c have been confirmed 
in detail in numerous experiments. No matter how much energy is put into accelerating a 
mass, its velocity can only approach—not reach—the speed of light. 


* 
=) 


o 
o 


2.0 


Kinetic Energy, KE (J) 


0 0.2c¢ 0.4c 0.6c 0.8¢ 
Speed v (m/s) 


This graph of KE; ¢ 


versus velocity shows 
how kinetic energy 
approaches infinity as 
velocity approaches the 
speed of light. It is thus 
not possible for an object 
having mass to reach the 
speed of light. Also 
shown is KE \gjass, the 
classical kinetic energy, 
which is similar to 
relativistic kinetic energy 
at low velocities. Note 
that much more energy is 
required to reach high 
velocities than predicted 
classically. 


Example: 

Comparing Kinetic Energy: Relativistic Energy Versus Classical Kinetic Energy 
An electron has a velocity v = 0.990c. (a) Calculate the kinetic energy in MeV of the 
electron. (b) Compare this with the classical value for kinetic energy at this velocity. (The 
mass of an electron is 9.11 x 10°! kg.) 

Strategy 

The expression for relativistic kinetic energy is always correct, but for (a) it must be used 
since the velocity is highly relativistic (close to c). First, we will calculate the relativistic 
factor y, and then use it to determine the relativistic kinetic energy. For (b), we will 
calculate the classical kinetic energy (which would be close to the relativistic value if v 
were less than a few percent of c) and see that it is not the same. 

Solution for (a) 


1. Identify the knowns. v = 0.990c; m = 9.11 x 107°! kg 
2. Identify the unknown. KE; 

3. Choose the appropriate equation. KEye) = (y — 1) mc? 
4. Plug the knowns into the equation. 


First calculate y. We will carry extra digits because this is an intermediate calculation. 
Equation: 


BR 


v/1—(0.990)? 
= 7.0888 


Next, we use this value to calculate the kinetic energy. 
Equation: 
KEva = (y-1) mc? 
(7.0888 — 1)(9.11 x 10°*! kg)(3.00 x 10° m/s)? 
4.99x 108 J 


5. Convert units. 
Equation: 


KE vei 


-13 1 MeV. 
(4.99 x 10 Die) 
3.12 MeV 


Solution for (b) 


1. List the knowns. v = 0.990c; m = 9.11 x 10-7! kg 
2. List the unknown. KE ass 
3. Choose the appropriate equation. KE glass = +mv 
4. Plug the knowns into the equation. 

Equation: 
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KE glass = smu" 
= (9.00 x 10° kg)(0.990)?(3.00 x 10° m/s)? 
402107771 


5. Convert units. 
Equation: 


at -14 7/__1MeV__ 
KEgass = 4.02 x 10 a( 1.60x10 8 i) 
= 0.251 MeV 


Discussion 


As might be expected, since the velocity is 99.0% of the speed of light, the classical kinetic 
energy is significantly off from the correct relativistic value. Note also that the classical 
value is much smaller than the relativistic value. In fact, KEye1/KEy,,, = 12.4 here. This 
is some indication of how difficult it is to get a mass moving close to the speed of light. 
Much more energy is required than predicted classically. Some people interpret this extra 
energy as going into increasing the mass of the system, but, as discussed in Relativistic 
Momentum, this cannot be verified unambiguously. What is certain is that ever-increasing 
amounts of energy are needed to get the velocity of a mass a little closer to that of light. An 
energy of 3 MeV is a very small amount for an electron, and it can be achieved with 
present-day particle accelerators. SLAC, for example, can accelerate electrons to over 

50 x 10° eV = 50,000 MeV. 

Is there any point in getting v a little closer to c than 99.0% or 99.9%? The answer is yes. 
We learn a great deal by doing this. The energy that goes into a high-velocity mass can be 
converted to any other form, including into entirely new masses. (See [link].) Most of what 
we know about the substructure of matter and the collection of exotic short-lived particles 
in nature has been learned this way. Particles are accelerated to extremely relativistic 
energies and made to collide with other particles, producing totally new species of particles. 
Patterns in the characteristics of these previously unknown particles hint at a basic 
substructure for all matter. These particles and some of their characteristics will be covered 
in Particle Physics. 


The Fermi National 
Accelerator Laboratory, 
near Batavia, Illinois, was 
a subatomic particle 
collider that accelerated 
protons and antiprotons to 
attain energies up to 1 
Tev (a trillion 
electronvolts). The 
circular ponds near the 
rings were built to 
dissipate waste heat. This 
accelerator was shut 
down in September 2011. 
(credit: Fermilab, Reidar 
Hahn) 


Relativistic Energy and Momentum 


We know classically that kinetic energy and momentum are related to each other, since 
Equation: 


KE dass = See os 
m 


Relativistically, we can obtain a relationship between energy and momentum by 
algebraically manipulating their definitions. This produces 
Equation: 


EP = (pe)? + (me*)?, 


where F is the relativistic total energy and p is the relativistic momentum. This relationship 
between relativistic energy and relativistic momentum is more complicated than the 
classical, but we can gain some interesting new insights by examining it. First, total energy 
is related to momentum and rest mass. At rest, Momentum is zero, and the equation gives 
the total energy to be the rest energy mc? (so this equation is consistent with the discussion 
of rest energy above). However, as the mass is accelerated, its momentum p increases, thus 
increasing the total energy. At sufficiently high velocities, the rest energy term (mc*)? 
becomes negligible compared with the momentum term (pc)?; thus, E = pc at extremely 
relativistic velocities. 


If we consider momentum p to be distinct from mass, we can determine the implications of 
the equation E* = (pc)? + (mc?)?’, for a particle that has no mass. If we take m to be zero 
in this equation, then F = pe, or p = E’/c. Massless particles have this momentum. There 
are several massless particles found in nature, including photons (these are quanta of 
electromagnetic radiation). Another implication is that a massless particle must travel at 
speed c and only at speed c. While it is beyond the scope of this text to examine the 
relationship in the equation EZ? = (pc)? + (mc?)?, in detail, we can see that the 
relationship has important implications in special relativity. 


Note: 
Problem-Solving Strategies for Relativity 


Examine the : ye ts. the Vis very close to 1, then relativistic 


situation to Relativistic v1—z quantitative effects are small and differ very 


determine that itis effects are relativistic little from the usually easier 
necessary touse related to factor. If classical calculations. 

relativity 

Identify exactly what needs to be determined in the problem (identify the unknowns). 
Make a list of what is given or can be inferred from the Look in particular for 

problem as stated (identify the knowns). information on relative velocity 
Make certain you Decide, for example, which observer sees time dilated or length 
understand the contracted before plugging into equations. If you have thought about 
conceptual aspects of who sees what, who is moving with the event being observed, who 
the problem before __ sees proper time, and so on, you will find it much easier to 

making any determine if your calculation is reasonable. 

calculations. 

Determine the primary type of You will find the section summary helpful in 
calculation to be done to find the determining whether a length contraction, relativistic 
unknowns identified above. kinetic energy, or some other concept is involved. 

Do not round As noted in the text, you must often perform your calculations to many digits 
off during the to see the desired effect. You may round off at the very end of the problem, 
calculation. but do not use a rounded number in a subsequent calculation. 


UV 


Check the answer This may be more difficult for “or relativistic effects that are in 
to see if it is relativity, since we do not encounter the wrong direction (such as a 
reasonable: Does _ it directly. But you can look for time contraction where a dilation 
it make sense? velocities greater than was expected). 

Exercise: 


Check Your Understanding 


Problem: 


A photon decays into an electron-positron pair. What is the kinetic energy of the 
electron if its speed is 0.992c? 


Solution: 
Answer 
Equation: 


KE, = (y—1)mc? = | —~— -1] me? 


= Ss —1] (9.11 x 10~*" kg)(3.00 x 10° m/s)? = 5.67 x 10°28 J 
1- (0.992c) 


ce 


Section Summary 


e Relativistic energy is conserved as long as we define it to include the possibility of 
mass changing to energy. 


° Total Energy is defined as: E = ymc?, where y = —4 


e Rest energy is Hy = mc*, meaning that mass is a form of energy. If energy is stored in 
an object, its mass increases. Mass can be destroyed to release energy. 

e We do not ordinarily notice the increase or decrease in mass of an object because the 
change in mass is so small for a large increase in energy. 

e The relativistic work-energy theorem is 
Ware = E — Ey = ymc? — mc? = (y-1) mc’. 

e Relativistically, Wnet = KE;e1 , where KE,< is the relativistic kinetic energy. 

e Relativistic kinetic energy is KE,<«. = (y — 1) mc?, where y = ce At low 
2 
velocities, relativistic kinetic energy reduces to classical kinetic energy. 

¢ No object with mass can attain the speed of light because an infinite amount of work 
and an infinite amount of energy input is required to accelerate a mass to the speed of 
light. 

¢ The equation E* = (pc)? + (mc?)? relates the relativistic total energy E and the 
relativistic momentum p. At extremely high velocities, the rest energy mc? becomes 
negligible, and & = pc. 
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Conceptual Questions 


Exercise: 
Problem: 
How are the classical laws of conservation of energy and conservation of mass 
modified by modern relativity? 

Exercise: 
Problem: 
What happens to the mass of water in a pot when it cools, assuming no molecules 
escape or are added? Is this observable in practice? Explain. 

Exercise: 
Problem: 
Consider a thought experiment. You place an expanded balloon of air on weighing 
scales outside in the early morning. The balloon stays on the scales and you are able to 


measure changes in its mass. Does the mass of the balloon change as the day 
progresses? Discuss the difficulties in carrying out this experiment. 


Exercise: 
Problem: 
The mass of the fuel in a nuclear reactor decreases by an observable amount as it puts 


out energy. Is the same true for the coal and oxygen combined in a conventional power 
plant? If so, is this observable in practice for the coal and oxygen? Explain. 


Exercise: 
Problem: 
We know that the velocity of an object with mass has an upper limit of c. Is there an 
upper limit on its momentum? Its energy? Explain. 


Exercise: 


Problem: Given the fact that light travels at c, can it have mass? Explain. 
Exercise: 


Problem: 


If you use an Earth-based telescope to project a laser beam onto the Moon, you can 
move the spot across the Moon’s surface at a velocity greater than the speed of light. 
Does this violate modern relativity? (Note that light is being sent from the Earth to the 
Moon, not across the surface of the Moon.) 


Problems & Exercises 


Exercise: 


Problem: 


What is the rest energy of an electron, given its mass is 9.11 x 10°! kg? Give your 
answer in joules and MeV. 


Solution: 
8.20x 10-4 J 


0.512 MeV 
Exercise: 


Problem: 


Find the rest energy in joules and MeV of a proton, given its mass is 1.67 x 1072" kg. 


Exercise: 


Problem: 


If the rest energies of a proton and a neutron (the two constituents of nuclei) are 938.3 
and 939.6 MeV respectively, what is the difference in their masses in kilograms? 


Solution: 


2.3 x 10-2 ke 
Exercise: 
Problem: 
The Big Bang that began the universe is estimated to have released 10°° J of energy. 


How many stars could half this energy create, assuming the average star’s mass is 
4.00 x 10°° kg? 


Exercise: 


Problem: 


A supernova explosion of a 2.00 x 10°! kg star produces 1.00 x 10“ J of energy. (a) 


How many kilograms of mass are converted to energy in the explosion? (b) What is the 
ratio Am/m of mass destroyed to the original mass of the star? 


Solution: 
(a) 1.11 x 10?” kg 


(b) 5.56 x 107° 
Exercise: 
Problem: 
(a) Using data from [link], calculate the mass converted to energy by the fission of 1.00 
kg of uranium. (b) What is the ratio of mass destroyed to the original mass, Am/m? 
Exercise: 
Problem: 
(a) Using data from [link], calculate the amount of mass converted to energy by the 
fusion of 1.00 kg of hydrogen. (b) What is the ratio of mass destroyed to the original 


mass, Am/m? (c) How does this compare with Am/m for the fission of 1.00 kg of 
uranium? 


Solution: 


7.1x 10-3 kg 


T1104 


The ratio is greater for hydrogen. 

Exercise: 
Problem: 
There is approximately 10°* J of energy available from fusion of hydrogen in the 
world’s oceans. (a) If 10°° J of this energy were utilized, what would be the decrease 
in mass of the oceans? Assume that 0.08% of the mass of a water molecule is converted 
to energy during the fusion of hydrogen. (b) How great a volume of water does this 


correspond to? (c) Comment on whether this is a significant fraction of the total mass 
of the oceans. 


Exercise: 
Problem: 
A muon has a rest mass energy of 105.7 MeV, and it decays into an electron and a 


massless particle. (a) If all the lost mass is converted into the electron’s kinetic energy, 
find + for the electron. (b) What is the electron’s velocity? 


Solution: 
208 


0.999988c 
Exercise: 
Problem: 
A m-meson is a particle that decays into a muon and a massless particle. The 7-meson 
has a rest mass energy of 139.6 MeV, and the muon has a rest mass energy of 105.7 


MeV. Suppose the 7-meson is at rest and all of the missing mass goes into the muon’s 
kinetic energy. How fast will the muon move? 


Exercise: 


Problem: 

(a) Calculate the relativistic kinetic energy of a 1000-kg car moving at 30.0 m/s if the 
speed of light were only 45.0 m/s. (b) Find the ratio of the relativistic kinetic energy to 
classical. 


Solution: 


6.92 x 10° J 


1.54 
Exercise: 
Problem: 
Alpha decay is nuclear decay in which a helium nucleus is emitted. If the helium 


nucleus has a mass of 6.80 x 10-2” kg and is given 5.00 MeV of kinetic energy, what 
is its velocity? 


Exercise: 
Problem: 
(a) Beta decay is nuclear decay in which an electron is emitted. If the electron is given 
0.750 MeV of kinetic energy, what is its velocity? (b) Comment on how the high 


velocity is consistent with the kinetic energy as it compares to the rest mass energy of 
the electron. 


Solution: 
(a) 0.914c 


(b) The rest mass energy of an electron is 0.511 MeV, so the kinetic energy is 
approximately 150% of the rest mass energy. The electron should be traveling close to 
the speed of light. 


Exercise: 
Problem: 
A positron is an antimatter version of the electron, having exactly the same mass. 
When a positron and an electron meet, they annihilate, converting all of their mass into 
energy. (a) Find the energy released, assuming negligible kinetic energy before the 
annihilation. (b) If this energy is given to a proton in the form of kinetic energy, what is 


its velocity? (c) If this energy is given to another electron in the form of kinetic energy, 
what is its velocity? 


Exercise: 
Problem: 
What is the kinetic energy in MeV of a 7-meson that lives 1.40 x 10~1° s as measured 


in the laboratory, and 0.840 x 10~1° s when at rest relative to an observer, given that 
its rest energy is 135 MeV? 


Solution: 


90.0 MeV 


Exercise: 


Problem: 


Find the kinetic energy in MeV of a neutron with a measured life span of 2065 s, given 
its rest energy is 939.6 MeV, and rest life span is 900s. 

Exercise: 
Problem: 


(a) Show that (pc)?/(mc?)? = y? — 1. This means that at large velocities pe >> mc’. 
(b) Is # = pe when y = 30.0, as for the astronaut discussed in the twin paradox? 


Solution: 


E? = p?c? + m2ct = y?m?c", so that 
(a) pc? = (7? — 1)m?c* , and therefore 


(me?)” 


(b) yes 


Exercise: 


(pe)” a? 4 


Problem: 


One cosmic ray neutron has a velocity of 0.250c relative to the Earth. (a) What is the 
neutron’s total energy in MeV? (b) Find its momentum. (c) Is & ~ pc in this situation? 
Discuss in terms of the equation given in part (a) of the previous problem. 


Exercise: 
Problem: 


What is y for a proton having a mass energy of 938.3 MeV accelerated through an 
effective potential of 1.0 TV (teravolt) at Fermilab outside Chicago? 


Solution: 


1.07 x 10° 
Exercise: 
Problem: 
(a) What is the effective accelerating potential for electrons at the Stanford Linear 


Accelerator, if y = 1.00 x 10° for them? (b) What is their total energy (nearly the 
same as kinetic in this case) in GeV? 


Exercise: 


Problem: 


(a) Using data from [link], find the mass destroyed when the energy in a barrel of crude 
oil is released. (b) Given these barrels contain 200 liters and assuming the density of 
crude oil is 750 kg/ m’°, what is the ratio of mass destroyed to original mass, Am/m? 


Solution: 
6.56 x 10 8 kg 


437% 10°-* 
Exercise: 
Problem: 
(a) Calculate the energy released by the destruction of 1.00 kg of mass. (b) How many 
kilograms could be lifted to a 10.0 km height by this amount of energy? 
Exercise: 
Problem: 
A Van de Graaff accelerator utilizes a 50.0 MV potential difference to accelerate 


charged particles such as protons. (a) What is the velocity of a proton accelerated by 
such a potential? (b) An electron? 


Solution: 
0.314c 


0.99995c 
Exercise: 
Problem: 
Suppose you use an average of 500 kW-h of electric energy per month in your home. 
(a) How long would 1.00 g of mass converted to electric energy with an efficiency of 


38.0% last you? (b) How many homes could be supplied at the 500 kW-h per month 
rate for one year by the energy from the described mass conversion? 


Exercise: 
Problem: 
(a) A nuclear power plant converts energy from nuclear fission into electricity with an 
efficiency of 35.0%. How much mass is destroyed in one year to produce a continuous 


1000 MW of electric power? (b) Do you think it would be possible to observe this mass 
loss if the total mass of the fuel is 10* kg? 


Solution: 
(a) 1.00 kg 
(b) This much mass would be measurable, but probably not observable just by looking 
because it is 0.01% of the total mass. 

Exercise: 
Problem: 
Nuclear-powered rockets were researched for some years before safety concerns 
became paramount. (a) What fraction of a rocket’s mass would have to be destroyed to 
get it into a low Earth orbit, neglecting the decrease in gravity? (Assume an orbital 
altitude of 250 km, and calculate both the kinetic energy (classical) and the 


gravitational potential energy needed.) (b) If the ship has a mass of 1.00 x 10° kg (100 
tons), what total yield nuclear explosion in tons of TNT is needed? 


Exercise: 
Problem: 
The Sun produces energy at a rate of 4.00 x 107° w by the fusion of hydrogen. (a) 
How many kilograms of hydrogen undergo fusion each second? (b) If the Sun is 90.0% 
hydrogen and half of this can undergo fusion before the Sun changes character, how 
long could it produce energy at its current rate? (c) How many kilograms of mass is the 


Sun losing per second? (d) What fraction of its mass will it have lost in the time found 
in part (b)? 


Solution: 
(a) 6.3 x 10" kg/s 
(b) 4.5 x 10° y 
(c) 4.44 x 10° kg 
(d) 0.32% 
Exercise: 
Problem: Unreasonable Results 
A proton has a mass of 1.67 x 10~?’ kg. A physicist measures the proton’s total 
energy to be 50.0 MeV. (a) What is the proton’s kinetic energy? (b) What is 


unreasonable about this result? (c) Which assumptions are unreasonable or 
inconsistent? 


Exercise: 


Problem: Construct Your Own Problem 


Consider a highly relativistic particle. Discuss what is meant by the term “highly 
relativistic.” (Note that, in part, it means that the particle cannot be massless.) 
Construct a problem in which you calculate the wavelength of such a particle and show 
that it is very nearly the same as the wavelength of a massless particle, such as a 
photon, with the same energy. Among the things to be considered are the rest energy of 
the particle (it should be a known particle) and its total energy, which should be large 
compared to its rest energy. 


Exercise: 


Problem: Construct Your Own Problem 


Consider an astronaut traveling to another star at a relativistic velocity. Construct a 
problem in which you calculate the time for the trip as observed on the Earth and as 
observed by the astronaut. Also calculate the amount of mass that must be converted to 
energy to get the astronaut and ship to the velocity travelled. Among the things to be 
considered are the distance to the star, the velocity, and the mass of the astronaut and 
ship. Unless your instructor directs you otherwise, do not include any energy given to 
other masses, such as rocket propellants. 


Glossary 


total energy 
defined as E = ymc?, where y = 


rest energy 


the energy stored in an object at rest: Ep = mc? 


relativistic kinetic energy 
the kinetic energy of an object moving at relativistic speeds: KE,.) = (y — 1) mc?, 


where y = ; 
Uv 
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Introduction to Atomic Physics 
class="introduction" 


Individual 
carbon 
atoms are 
visible in 
this image 
of a carbon 
nanotube 
made by a 
scanning 
tunneling 
electron 
microscope 
. (credit: 
Taner 
Yildirim, 
National 
Institute of 
Standards 
and 
Technology 
, Via 
Wikimedia 
Commons) 


> aw 
4 ® 


*e«* ae | 
TrYvyesaar 


- 


14 * 


Bean 
+) beans 


= 


Seas 
*®eaa 


a 


From childhood on, we learn that atoms are a substructure of all things 
around us, from the air we breathe to the autumn leaves that blanket a forest 
trail. Invisible to the eye, the existence and properties of atoms are used to 
explain many phenomena—a theme found throughout this text. In this 
chapter, we discuss the discovery of atoms and their own substructures; we 
then apply quantum mechanics to the description of atoms, and their 
properties and interactions. Along the way, we will find, much like the 
scientists who made the original discoveries, that new concepts emerge with 
applications far beyond the boundaries of atomic physics. 


Discovery of the Atom 
e Describe the basic structure of the atom, the substructure of all matter. 


How do we know that atoms are really there if we cannot see them with our 
eyes? A brief account of the progression from the proposal of atoms by the 
Greeks to the first direct evidence of their existence follows. 


People have long speculated about the structure of matter and the existence 
of atoms. The earliest significant ideas to survive are due to the ancient 
Greeks in the fifth century BCE, especially those of the philosophers 
Leucippus and Democritus. (There is some evidence that philosophers in 
both India and China made similar speculations, at about the same time.) 
They considered the question of whether a substance can be divided without 
limit into ever smaller pieces. There are only a few possible answers to this 
question. One is that infinitesimally small subdivision is possible. Another 
is what Democritus in particular believed—that there is a smallest unit that 
cannot be further subdivided. Democritus called this the atom. We now 
know that atoms themselves can be subdivided, but their identity is 
destroyed in the process, so the Greeks were correct in a respect. The 
Greeks also felt that atoms were in constant motion, another correct notion. 


The Greeks and others speculated about the properties of atoms, proposing 
that only a few types existed and that all matter was formed as various 
combinations of these types. The famous proposal that the basic elements 
were earth, air, fire, and water was brilliant, but incorrect. The Greeks had 
identified the most common examples of the four states of matter (solid, 
gas, plasma, and liquid), rather than the basic elements. More than 2000 
years passed before observations could be made with equipment capable of 
revealing the true nature of atoms. 


Over the centuries, discoveries were made regarding the properties of 
substances and their chemical reactions. Certain systematic features were 
recognized, but similarities between common and rare elements resulted in 
efforts to transmute them (lead into gold, in particular) for financial gain. 
Secrecy was endemic. Alchemists discovered and rediscovered many facts 
but did not make them broadly available. As the Middle Ages ended, 
alchemy gradually faded, and the science of chemistry arose. It was no 


longer possible, nor considered desirable, to keep discoveries secret. 
Collective knowledge grew, and by the beginning of the 19th century, an 
important fact was well established—the masses of reactants in specific 
chemical reactions always have a particular mass ratio. This is very strong 
indirect evidence that there are basic units (atoms and molecules) that have 
these same mass ratios. The English chemist John Dalton (1766-1844) did 
much of this work, with significant contributions by the Italian physicist 
Amedeo Avogadro (1776-1856). It was Avogadro who developed the idea 
of a fixed number of atoms and molecules in a mole, and this special 
number is called Avogadro’s number in his honor. The Austrian physicist 
Johann Josef Loschmidt was the first to measure the value of the constant in 
1865 using the kinetic theory of gases. 


Note: 

Patterns and Systematics 

The recognition and appreciation of patterns has enabled us to make many 
discoveries. The periodic table of elements was proposed as an organized 
summary of the known elements long before all elements had been 
discovered, and it led to many other discoveries. We shall see in later 
chapters that patterns in the properties of subatomic particles led to the 
proposal of quarks as their underlying structure, an idea that is still bearing 
fruit. 


Knowledge of the properties of elements and compounds grew, culminating 
in the mid-19th-century development of the periodic table of the elements 
by Dmitri Mendeleev (1834-1907), the great Russian chemist. Mendeleev 
proposed an ingenious array that highlighted the periodic nature of the 
properties of elements. Believing in the systematics of the periodic table, he 
also predicted the existence of then-unknown elements to complete it. Once 
these elements were discovered and determined to have properties predicted 
by Mendeleev, his periodic table became universally accepted. 


Also during the 19th century, the kinetic theory of gases was developed. 
Kinetic theory is based on the existence of atoms and molecules in random 


thermal motion and provides a microscopic explanation of the gas laws, 
heat transfer, and thermodynamics (see Introduction to Temperature, 
Kinetic Theory, and the Gas Laws and Introduction to Laws of 
Thermodynamics). Kinetic theory works so well that it is another strong 
indication of the existence of atoms. But it is still indirect evidence— 
individual atoms and molecules had not been observed. There were heated 
debates about the validity of kinetic theory until direct evidence of atoms 
was obtained. 


The first truly direct evidence of atoms is credited to Robert Brown, a 
Scottish botanist. In 1827, he noticed that tiny pollen grains suspended in 
still water moved about in complex paths. This can be observed with a 
microscope for any small particles in a fluid. The motion is caused by the 
random thermal motions of fluid molecules colliding with particles in the 
fluid, and it is now called Brownian motion. (See [link].) Statistical 
fluctuations in the numbers of molecules striking the sides of a visible 
particle cause it to move first this way, then that. Although the molecules 
cannot be directly observed, their effects on the particle can be. By 
examining Brownian motion, the size of molecules can be calculated. The 
smaller and more numerous they are, the smaller the fluctuations in the 
numbers striking different sides. 


5o8 O. 


The position of a 
pollen grain in 
water, Measured 
every few seconds 
under a 
microscope, 


exhibits Brownian 
motion. Brownian 
motion is due to 
fluctuations in the 
number of atoms 
and molecules 
colliding with a 
small mass, causing 
it to move about in 
complex paths. 
This is nearly direct 
evidence for the 
existence of atoms, 
providing a 
satisfactory 
alternative 
explanation cannot 
be found. 


It was Albert Einstein who, starting in his epochal year of 1905, published 
several papers that explained precisely how Brownian motion could be used 
to measure the size of atoms and molecules. (In 1905 Einstein created 
special relativity, proposed photons as quanta of EM radiation, and 
produced a theory of Brownian motion that allowed the size of atoms to be 
determined. All of this was done in his spare time, since he worked days as 
a patent examiner. Any one of these very basic works could have been the 
crowning achievement of an entire career—yet Einstein did even more in 
later years.) Their sizes were only approximately known to be 10° '° m, 
based on a comparison of latent heat of vaporization and surface tension 
made in about 1805 by Thomas Young of double-slit fame and the famous 
astronomer and mathematician Simon Laplace. 


Using Einstein’s ideas, the French physicist Jean-Baptiste Perrin (1870— 
1942) carefully observed Brownian motion; not only did he confirm 
Einstein’s theory, he also produced accurate sizes for atoms and molecules. 


Since molecular weights and densities of materials were well established, 
knowing atomic and molecular sizes allowed a precise value for Avogadro’s 
number to be obtained. (If we know how big an atom is, we know how 
many fit into a certain volume.) Perrin also used these ideas to explain 
atomic and molecular agitation effects in sedimentation, and he received the 
1926 Nobel Prize for his achievements. Most scientists were already 
convinced of the existence of atoms, but the accurate observation and 
analysis of Brownian motion was conclusive—it was the first truly direct 
evidence. 


A huge array of direct and indirect evidence for the existence of atoms now 
exists. For example, it has become possible to accelerate ions (much as 
electrons are accelerated in cathode-ray tubes) and to detect them 
individually as well as measure their masses (see More Applications of 
Magnetism for a discussion of mass spectrometers). Other devices that 
observe individual atoms, such as the scanning tunneling electron 
microscope, will be discussed elsewhere. (See [link].) All of our 
understanding of the properties of matter is based on and consistent with the 
atom. The atom’s substructures, such as electron shells and the nucleus, are 
both interesting and important. The nucleus in turn has a substructure, as do 
the particles of which it is composed. These topics, and the question of 
whether there is a smallest basic structure to matter, will be explored in later 
parts of the text. 


Individual atoms 
can be detected 
with devices such 
as the scanning 
tunneling electron 


microscope that 
produced this 
image of individual 
gold atoms ona 
graphite substrate. 
(credit: Erwin 
Rossen, Eindhoven 
University of 
Technology, via 
Wikimedia 
Commons) 


Section Summary 


e Atoms are the smallest unit of elements; atoms combine to form 
molecules, the smallest unit of compounds. 

e The first direct observation of atoms was in Brownian motion. 

e Analysis of Brownian motion gave accurate sizes for atoms (101° m 
on average) and a precise value for Avogadro’s number. 


Conceptual Questions 


Exercise: 


Problem: 


Name three different types of evidence for the existence of atoms. 
Exercise: 

Problem: 

Explain why patterns observed in the periodic table of the elements are 


evidence for the existence of atoms, and why Brownian motion is a 
more direct type of evidence for their existence. 


Exercise: 


Problem: If atoms exist, why can’t we see them with visible light? 


Problems & Exercises 


Exercise: 
Problem: 
Using the given charge-to-mass ratios for electrons and protons, and 
knowing the magnitudes of their charges are equal, what is the ratio of 
the proton’s mass to the electron’s? (Note that since the charge-to-mass 


ratios are given to only three-digit accuracy, your answer may differ 
from the accepted ratio in the fourth digit.) 


Solution: 


1.84 x 10° 
Exercise: 
Problem: 
(a) Calculate the mass of a proton using the charge-to-mass ratio given 


for it in this chapter and its known charge. (b) How does your result 
compare with the proton mass given in this chapter? 


Exercise: 
Problem: 
If someone wanted to build a scale model of the atom with a nucleus 
1.00 m in diameter, how far away would the nearest electron need to 
be? 


Solution: 


50 km 


Glossary 


atom 
basic unit of matter, which consists of a central, positively charged 
nucleus surrounded by negatively charged electrons 


Brownian motion 
the continuous random movement of particles of matter suspended in a 
liquid or gas 


Discovery of the Parts of the Atom: Electrons and Nuclei 


e Describe how electrons were discovered. 

e Explain the Millikan oil drop experiment. 

¢ Describe Rutherford’s gold foil experiment. 

e Describe Rutherford’s planetary model of the atom. 


Just as atoms are a substructure of matter, electrons and nuclei are 
substructures of the atom. The experiments that were used to discover 
electrons and nuclei reveal some of the basic properties of atoms and can be 
readily understood using ideas such as electrostatic and magnetic force, 
already covered in previous chapters. 


Note: 

Charges and Electromagnetic Forces 

In previous discussions, we have noted that positive charge is associated 
with nuclei and negative charge with electrons. We have also covered 
many aspects of the electric and magnetic forces that affect charges. We 
will now explore the discovery of the electron and nucleus as substructures 
of the atom and examine their contributions to the properties of atoms. 


The Electron 


Gas discharge tubes, such as that shown in [link], consist of an evacuated 
glass tube containing two metal electrodes and a rarefied gas. When a high 
voltage is applied to the electrodes, the gas glows. These tubes were the 
precursors to today’s neon lights. They were first studied seriously by 
Heinrich Geissler, a German inventor and glassblower, starting in the 
1860s. The English scientist William Crookes, among others, continued to 
study what for some time were called Crookes tubes, wherein electrons are 
freed from atoms and molecules in the rarefied gas inside the tube and are 
accelerated from the cathode (negative) to the anode (positive) by the high 
potential. These “cathode rays” collide with the gas atoms and molecules 
and excite them, resulting in the emission of electromagnetic (EM) 


radiation that makes the electrons’ path visible as a ray that spreads and 
fades as it moves away from the cathode. 


Gas discharge tubes today are most commonly called cathode-ray tubes, 
because the rays originate at the cathode. Crookes showed that the electrons 
carry momentum (they can make a small paddle wheel rotate). He also 
found that their normally straight path is bent by a magnet in the direction 
expected for a negative charge moving away from the cathode. These were 
the first direct indications of electrons and their charge. 


A gas discharge tube 
glows when a high 
voltage is applied to it. 
Electrons emitted from 
the cathode are 
accelerated toward the 
anode; they excite atoms 
and molecules in the gas, 
which glow in response. 
Once called Geissler 
tubes and later Crookes 
tubes, they are now 
known as cathode-ray 


tubes (CRTs) and are 
found in older TVs, 
computer screens, and x- 
ray machines. When a 
magnetic field is applied, 
the beam bends in the 
direction expected for 
negative charge. (credit: 
Paul Downey, Flickr) 


The English physicist J. J. Thomson (1856-1940) improved and expanded 
the scope of experiments with gas discharge tubes. (See [link] and [link].) 
He verified the negative charge of the cathode rays with both magnetic and 
electric fields. Additionally, he collected the rays in a metal cup and found 
an excess of negative charge. Thomson was also able to measure the ratio of 
the charge of the electron to its mass, g- /7™-—an important step to finding 
the actual values of both ge and me. [link] shows a cathode-ray tube, which 
produces a narrow beam of electrons that passes through charging plates 
connected to a high-voltage power supply. An electric field E is produced 
between the charging plates, and the cathode-ray tube is placed between the 
poles of a magnet so that the electric field E is perpendicular to the 
magnetic field B of the magnet. These fields, being perpendicular to each 
other, produce opposing forces on the electrons. As discussed for mass 
spectrometers in More Applications of Magnetism, if the net force due to 
the fields vanishes, then the velocity of the charged particle is v = F/B. In 
this manner, Thomson determined the velocity of the electrons and then 
moved the beam up and down by adjusting the electric field. 


J. J. Thomson (credit: 
www.firstworldwar.com 
, via Wikimedia 
Commons) 


Diagram of Thomson’s CRT. 
(credit: Kurzon, Wikimedia 
Commons) 


Wire to high- 
voltage supply 


Anode 


Cathode 


a 


This schematic shows the electron beam in a CRT 
passing through crossed electric and magnetic fields and 
causing phosphor to glow when striking the end of the 
tube. 


To see how the amount of deflection is used to calculate q, /m,, note that 
the deflection is proportional to the electric force on the electron: 
Equation: 


=o. 


But the vertical deflection is also related to the electron’s mass, since the 
electron’s acceleration is 
Equation: 


The value of F’ is not known, since ge was not yet known. Substituting the 
expression for electric force into the expression for acceleration yields 


Equation: 


F ge 
a= = 
Me Me 
Gathering terms, we have 
Equation: 
de @ 
Me E 


The deflection is analyzed to get a, and —& is determined from the applied 


voltage and distance between the plates; thus, fe can be determined. With 


“@<- can be obtained by 


the velocity known, another measurement of 
bending the beam of electrons with the magnetic field. Since 

Frnag = UeVB = mea, we have ge/me = a/vB. Consistent results are 
obtained using magnetic deflection. 


What is so important about q, /7™m¢, the ratio of the electron’s charge to its 
mass? The value obtained is 
Equation: 


de 
Me 


— —1.76 x 101! C/kg (electron). 


This is a huge number, as Thomson realized, and it implies that the electron 
has a very small mass. It was known from electroplating that about 

10° C /kg is needed to plate a material, a factor of about 1000 less than the 
charge per kilogram of electrons. Thomson went on to do the same 
experiment for positively charged hydrogen ions (now known to be bare 
protons) and found a charge per kilogram about 1000 times smaller than 
that for the electron, implying that the proton is about 1000 times more 
massive than the electron. Today, we know more precisely that 

Equation: 


me = 9.58 x 10’ C/kg (proton), 


Pp 


where q, is the charge of the proton and m, is its mass. This ratio (to four 
significant figures) is 1836 times less charge per kilogram than for the 
electron. Since the charges of electrons and protons are equal in magnitude, 
this implies m, = 1836m, . 


Thomson performed a variety of experiments using differing gases in 
discharge tubes and employing other methods, such as the photoelectric 
effect, for freeing electrons from atoms. He always found the same 
properties for the electron, proving it to be an independent particle. For his 
work, the important pieces of which he began to publish in 1897, Thomson 
was awarded the 1906 Nobel Prize in Physics. In retrospect, it is difficult to 
appreciate how astonishing it was to find that the atom has a substructure. 
Thomson himself said, “It was only when I was convinced that the 
experiment left no escape from it that I published my belief in the existence 
of bodies smaller than atoms.” 


Thomson attempted to measure the charge of individual electrons, but his 
method could determine its charge only to the order of magnitude expected. 


Since Faraday’s experiments with electroplating in the 1830s, it had been 
known that about 100,000 C per mole was needed to plate singly ionized 
ions. Dividing this by the number of ions per mole (that is, by Avogadro’s 
number), which was approximately known, the charge per ion was 
calculated to be about 1.6 x 107!" G, close to the actual value. 


An American physicist, Robert Millikan (1868-1953) (see [link]), decided 
to improve upon Thomson’s experiment for measuring q, and was 
eventually forced to try another approach, which is now a classic 
experiment performed by students. The Millikan oil drop experiment is 
shown in [link]. 


Robert Millikan 
(credit: Unknown 
Author, via 
Wikimedia 
Commons) 


3 we 
— Atomizer 


The Millikan oil 
drop experiment 
produced the first 
accurate direct 
measurement of the 


charge on 
electrons, one of 
the most 
fundamental 
constants in nature. 
Fine drops of oil 
become charged 
when sprayed. 
Their movement is 
observed between 
metal plates with a 
potential applied to 
oppose the 
gravitational force. 
The balance of 
gravitational and 
electric forces 
allows the 
calculation of the 
charge on a drop. 
The charge is found 
to be quantized in 
units of 
—1.6 x 10°°C, 
thus determining 
directly the charge 
of the excess and 
missing electrons 
on the oil drops. 


In the Millikan oil drop experiment, fine drops of oil are sprayed from an 
atomizer. Some of these are charged by the process and can then be 
suspended between metal plates by a voltage between the plates. In this 


situation, the weight of the drop is balanced by the electric force: 
Equation: 


MdropJ = qeE 


The electric field is produced by the applied voltage, hence, & = V//d, and 
V is adjusted to just balance the drop’s weight. The drops can be seen as 
points of reflected light using a microscope, but they are too small to 
directly measure their size and mass. The mass of the drop is determined by 
observing how fast it falls when the voltage is turned off. Since air 
resistance is very significant for these submicroscopic drops, the more 
massive drops fall faster than the less massive, and sophisticated 
sedimentation calculations can reveal their mass. Oil is used rather than 
water, because it does not readily evaporate, and so mass is nearly constant. 
Once the mass of the drop is known, the charge of the electron is given by 
rearranging the previous equation: 

Equation: 


a MdropY _ Maropgd 
q == E i-= V ) 


where d is the separation of the plates and V is the voltage that holds the 
drop motionless. (The same drop can be observed for several hours to see 
that it really is motionless.) By 1913 Millikan had measured the charge of 
the electron g. to an accuracy of 1%, and he improved this by a factor of 10 
within a few years to a value of —1.60 x 10-'° C. He also observed that 
all charges were multiples of the basic electron charge and that sudden 
changes could occur in which electrons were added or removed from the 
drops. For this very fundamental direct measurement of gq, and for his 
studies of the photoelectric effect, Millikan was awarded the 1923 Nobel 
Prize in Physics. 


With the charge of the electron known and the charge-to-mass ratio known, 
the electron’s mass can be calculated. It is 
Equation: 


Substituting known values yields 


Equation: 
—1.60 x 10°C 
Me SE 
—1.76 x 10"! C/kg 
or 
Equation: 


Me = 9.11 x 107%! kg (electron’s mass), 


where the round-off errors have been corrected. The mass of the electron 
has been verified in many subsequent experiments and is now known to an 
accuracy of better than one part in one million. It is an incredibly small 
mass and remains the smallest known mass of any particle that has mass. 
(Some particles, such as photons, are massless and cannot be brought to 
rest, but travel at the speed of light.) A similar calculation gives the masses 
of other particles, including the proton. To three digits, the mass of the 
proton is now known to be 

Equation: 


m, = 1.67 x 10°*" kg (proton’s mass), 


which is nearly identical to the mass of a hydrogen atom. What Thomson 
and Millikan had done was to prove the existence of one substructure of 
atoms, the electron, and further to show that it had only a tiny fraction of 
the mass of an atom. The nucleus of an atom contains most of its mass, and 
the nature of the nucleus was completely unanticipated. 


Another important characteristic of quantum mechanics was also beginning 
to emerge. All electrons are identical to one another. The charge and mass 
of electrons are not average values; rather, they are unique values that all 
electrons have. This is true of other fundamental entities at the 
submicroscopic level. All protons are identical to one another, and so on. 


The Nucleus 


Here, we examine the first direct evidence of the size and mass of the 
nucleus. In later chapters, we will examine many other aspects of nuclear 
physics, but the basic information on nuclear size and mass is so important 
to understanding the atom that we consider it here. 


Nuclear radioactivity was discovered in 1896, and it was soon the subject of 
intense study by a number of the best scientists in the world. Among them 
was New Zealander Lord Ernest Rutherford, who made numerous 
fundamental discoveries and earned the title of “father of nuclear physics.” 
Born in Nelson, Rutherford did his postgraduate studies at the Cavendish 
Laboratories in England before taking up a position at McGill University in 
Canada where he did the work that earned him a Nobel Prize in Chemistry 
in 1908. In the area of atomic and nuclear physics, there is much overlap 
between chemistry and physics, with physics providing the fundamental 
enabling theories. He returned to England in later years and had six future 
Nobel Prize winners as students. Rutherford used nuclear radiation to 
directly examine the size and mass of the atomic nucleus. The experiment 
he devised is shown in [link]. A radioactive source that emits alpha 
radiation was placed in a lead container with a hole in one side to produce a 
beam of alpha particles, which are a type of ionizing radiation ejected by 
the nuclei of a radioactive source. A thin gold foil was placed in the beam, 
and the scattering of the alpha particles was observed by the glow they 
caused when they struck a phosphor screen. 


Source of 
@ particles 


Rutherford’s experiment gave direct evidence for 
the size and mass of the nucleus by scattering 
alpha particles from a thin gold foil. Alpha 
particles with energies of about 5 MeV are emitted 
from a radioactive source (which is a small metal 
container in which a specific amount of a 
radioactive material is sealed), are collimated into 
a beam, and fall upon the foil. The number of 
particles that penetrate the foil or scatter to various 
angles indicates that gold nuclei are very small and 
contain nearly all of the gold atom’s mass. This is 
particularly indicated by the alpha particles that 
scatter to very large angles, much like a soccer ball 
bouncing off a goalie’s head. 


Alpha particles were known to be the doubly charged positive nuclei of 
helium atoms that had kinetic energies on the order of 5 MeV when emitted 
in nuclear decay, which is the disintegration of the nucleus of an unstable 
nuclide by the spontaneous emission of charged particles. These particles 
interact with matter mostly via the Coulomb force, and the manner in which 
they scatter from nuclei can reveal nuclear size and mass. This is analogous 
to observing how a bowling ball is scattered by an object you cannot see 
directly. Because the alpha particle’s energy is so large compared with the 
typical energies associated with atoms (MeV versus eV), you would expect 
the alpha particles to simply crash through a thin foil much like a 
supersonic bowling ball would crash through a few dozen rows of bowling 
pins. Thomson had envisioned the atom to be a small sphere in which equal 
amounts of positive and negative charge were distributed evenly. The 
incident massive alpha particles would suffer only small deflections in such 
a model. Instead, Rutherford and his collaborators found that alpha particles 
occasionally were scattered to large angles, some even back in the direction 
from which they came! Detailed analysis using conservation of momentum 
and energy—particularly of the small number that came straight back— 
implied that gold nuclei are very small compared with the size of a gold 
atom, contain almost all of the atom’s mass, and are tightly bound. Since 


the gold nucleus is several times more massive than the alpha particle, a 
head-on collision would scatter the alpha particle straight back toward the 
source. In addition, the smaller the nucleus, the fewer alpha particles that 
would hit one head on. 


Although the results of the experiment were published by his colleagues in 
1909, it took Rutherford two years to convince himself of their meaning. 
Like Thomson before him, Rutherford was reluctant to accept such radical 
results. Nature on a small scale is so unlike our classical world that even 
those at the forefront of discovery are sometimes surprised. Rutherford later 
wrote: “It was almost as incredible as if you fired a 15-inch shell at a piece 
of tissue paper and it came back and hit you. On consideration, I realized 
that this scattering backwards ... [meant] ... the greatest part of the mass of 
the atom was concentrated in a tiny nucleus.” In 1911, Rutherford published 
his analysis together with a proposed model of the atom. The size of the 
nucleus was determined to be about 107° m, or 100,000 times smaller than 
the atom. This implies a huge density, on the order of 10° g/ cm’®, vastly 
unlike any macroscopic matter. Also implied is the existence of previously 
unknown nuclear forces to counteract the huge repulsive Coulomb forces 
among the positive charges in the nucleus. Huge forces would also be 
consistent with the large energies emitted in nuclear radiation. 


The small size of the nucleus also implies that the atom is mostly empty 
inside. In fact, in Rutherford’s experiment, most alphas went straight 
through the gold foil with very little scattering, since electrons have such 
small masses and since the atom was mostly empty with nothing for the 
alpha to hit. There were already hints of this at the time Rutherford 
performed his experiments, since energetic electrons had been observed to 
penetrate thin foils more easily than expected. [link] shows a schematic of 
the atoms in a thin foil with circles representing the size of the atoms (about 
10~'° m) and dots representing the nuclei. (The dots are not to scale—if 
they were, you would need a microscope to see them.) Most alpha particles 
miss the small nuclei and are only slightly scattered by electrons. 
Occasionally, (about once in 8000 times in Rutherford’s experiment), an 
alpha hits a nucleus head-on and is scattered straight backward. 


10° m 


An expanded view of 
the atoms in the gold 
foil in Rutherford’s 
experiment. Circles 
represent the atoms 
(about 10-1? m in 
diameter), while the 
dots represent the 
nuclei (about 10-!° m 
in diameter). To be 
visible, the dots are 
much larger than 
scale. Most alpha 
particles crash through 
but are relatively 
unaffected because of 
their high energy and 
the electron’s small 
mass. Some, however, 
head straight toward a 
nucleus and are 
scattered straight back. 
A detailed analysis 
gives the size and 
mass of the nucleus. 


Based on the size and mass of the nucleus revealed by his experiment, as 
well as the mass of electrons, Rutherford proposed the planetary model of 
the atom. The planetary model of the atom pictures low-mass electrons 
orbiting a large-mass nucleus. The sizes of the electron orbits are large 
compared with the size of the nucleus, with mostly vacuum inside the atom. 
This picture is analogous to how low-mass planets in our solar system orbit 
the large-mass Sun at distances large compared with the size of the sun. In 
the atom, the attractive Coulomb force is analogous to gravitation in the 
planetary system. (See [link].) Note that a model or mental picture is 
needed to explain experimental results, since the atom is too small to be 
directly observed with visible light. 


Rutherford’s planetary 
model of the atom 
incorporates the 
characteristics of the 
nucleus, electrons, and 
the size of the atom. This 
model was the first to 
recognize the structure of 
atoms, in which low-mass 
electrons orbit a very 
small, massive nucleus in 
orbits much larger than 
the nucleus. The atom is 
mostly empty and is 
analogous to our 
planetary system. 


Rutherford’s planetary model of the atom was crucial to understanding the 
characteristics of atoms, and their interactions and energies, as we shall see 
in the next few sections. Also, it was an indication of how different nature is 
from the familiar classical world on the small, quantum mechanical scale. 
The discovery of a substructure to all matter in the form of atoms and 
molecules was now being taken a step further to reveal a substructure of 
atoms that was simpler than the 92 elements then known. We have 
continued to search for deeper substructures, such as those inside the 
nucleus, with some success. In later chapters, we will follow this quest in 
the discussion of quarks and other elementary particles, and we will look at 
the direction the search seems now to be heading. 


Note: 

PhET Explorations: Rutherford Scattering 

How did Rutherford figure out the structure of the atom without being able 
to see it? Simulate the famous experiment in which he disproved the Plum 
Pudding model of the atom by observing alpha particles bouncing off 
atoms and determining that they must have a small core. 


https://phet.colorado.edu/sims/html/rutherford-scattering /latest/rutherford- 
scattering en.html 


Section Summary 


e Atoms are composed of negatively charged electrons, first proved to 
exist in cathode-ray-tube experiments, and a positively charged 
nucleus. 

e All electrons are identical and have a charge-to-mass ratio of 
Equation: 


de _ 1.76 x 10"! C/ke. 


e 


e The positive charge in the nuclei is carried by particles called protons, 
which have a charge-to-mass ratio of 
Equation: 


9.57 x 107 C/kg. 


Mp 


e Mass of electron, 
Equation: 


Me = 9.11 x 10°*! kg. 


e Mass of proton, 
Equation: 


ig = 1-67 6107" ke: 


e The planetary model of the atom pictures electrons orbiting the 
nucleus in the same way that planets orbit the sun. 


Conceptual Questions 


Exercise: 


Problem: 


What two pieces of evidence allowed the first calculation of m,, the 
mass of the electron? 


(a) The ratios ge /me and gp/Mp. 
(b) The values of g. and Ep. 
(c) The ratio ge /me- and qe. 


Justify your response. 


Exercise: 


Problem: 


How do the allowed orbits for electrons in atoms differ from the 
allowed orbits for planets around the sun? Explain how the 
correspondence principle applies here. 


Problem Exercises 


Exercise: 
Problem: 


Rutherford found the size of the nucleus to be about 10~*° m. This 
implied a huge density. What would this density be for gold? 


Solution: 


6 x 10” kg/m? 

Exercise: 
Problem: 
In Millikan’s oil-drop experiment, one looks at a small oil drop held 
motionless between two plates. Take the voltage between the plates to 
be 2033 V, and the plate separation to be 2.00 cm. The oil drop (of 


density 0.81 g/ cm”) has a diameter of 4.0 x 10~° m. Find the charge 
on the drop, in terms of electron units. 


Exercise: 
Problem: 
(a) An aspiring physicist wants to build a scale model of a hydrogen 


atom for her science fair project. If the atom is 1.00 m in diameter, 
how big should she try to make the nucleus? 


(b) How easy will this be to do? 


Solution: 
(a) 10.0 um 


(b) It isn’t hard to make one of approximately this size. It would be 
harder to make it exactly 10.0 um. 


Glossary 


cathode-ray tube 
a vacuum tube containing a source of electrons and a screen to view 
images 


planetary model of the atom 
the most familiar model or illustration of the structure of the atom 


Bohr’s Theory of the Hydrogen Atom 


e Describe the mysteries of atomic spectra. 

e Explain Bohr’s theory of the hydrogen atom. 

e Explain Bohr’s planetary model of the atom. 

e Illustrate energy state using the energy-level diagram. 
¢ Describe the triumphs and limits of Bohr’s theory. 


The great Danish physicist Niels Bohr (1885-1962) made immediate use of Rutherford’s 
planetary model of the atom. ([link]). Bohr became convinced of its validity and spent part of 
1912 at Rutherford’s laboratory. In 1913, after returning to Copenhagen, he began publishing 
his theory of the simplest atom, hydrogen, based on the planetary model of the atom. For 
decades, many questions had been asked about atomic characteristics. From their sizes to their 
spectra, much was known about atoms, but little had been explained in terms of the laws of 
physics. Bohr’s theory explained the atomic spectrum of hydrogen and established new and 
broadly applicable principles in quantum mechanics. 


Niels Bohr, Danish physicist, 
used the planetary model of the 
atom to explain the atomic 
spectrum and size of the 
hydrogen atom. His many 
contributions to the 
development of atomic physics 
and quantum mechanics, his 
personal influence on many 
students and colleagues, and his 
personal integrity, especially in 
the face of Nazi oppression, 
earned him a prominent place in 
history. (credit: Unknown 
Author, via Wikimedia 
Commons) 


Mysteries of Atomic Spectra 


As noted in Quantization of Energy , the energies of some small systems are quantized. 
Atomic and molecular emission and absorption spectra have been known for over a century to 
be discrete (or quantized). (See [link].) Maxwell and others had realized that there must be a 
connection between the spectrum of an atom and its structure, something like the resonant 
frequencies of musical instruments. But, in spite of years of efforts by many great minds, no 
one had a workable theory. (It was a running joke that any theory of atomic and molecular 
spectra could be destroyed by throwing a book of data at it, so complex were the spectra.) 
Following Einstein’s proposal of photons with quantized energies directly proportional to 
their wavelengths, it became even more evident that electrons in atoms can exist only in 
discrete orbits. 


Discharge tube 


Slit 


Gratin ‘ 
9 Photographic 
film or other 
detector 


(b) 


Part (a) shows, from left to right, a discharge tube, slit, 
and diffraction grating producing a line spectrum. Part 
(b) shows the emission line spectrum for iron. The 
discrete lines imply quantized energy states for the atoms 
that produce them. The line spectrum for each element is 
unique, providing a powerful and much used analytical 
tool, and many line spectra were well known for many 
years before they could be explained with physics. 
(credit for (b): Yttrium91, Wikimedia Commons) 


In some cases, it had been possible to devise formulas that described the emission spectra. As 
you might expect, the simplest atom—hydrogen, with its single electron—has a relatively 
simple spectrum. The hydrogen spectrum had been observed in the infrared (IR), visible, and 
ultraviolet (UV), and several series of spectral lines had been observed. (See [link].) These 
series are named after early researchers who studied them in particular depth. 


The observed hydrogen-spectrum wavelengths can be calculated using the following 
formula: 
Equation: 


where J is the wavelength of the emitted EM radiation and R is the Rydberg constant, 
determined by the experiment to be 
Equation: 


R = 1.097 x 10’/m (or m“’). 


The constant n¢ is a positive integer associated with a specific series. For the Lyman series, 
n¢ = 1; for the Balmer series, n¢ = 2; for the Paschen series, n¢ = 3; and so on. The Lyman 
series is entirely in the UV, while part of the Balmer series is visible with the remainder UV. 
The Paschen series and all the rest are entirely IR. There are apparently an unlimited number 
of series, although they lie progressively farther into the infrared and become difficult to 
observe as m¢ increases. The constant nj is a positive integer, but it must be greater than ng. 
Thus, for the Balmer series, n¢ = 2 and n; = 3, 4, 5, 6, .... Note that n; can approach 
infinity. While the formula in the wavelengths equation was just a recipe designed to fit data 
and was not based on physical principles, it did imply a deeper meaning. Balmer first devised 
the formula for his series alone, and it was later found to describe all the other series by using 
different values of ns. Bohr was the first to comprehend the deeper meaning. Again, we see 
the interplay between experiment and theory in physics. Experimentally, the spectra were well 
established, an equation was found to fit the experimental data, but the theoretical foundation 
was missing. 


Wavelength, 2 - 
cf E E E 3 
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A schematic of the hydrogen spectrum shows several 
series named for those who contributed most to their 
determination. Part of the Balmer series is in the visible 
spectrum, while the Lyman series is entirely in the UV, 
and the Paschen series and others are in the IR. Values of 
me and n; are shown for some of the lines. 


Example: 

Calculating Wave Interference of a Hydrogen Line 

What is the distance between the slits of a grating that produces a first-order maximum for 
the second Balmer line at an angle of 15°? 

Strategy and Concept 

For an Integrated Concept problem, we must first identify the physical principles involved. In 
this example, we need to know (a) the wavelength of light as well as (b) conditions for an 
interference maximum for the pattern from a double slit. Part (a) deals with a topic of the 
present chapter, while part (b) considers the wave interference material of Wave Optics. 
Solution for (a) 

Hydrogen spectrum wavelength. The Balmer series requires that n¢ = 2. The first line in 
the series is taken to be for n; = 3, and so the second would have n; = 4. 

The calculation is a straightforward application of the wavelength equation. Entering the 
determined values for n¢ and n; yields 


Equation: 
‘co 1 1 
t= (4-4) 
= (1.097 x 107m“) (4 - 4) 
= 2,057 x 10° me. 
Inverting to find A gives 
Equation: 
1 = -9 
A 2057x10°m2> 486 x 10 m 


= 486 nm. 


Discussion for (a) 

This is indeed the experimentally observed wavelength, corresponding to the second (blue- 
green) line in the Balmer series. More impressive is the fact that the same simple recipe 
predicts all of the hydrogen spectrum lines, including new ones observed in subsequent 
experiments. What is nature telling us? 

Solution for (b) 

Double-slit interference (Wave Optics). To obtain constructive interference for a double slit, 
the path length difference from two slits must be an integral multiple of the wavelength. This 
condition was expressed by the equation 

Equation: 


dsin9=m4A, 


where d is the distance between slits and 6 is the angle from the original direction of the 
beam. The number m is the order of the interference; m = 1 in this example. Solving for d 
and entering known values yields 

Equation: 


1)(486 
Ape eon) ss 10-9 ml 
sin 15° 


Discussion for (b) 
This number is similar to those used in the interference examples of Introduction to Quantum 
Physics (and is close to the spacing between slits in commonly used diffraction glasses). 


Bohr’s Solution for Hydrogen 


Bohr was able to derive the formula for the hydrogen spectrum using basic physics, the 
planetary model of the atom, and some very important new proposals. His first proposal is 
that only certain orbits are allowed: we say that the orbits of electrons in atoms are quantized. 
Each orbit has a different energy, and electrons can move to a higher orbit by absorbing 
energy and drop to a lower orbit by emitting energy. If the orbits are quantized, the amount of 
energy absorbed or emitted is also quantized, producing discrete spectra. Photon absorption 
and emission are among the primary methods of transferring energy into and out of atoms. 
The energies of the photons are quantized, and their energy is explained as being equal to the 
change in energy of the electron when it moves from one orbit to another. In equation form, 
this is 

Equation: 


AE =hf = E,— E;. 


Here, AF is the change in energy between the initial and final orbits, and hf is the energy of 
the absorbed or emitted photon. It is quite logical (that is, expected from our everyday 
experience) that energy is involved in changing orbits. A blast of energy is required for the 
space shuttle, for example, to climb to a higher orbit. What is not expected is that atomic 
orbits should be quantized. This is not observed for satellites or planets, which can have any 
orbit given the proper energy. (See [link].) 


AE =6,- & =hf 


The planetary model of 
the atom, as modified by 
Bohr, has the orbits of the 
electrons quantized. Only 
certain orbits are allowed, 

explaining why atomic 

spectra are discrete 

(quantized). The energy 

carried away from an 
atom by a photon comes 
from the electron 
dropping from one 
allowed orbit to another 
and is thus quantized. 
This is likewise true for 
atomic absorption of 
photons. 


[link] shows an energy-level diagram, a convenient way to display energy states. In the 
present discussion, we take these to be the allowed energy levels of the electron. Energy is 
plotted vertically with the lowest or ground state at the bottom and with excited states above. 
Given the energies of the lines in an atomic spectrum, it is possible (although sometimes very 
difficult) to determine the energy levels of an atom. Energy-level diagrams are used for many 
systems, including molecules and nuclei. A theory of the atom or any other system must 
predict its energies based on the physics of the system. 
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An energy-level diagram 
plots energy vertically 
and is useful in 
visualizing the energy 
states of a system and the 
transitions between them. 
This diagram is for the 
hydrogen-atom electrons, 
showing a transition 
between two orbits 
having energies #4 and 
Eo. 


Bohr was clever enough to find a way to calculate the electron orbital energies in hydrogen. 
This was an important first step that has been improved upon, but it is well worth repeating 
here, because it does correctly describe many characteristics of hydrogen. Assuming circular 
orbits, Bohr proposed that the angular momentum L of an electron in its orbit is 
quantized, that is, it has only specific, discrete values. The value for L is given by the 
formula 

Equation: 


h 
L= Wi, = n—(n = 1, 2, 3, <<<), 
20 


where L is the angular momentum, m, is the electron’s mass, 7, is the radius of the n th 
orbit, and h is Planck’s constant. Note that angular momentum is L = Jw. For a small object 
at aradius r, I = mr? and w = v/r, so that L = (mr?) (v/r) = mur. Quantization says 
that this value of mur can only be equal to h/2, 2h/2, 3h/2, etc. At the time, Bohr himself 
did not know why angular momentum should be quantized, but using this assumption he was 


able to calculate the energies in the hydrogen spectrum, something no one else had done at the 
time. 


From Bohr’s assumptions, we will now derive a number of important properties of the 
hydrogen atom from the classical physics we have covered in the text. We start by noting the 
centripetal force causing the electron to follow a circular path is supplied by the Coulomb 
force. To be more general, we note that this analysis is valid for any single-electron atom. So, 
if anucleus has Z protons (Z = 1 for hydrogen, 2 for helium, etc.) and only one electron, that 
atom is called a hydrogen-like atom. The spectra of hydrogen-like ions are similar to 
hydrogen, but shifted to higher energy by the greater attractive force between the electron and 
nucleus. The magnitude of the centripetal force is mev2/Tn, while the Coulomb force is 
k(Zq,)(qe) /r2. The tacit assumption here is that the nucleus is more massive than the 
stationary electron, and the electron orbits about it. This is consistent with the planetary model 
of the atom. Equating these, 

Equation: 


Zq. mv" 


k 


= (Coulomb = centripetal). 
re Tn 


Angular momentum quantization is stated in an earlier equation. We solve that equation for v, 
substitute it into the above, and rearrange the expression to obtain the radius of the orbit. This 
yields: 

Equation: 


2 
i= as for allowed orbits(n = 1,2,3,...), 


where ag is defined to be the Bohr radius, since for the lowest orbit (x = 1) and for 
hydrogen (Z = 1), r; = ag. It is left for this chapter’s Problems and Exercises to show that 
the Bohr radius is 

Equation: 


h2 
agn= 


== > = 0.529 x 10-9 m. 
An°mekqe 


These last two equations can be used to calculate the radii of the allowed (quantized) 
electron orbits in any hydrogen-like atom. It is impressive that the formula gives the correct 
size of hydrogen, which is measured experimentally to be very close to the Bohr radius. The 
earlier equation also tells us that the orbital radius is proportional to 7, as illustrated in [link]. 


I 


The allowed electron orbits in 
hydrogen have the radii shown. 
These radii were first calculated 

by Bohr and are given by the 

equation r, = 2 ap. The 
lowest orbit has the 
experimentally verified 
diameter of a hydrogen atom. 


To get the electron orbital energies, we start by noting that the electron energy is the sum of 
its kinetic and potential energy: 
Equation: 


E, = KE+ PE. 


Kinetic energy is the familiar KE = (1/2)m,v?, assuming the electron is not moving at 


relativistic speeds. Potential energy for the electron is electrical, or PE = q.V, where V is the 
potential due to the nucleus, which looks like a point charge. The nucleus has a positive 
charge Zq, ; thus, V = kZq,/rn, recalling an earlier equation for the potential due to a point 


charge. Since the electron’s charge is negative, we see that PE = —kZq,/r,. Entering the 
expressions for KE and PE, we find 
Equation: 
1 Zq 
E, = —m.v? —k a 
2 Tx 


Now we substitute r,, and v from earlier equations into the above expression for energy. 
Algebraic manipulation yields 


Equation: 


Z2 
En — — Ty Boln — 1, 2, Ds see) 


for the orbital energies of hydrogen-like atoms. Here, / is the ground-state energy 
(n = 1) for hydrogen (Z = 1) and is given by 


Equation: 
Qn q?m._k? 
Ey = 2 = 13.6 eV 
Thus, for hydrogen, 
Equation: 
13.6 eV 
E> = 72 (S128) 


[link] shows an energy-level diagram for hydrogen that also illustrates how the various 
spectral series for hydrogen are related to transitions between energy levels. 
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Energy-level diagram for 


hydrogen showing the 
Lyman, Balmer, and 
Paschen series of 
transitions. The orbital 
energies are calculated 
using the above equation, 
first derived by Bohr. 


Electron total energies are negative, since the electron is bound to the nucleus, analogous to 
being in a hole without enough kinetic energy to escape. As n approaches infinity, the total 
energy becomes zero. This corresponds to a free electron with no kinetic energy, since r,, gets 
very large for large n, and the electric potential energy thus becomes zero. Thus, 13.6 eV is 
needed to ionize hydrogen (to go from —13.6 eV to 0, or unbound), an experimentally verified 
number. Given more energy, the electron becomes unbound with some kinetic energy. For 
example, giving 15.0 eV to an electron in the ground state of hydrogen strips it from the atom 
and leaves it with 1.4 eV of kinetic energy. 


Finally, let us consider the energy of a photon emitted in a downward transition, given by the 
equation to be 
Equation: 


AE =hf = EF, — Ex. 


Substituting E,, = (- 13.6 eV/n7), we see that 
Equation: 


Dividing both sides of this equation by hc gives an expression for 1/2: 
Equation: 


he Cc A he 


hf f 1 weed (2 | 


It can be shown that 
Equation: 


13.6 eV) (1.602 x 10~!9 J/eV 
(Ae) - ( )( /eV) ~1,097 x 107m =R 
he (6.626 x 10-*4 J-s) (2.998 x 10° m/s) 


is the Rydberg constant. Thus, we have used Bohr’s assumptions to derive the formula first 
proposed by Balmer years earlier as a recipe to fit experimental data. 
Equation: 


We see that Bohr’s theory of the hydrogen atom answers the question as to why this 
previously known formula describes the hydrogen spectrum. It is because the energy levels 
are proportional to 1/ n”, where n is a non-negative integer. A downward transition releases 
energy, and so n; must be greater than n¢. The various series are those where the transitions 
end on a certain level. For the Lyman series, ns = 1 — that is, all the transitions end in the 
ground state (see also [link]). For the Balmer series, n¢ = 2, or all the transitions end in the 
first excited state; and so on. What was once a recipe is now based in physics, and something 
new is emerging—angular momentum is quantized. 


Triumphs and Limits of the Bohr Theory 


Bohr did what no one had been able to do before. Not only did he explain the spectrum of 
hydrogen, he correctly calculated the size of the atom from basic physics. Some of his ideas 
are broadly applicable. Electron orbital energies are quantized in all atoms and molecules. 
Angular momentum is quantized. The electrons do not spiral into the nucleus, as expected 
classically (accelerated charges radiate, so that the electron orbits classically would decay 
quickly, and the electrons would sit on the nucleus—matter would collapse). These are major 
triumphs. 


But there are limits to Bohr’s theory. It cannot be applied to multielectron atoms, even one as 
simple as a two-electron helium atom. Bohr’s model is what we call semiclassical. The orbits 
are quantized (nonclassical) but are assumed to be simple circular paths (classical). As 
quantum mechanics was developed, it became clear that there are no well-defined orbits; 
rather, there are clouds of probability. Bohr’s theory also did not explain that some spectral 
lines are doublets (split into two) when examined closely. We shall examine many of these 
aspects of quantum mechanics in more detail, but it should be kept in mind that Bohr did not 
fail. Rather, he made very important steps along the path to greater knowledge and laid the 
foundation for all of atomic physics that has since evolved. 


Note: 

PhET Explorations: Models of the Hydrogen Atom 

How did scientists figure out the structure of atoms without looking at them? Try out 
different models by shooting light at the atom. Check how the prediction of the model 


matches the experimental results. 


https://archive.cnx.org/specials/d77cc1d0-33e4-11e6-b016-6726afecd2be/hydrogen- 
atom/#sim-hydrogen-atom 


Section Summary 


e The planetary model of the atom pictures electrons orbiting the nucleus in the way that 
planets orbit the sun. Bohr used the planetary model to develop the first reasonable 
theory of hydrogen, the simplest atom. Atomic and molecular spectra are quantized, with 
hydrogen spectrum wavelengths given by the formula 


Equation: 
Hn 1 1 
XE ne)’ 


where J is the wavelength of the emitted EM radiation and R is the Rydberg constant, 
which has the value 
Equation: 


R= 1097 x 10" ta 


e The constants n; and n¢ are positive integers, and n; must be greater than 7¢. 

¢ Bohr correctly proposed that the energy and radii of the orbits of electrons in atoms are 
quantized, with energy for transitions between orbits given by 
Equation: 


AE =hf =E,— E;, 


where AF is the change in energy between the initial and final orbits and hf is the 
energy of an absorbed or emitted photon. It is useful to plot orbital energies on a vertical 
graph called an energy-level diagram. 

¢ Bohr proposed that the allowed orbits are circular and must have quantized orbital 
angular momentum given by 
Equation: 


h 
L=m.ovr, = n—(n=1,2,3...), 
27 


where L is the angular momentum, r,, is the radius of the nth orbit, and h is Planck’s 
constant. For all one-electron (hydrogen-like) atoms, the radius of an orbit is given by 
Equation: 


2 
= 5 an(allowed orbits n = 1, 2, 3, we) 


Z is the atomic number of an element (the number of electrons is has when neutral) and 
ap is defined to be the Bohr radius, which is 
Equation: 


h2 


= ata hae = 0.529 x 16” m. 
TMT MNe KG, 


aB 


¢ Furthermore, the energies of hydrogen-like atoms are given by 
Equation: 


Z2 
En = — 73 Eo(n = lis Zs 3 eal, 


where jf, is the ground-state energy and is given by 


Equation: 

2n?q4m,k? 

Ey = ————_ = 13.-6.eV. 
h2 

Thus, for hydrogen, 
Equation: 

13.6 eV 

Ey = og ie 1, 2 3 she 


e The Bohr Theory gives accurate values for the energy levels in hydrogen-like atoms, but 
it has been improved upon in several respects. 


Conceptual Questions 


Exercise: 
Problem: 
How do the allowed orbits for electrons in atoms differ from the allowed orbits for 
planets around the sun? Explain how the correspondence principle applies here. 
Exercise: 
Problem: 
Explain how Bohr’s rule for the quantization of electron orbital angular momentum 
differs from the actual rule. 


Exercise: 


Problem: 
What is a hydrogen-like atom, and how are the energies and radii of its electron orbits 
related to those in hydrogen? 

Problems & Exercises 


Exercise: 


Problem: 


By calculating its wavelength, show that the first line in the Lyman series is UV 
radiation. 


Solution: 


4=R(4- 4) d= 4[ S27 |i = 2,0 =1, sorhat 


ne Ne 


2 
A= Cae a = 1.22 x 10-7 m = 122 nm, which is UV radiation. 


Exercise: 
Problem: 
Find the wavelength of the third line in the Lyman series, and identify the type of EM 
radiation. 

Exercise: 


Problem: 


Look up the values of the quantities inag = and verify that the Bohr radius 


4n?mkq@ ’ 
ap is 0.529 x 10719 m. 
Solution: 
es h2 = (6.626 x 10-4 J-s)? — 0.529 x 10-29 m 
BT “armekZg  4n2(9.109x10-*! kg)(8.988x 109 N-m?/C?)(1)(1.602x10-9 GC)? 
Exercise: 

: F ‘ 2n2q4m,k? 

Problem: Verify that the ground state energy Ep is 13.6 eV by using By = —~-— 


Exercise: 


Problem: 


If a hydrogen atom has its electron in the n = 4 state, how much energy in eV is needed 
to ionize it? 


Solution: 


0.850 eV 
Exercise: 
Problem: 
A hydrogen atom in an excited state can be ionized with less energy than when it is in its 
ground state. What is n for a hydrogen atom if 0.850 eV of energy can ionize it? 
Exercise: 


Problem: 
Find the radius of a hydrogen atom in the n = 2 state according to Bohr’s theory. 
Solution: 


2:12 X10°) mm 
Exercise: 
Problem: 
Show that (13.6 eV) /hc = 1.097 x 10’ m = R (Rydberg’s constant), as discussed in 
the text. 
Exercise: 
Problem: 


What is the smallest-wavelength line in the Balmer series? Is it in the visible part of the 
spectrum? 


Solution: 
365 nm 


It is in the ultraviolet. 
Exercise: 
Problem: 


Show that the entire Paschen series is in the infrared part of the spectrum. To do this, you 
only need to calculate the shortest wavelength in the series. 


Exercise: 


Problem: 


Do the Balmer and Lyman series overlap? To answer this, calculate the shortest- 
wavelength Balmer line and the longest-wavelength Lyman line. 


Solution: 
No overlap 
365 nm 


122 nm 
Exercise: 


Problem: 
(a) Which line in the Balmer series is the first one in the UV part of the spectrum? 
(b) How many Balmer series lines are in the visible part of the spectrum? 


(c) How many are in the UV? 
Exercise: 
Problem: 


A wavelength of 4.653 pm is observed in a hydrogen spectrum for a transition that ends 
in the ng = 5 level. What was n; for the initial level of the electron? 


Solution: 


7 
Exercise: 
Problem: 
A singly ionized helium ion has only one electron and is denoted He*. What is the ion’s 
radius in the ground state compared to the Bohr radius of hydrogen atom? 
Exercise: 
Problem: 


A beryllium ion with a single electron (denoted Be®*) is in an excited state with radius 
the same as that of the ground state of hydrogen. 


(a) What is n for the Be** ion? 


(b) How much energy in eV is needed to ionize the ion from this excited state? 


Solution: 
(a) 2 


(b) 54.4 eV 
Exercise: 
Problem: 


Atoms can be ionized by thermal collisions, such as at the high temperatures found in the 
solar corona. One such ion is C*°, a carbon atom with only a single electron. 


(a) By what factor are the energies of its hydrogen-like levels greater than those of 
hydrogen? 


(b) What is the wavelength of the first line in this ion’s Paschen series? 


(c) What type of EM radiation is this? 
Exercise: 


Problem: 


2 2 
Verify Equations r, = 7ap and ag = ae = 0.529 x 10-1 m using the 


approach stated in the text. That is, equate the Coulomb and centripetal forces and then 
insert an expression for velocity from the condition for angular momentum quantization. 


Solution: 
kZqe _ meV? _ kZq _ kZqe 1 ; och 
"Gr Ta a Oe that r, = VE Hs VRS From the equation mur, = n>—, we 
F ‘ ee te kZ@e An? mr? 
can substitute for the velocity, giving: r, = A - p27 SO that 
a — ap, where ap = —"— 
0 Z Armekg 4 Be a An’mekq? 
Exercise: 
Problem: 


The wavelength of the four Balmer series lines for hydrogen are found to be 410.3, 
434.2, 486.3, and 656.5 nm. What average percentage difference is found between these 


wavelength numbers and those predicted by - = R(+ — +) ? It is amazing how well 
f£ i 


a simple formula (disconnected originally from theory) could duplicate this 
phenomenon. 


Glossary 


hydrogen spectrum wavelengths 
the wavelengths of visible light from hydrogen; can be calculated by 
2S) pa oem ae 
N= nm one 

f i 

Rydberg constant 
a physical constant related to the atomic spectra with an established value of 
1.097 x 107 m™ 


double-slit interference 
an experiment in which waves or particles from a single source impinge upon two slits 
so that the resulting interference pattern may be observed 


energy-level diagram 
a diagram used to analyze the energy level of electrons in the orbits of an atom 


Bohr radius 
the mean radius of the orbit of an electron around the nucleus of a hydrogen atom in its 
ground state 


hydrogen-like atom 
any atom with only a single electron 


energies of hydrogen-like atoms 
Bohr formula for energies of electron states in hydrogen-like atoms: 


E, = —4% Ep(n = 1,2, 3, ...) 


X Rays: Atomic Origins and Applications 


e Define x-ray tube and its spectrum. 

e Show the x-ray characteristic energy. 

¢ Specify the use of x rays in medical observations. 

e Explain the use of x rays in CT scanners in diagnostics. 


Each type of atom (or element) has its own characteristic electromagnetic 
spectrum. X rays lie at the high-frequency end of an atom’s spectrum and 
are characteristic of the atom as well. In this section, we explore 
characteristic x rays and some of their important applications. 


We have previously discussed x rays as a part of the electromagnetic 
spectrum in Photon Energies and the Electromagnetic Spectrum. That 
module illustrated how an x-ray tube (a specialized CRT) produces x rays. 
Electrons emitted from a hot filament are accelerated with a high voltage, 
gaining significant kinetic energy and striking the anode. 


There are two processes by which x rays are produced in the anode of an x- 
ray tube. In one process, the deceleration of electrons produces x rays, and 
these x rays are called bremsstrahlung, or braking radiation. The second 
process is atomic in nature and produces characteristic x rays, so called 
because they are characteristic of the anode material. The x-ray spectrum in 
[link] is typical of what is produced by an x-ray tube, showing a broad 
curve of bremsstrahlung radiation with characteristic x-ray peaks on it. 


~ 


X-ray intensity 


F max f 
qv = hf max 


X-ray spectrum obtained 
when energetic electrons 
strike a material, such as 


in the anode of a CRT. 
The smooth part of the 
spectrum is 
bremsstrahlung radiation, 
while the peaks are 
characteristic of the 
anode material. A 
different anode material 
would have characteristic 
x-ray peaks at different 
frequencies. 


The spectrum in [link] is collected over a period of time in which many 
electrons strike the anode, with a variety of possible outcomes for each hit. 
The broad range of x-ray energies in the bremsstrahlung radiation indicates 
that an incident electron’s energy is not usually converted entirely into 
photon energy. The highest-energy x ray produced is one for which all of 
the electron’s energy was converted to photon energy. Thus the accelerating 
voltage and the maximum x-ray energy are related by conservation of 
energy. Electric potential energy is converted to kinetic energy and then to 
photon energy, so that Emax = hf max = deV. Units of electron volts are 
convenient. For example, a 100-kV accelerating voltage produces x-ray 
photons with a maximum energy of 100 keV. 


Some electrons excite atoms in the anode. Part of the energy that they 
deposit by collision with an atom results in one or more of the atom’s inner 
electrons being knocked into a higher orbit or the atom being ionized. When 
the anode’s atoms de-excite, they emit characteristic electromagnetic 
radiation. The most energetic of these are produced when an inner-shell 
vacancy is filled—that is, when an n = 1 or n = 2 shell electron has been 
excited to a higher level, and another electron falls into the vacant spot. A 
characteristic x ray (see Photon Energies and the Electromagnetic 
Spectrum) is electromagnetic (EM) radiation emitted by an atom when an 
inner-shell vacancy is filled. [link] shows a representative energy-level 
diagram that illustrates the labeling of characteristic x rays. X rays created 


when an electron falls into ann = 1 shell vacancy are called AK when they 
come from the next higher level; that is, an nm = 2 to n = 1 transition. The 
labels kK, L, M,... come from the older alphabetical labeling of shells 
starting with K rather than using the principal quantum numbers 1, 2, 3, .... 
A more energetic Kg x ray is produced when an electron falls into an 

nm = 1 shell vacancy from the n = 8 shell; that is, ann = 3ton = 1 
transition. Similarly, when an electron falls into the nm = 2 shell from the 

nm = 3 shell, an La x ray is created. The energies of these x rays depend on 
the energies of electron states in the particular atom and, thus, are 
characteristic of that element: every element has it own set of x-ray 
energies. This property can be used to identify elements, for example, to 
find trace (small) amounts of an element in an environmental or biological 
sample. 


Ly 


K n=1 


A characteristic x ray is 
emitted when an electron 
fills an inner-shell 
vacancy, as shown for 
several transitions in this 
approximate energy level 
diagram for a multiple- 
electron atom. 
Characteristic x rays are 
labeled according to the 
shell that had the vacancy 
and the shell from which 


the electron came. A Ky 
x ray, for example, is 
produced when an 
electron coming from the 
n = 2 shell fills the 
nm = 1 shell vacancy. 


Example: 

Characteristic X-Ray Energy 

Calculate the approximate energy of a K, x ray from a tungsten anode in 
an x-ray tube. 

Strategy 

How do we calculate energies in a multiple-electron atom? In the case of 
characteristic x rays, the following approximate calculation is reasonable. 
Characteristic x rays are produced when an inner-shell vacancy is filled. 
Inner-shell electrons are nearer the nucleus than others in an atom and thus 
feel little net effect from the others. This is similar to what happens inside a 
charged conductor, where its excess charge is distributed over the surface 
so that it produces no electric field inside. It is reasonable to assume the 
inner-shell electrons have hydrogen-like energies, as given by 

Nope ~ 2 Eo(n = 1, 2,3, ...). As noted, a K, x ray is produced by an 
n = 2 ton = I transition. Since there are two electrons in a filled AK shell, 
a vacancy would leave one electron, so that the effective charge would be 
Z — 1 rather than Z. For tungsten, Z = 74, so that the effective charge is 
73. 

Solution 

E, = —- Z_ Ey (n = 1, 2, 3, ...) gives the orbital energies for hydrogen- 
like atoms to be EF, = —(Z?/n?)Eo, where Ky = 13.6 eV. As noted, the 
effective Z is 73. Now the K, x-ray energy is given by 

Equation: 


Ex, = AE = Ej — Ey = Ey — FE, 


where 


Equation: 

he 73? 
and 
Equation: 

Yh G3 
Thus, 
Equation: 

Ex, = —18.1 keV — (—72.5 keV) = 54.4 keV. 

Discussion 


This large photon energy is typical of characteristic x rays from heavy 
elements. It is large compared with other atomic emissions because it is 
produced when an inner-shell vacancy is filled, and inner-shell electrons 
are tightly bound. Characteristic x ray energies become progressively 
larger for heavier elements because their energy increases approximately as 
Z*. Significant accelerating voltage is needed to create these inner-shell 
vacancies. In the case of tungsten, at least 72.5 kV is needed, because other 
shells are filled and you cannot simply bump one electron to a higher filled 
shell. Tungsten is a common anode material in x-ray tubes; so much of the 
energy of the impinging electrons is absorbed, raising its temperature, that 
a high-melting-point material like tungsten is required. 


Medical and Other Diagnostic Uses of X-rays 


All of us can identify diagnostic uses of x-ray photons. Among these are the 
universal dental and medical x rays that have become an essential part of 
medical diagnostics. (See [link] and [link].) X rays are also used to inspect 


our luggage at airports, as shown in [link], and for early detection of cracks 
in crucial aircraft components. An x ray is not only a noun meaning high- 
energy photon, it also is an image produced by x rays, and it has been made 
into a familiar verb—to be x-rayed. 


An x-ray image reveals 
fillings in a person’s 
teeth. (credit: Dmitry G, 
Wikimedia Commons) 


This x-ray image of 
a person’s chest 
shows many 
details, including 
an artificial 
pacemaker. (credit: 
Sunzi99, 


Wikimedia 
Commons) 


This x-ray image 
shows the contents of 
a piece of luggage. 
The denser the 
material, the darker 
the shadow. (credit: 
IDuke, Wikimedia 
Commons) 


The most common x-ray images are simple shadows. Since x-ray photons 
have high energies, they penetrate materials that are opaque to visible light. 
The more energy an x-ray photon has, the more material it will penetrate. 
So an x-ray tube may be operated at 50.0 kV for a chest x ray, whereas it 
may need to be operated at 100 kV to examine a broken leg in a cast. The 
depth of penetration is related to the density of the material as well as to the 
energy of the photon. The denser the material, the fewer x-ray photons get 
through and the darker the shadow. Thus x rays excel at detecting breaks in 
bones and in imaging other physiological structures, such as some tumors, 
that differ in density from surrounding material. Because of their high 
photon energy, x rays produce significant ionization in materials and 


damage cells in biological organisms. Modern uses minimize exposure to 
the patient and eliminate exposure to others. Biological effects of x rays 
will be explored in the next chapter along with other types of ionizing 
radiation such as those produced by nuclei. 


As the x-ray energy increases, the Compton effect (see Photon Momentum) 
becomes more important in the attenuation of the x rays. Here, the x ray 
scatters from an outer electron shell of the atom, giving the ejected electron 
some kinetic energy while losing energy itself. The probability for 
attenuation of the x rays depends upon the number of electrons present (the 
material’s density) as well as the thickness of the material. Chemical 
composition of the medium, as characterized by its atomic number Z, is not 
important here. Low-energy x rays provide better contrast (sharper images). 
However, due to greater attenuation and less scattering, they are more 
absorbed by thicker materials. Greater contrast can be achieved by injecting 
a substance with a large atomic number, such as barium or iodine. The 
structure of the part of the body that contains the substance (e.g., the gastro- 
intestinal tract or the abdomen) can easily be seen this way. 


Breast cancer is the second-leading cause of death among women 
worldwide. Early detection can be very effective, hence the importance of 
x-ray diagnostics. A mammogram cannot diagnose a malignant tumor, only 
give evidence of a lump or region of increased density within the breast. X- 
ray absorption by different types of soft tissue is very similar, so contrast is 
difficult; this is especially true for younger women, who typically have 
denser breasts. For older women who are at greater risk of developing 
breast cancer, the presence of more fat in the breast gives the lump or tumor 
more contrast. MRI (Magnetic resonance imaging) has recently been used 
as a Supplement to conventional x rays to improve detection and eliminate 
false positives. The subject’s radiation dose from x rays will be treated in a 
later chapter. 


A standard x ray gives only a two-dimensional view of the object. Dense 
bones might hide images of soft tissue or organs. If you took another x ray 
from the side of the person (the first one being from the front), you would 
gain additional information. While shadow images are sufficient in many 
applications, far more sophisticated images can be produced with modern 


technology. [link] shows the use of a computed tomography (CT) scanner, 
also called computed axial tomography (CAT) scanner. X rays are passed 
through a narrow section (called a slice) of the patient’s body (or body part) 
over a range of directions. An array of many detectors on the other side of 
the patient registers the x rays. The system is then rotated around the patient 
and another image is taken, and so on. The x-ray tube and detector array are 
mechanically attached and so rotate together. Complex computer image 
processing of the relative absorption of the x rays along different directions 
produces a highly-detailed image. Different slices are taken as the patient 
moves through the scanner on a table. Multiple images of different slices 
can also be computer analyzed to produce three-dimensional information, 
sometimes enhancing specific types of tissue, as shown in [link]. G. 
Hounsfield (UK) and A. Cormack (US) won the Nobel Prize in Medicine in 
1979 for their development of computed tomography. 


A patient being 
positioned in a CT 
scanner aboard the 


hospital ship USNS 
Mercy. The CT 
scanner passes x rays 
through slices of the 
patient’s body (or 
body part) over a 
range of directions. 
The relative 
absorption of the x 
rays along different 


directions is computer 
analyzed to produce 
highly detailed 
images. Three- 
dimensional 
information can be 
obtained from multiple 
Slices. (credit: Rebecca 
Moat, U.S. Navy) 


This three-dimensional 
image of a skull was 
produced by computed 
tomography, involving 
analysis of several x- 
ray Slices of the head. 
(credit: Emailshankar, 
Wikimedia Commons) 


X-Ray Diffraction and Crystallography 


Since x-ray photons are very energetic, they have relatively short 
wavelengths. For example, the 54.4-keV K, x ray of [link] has a 


wavelength \ = hc/F = 0.0228 nm. Thus, typical x-ray photons act like 
rays when they encounter macroscopic objects, like teeth, and produce 
sharp shadows; however, since atoms are on the order of 0.1 nm in size, x 
rays can be used to detect the location, shape, and size of atoms and 
molecules. The process is called x-ray diffraction, because it involves the 
diffraction and interference of x rays to produce patterns that can be 
analyzed for information about the structures that scattered the x rays. 
Perhaps the most famous example of x-ray diffraction is the discovery of 
the double-helix structure of DNA in 1953 by an international team of 
scientists working at the Cavendish Laboratory—American James Watson, 
Englishman Francis Crick, and New Zealand—born Maurice Wilkins. Using 
x-ray diffraction data produced by Rosalind Franklin, they were the first to 
discern the structure of DNA that is so crucial to life. For this, Watson, 
Crick, and Wilkins were awarded the 1962 Nobel Prize in Physiology or 
Medicine. There is much debate and controversy over the issue that 
Rosalind Franklin was not included in the prize. 


[link] shows a diffraction pattern produced by the scattering of x rays from 
a crystal. This process is known as x-ray crystallography because of the 
information it can yield about crystal structure, and it was the type of data 
Rosalind Franklin supplied to Watson and Crick for DNA. Not only do x 
rays confirm the size and shape of atoms, they give information on the 
atomic arrangements in materials. For example, current research in high- 
temperature superconductors involves complex materials whose lattice 
arrangements are crucial to obtaining a superconducting material. These can 
be studied using x-ray crystallography. 


X-ray diffraction from 
the crystal of a protein, 
hen egg lysozyme, 
produced this 
interference pattern. 
Analysis of the pattern 
yields information 
about the structure of 
the protein. (credit: 
Del45, Wikimedia 
Commons) 


Historically, the scattering of x rays from crystals was used to prove that x 
rays are energetic EM waves. This was suspected from the time of the 
discovery of x rays in 1895, but it was not until 1912 that the German Max 
von Laue (1879-1960) convinced two of his colleagues to scatter x rays 
from crystals. If a diffraction pattern is obtained, he reasoned, then the x 
rays must be waves, and their wavelength could be determined. (The 
spacing of atoms in various crystals was reasonably well known at the time, 
based on good values for Avogadro’s number.) The experiments were 
convincing, and the 1914 Nobel Prize in Physics was given to von Laue for 
his suggestion leading to the proof that x rays are EM waves. In 1915, the 
unique father-and-son team of Sir William Henry Bragg and his son Sir 
William Lawrence Bragg were awarded a joint Nobel Prize for inventing 


the x-ray spectrometer and the then-new science of x-ray analysis. The 
elder Bragg had migrated to Australia from England just after graduating in 
mathematics. He learned physics and chemistry during his career at the 
University of Adelaide. The younger Bragg was born in Adelaide but went 
back to the Cavendish Laboratories in England to a career in x-ray and 
neutron crystallography; he provided support for Watson, Crick, and 
Wilkins for their work on unraveling the mysteries of DNA and to Max 
Perutz for his 1962 Nobel Prize-winning work on the structure of 
hemoglobin. Here again, we witness the enabling nature of physics— 
establishing instruments and designing experiments as well as solving 
mysteries in the biomedical sciences. 


Certain other uses for x rays will be studied in later chapters. X rays are 
useful in the treatment of cancer because of the inhibiting effect they have 
on cell reproduction. X rays observed coming from outer space are useful in 
determining the nature of their sources, such as neutron stars and possibly 
black holes. Created in nuclear bomb explosions, x rays can also be used to 
detect clandestine atmospheric tests of these weapons. X rays can cause 
excitations of atoms, which then fluoresce (emitting characteristic EM 
radiation), making x-ray-induced fluorescence a valuable analytical tool in 
a range of fields from art to archaeology. 


Section Summary 


e X rays are relatively high-frequency EM radiation. They are produced 
by transitions between inner-shell electron levels, which produce x 
rays characteristic of the atomic element, or by decelerating electrons. 

e X rays have many uses, including medical diagnostics and x-ray 
diffraction. 


Conceptual Questions 


Exercise: 


Problem: 
Explain why characteristic x rays are the most energetic in the EM 
emission spectrum of a given element. 
Exercise: 
Problem: 
Why does the energy of characteristic x rays become increasingly 
greater for heavier atoms? 
Exercise: 
Problem: 
Observers at a safe distance from an atmospheric test of a nuclear 


bomb feel its heat but receive none of its copious x rays. Why is air 
opaque to x rays but transparent to infrared? 


Exercise: 
Problem: 
Lasers are used to burn and read CDs. Explain why a laser that emits 


blue light would be capable of burning and reading more information 
than one that emits infrared. 


Exercise: 


Problem: 


Crystal lattices can be examined with x rays but not UV. Why? 
Exercise: 


Problem: 


CT scanners do not detect details smaller than about 0.5 mm. Is this 
limitation due to the wavelength of x rays? Explain. 


Problem Exercises 


Exercise: 
Problem: 
(a) What is the shortest-wavelength x-ray radiation that can be 
generated in an x-ray tube with an applied voltage of 50.0 kV? (b) 


Calculate the photon energy in eV. (c) Explain the relationship of the 
photon energy to the applied voltage. 


Solution: 
(a) 0.248 x 10°-1° m 
(b) 50.0 keV 


(c) The photon energy is simply the applied voltage times the electron 
charge, so the value of the voltage in volts is the same as the value of 
the energy in electron volts. 


Exercise: 
Problem: 
A color television tube also generates some x rays when its electron 
beam strikes the screen. What is the shortest wavelength of these x 
rays, if a 30.0-kV potential is used to accelerate the electrons? (Note 


that TVs have shielding to prevent these x rays from exposing 
viewers. ) 


Exercise: 


Problem: 


An x ray tube has an applied voltage of 100 kV. (a) What is the most 
energetic x-ray photon it can produce? Express your answer in electron 
volts and joules. (b) Find the wavelength of such an X-ray. 


Solution: 


(a) 100 x 10° eV, 1.60 x 10°14 J 


(b) 0.124 x 10°19 m 
Exercise: 
Problem: 
The maximum characteristic x-ray photon energy comes from the 
capture of a free electron into a K shell vacancy. What is this photon 


energy in keV for tungsten, assuming the free electron has no initial 
kinetic energy? 


Exercise: 


Problem: 


What are the approximate energies of the A, and Kg x rays for 
copper? 


Solution: 
(a) 8.00 keV 


(b) 9.48 keV 


Glossary 


X rays 
a form of electromagnetic radiation 


x-ray diffraction 
a technique that provides the detailed information about 
crystallographic structure of natural and manufactured materials 


Introduction to Radioactivity and Nuclear Physics 
class="introduction" 


e Define radioactivity. 


The 
synchrotron 
source 
produces 
electromagneti 
c radiation, as 
evident from 
the visible 
glow. (credit: 
United States 
Department of 
Energy, via 
Wikimedia 
Commons) 


synchrotron) 


Source 
ee 


electromagnetic radiation 


There is an ongoing quest to find substructures of matter. At one time, it 
was thought that atoms would be the ultimate substructure, but just when 
the first direct evidence of atoms was obtained, it became clear that they 
have a substructure and a tiny nucleus. The nucleus itself has spectacular 
characteristics. For example, certain nuclei are unstable, and their decay 
emits radiations with energies millions of times greater than atomic 
energies. Some of the mysteries of nature, such as why the core of the earth 
remains molten and how the sun produces its energy, are explained by 
nuclear phenomena. The exploration of radioactivity and the nucleus 
revealed fundamental and previously unknown particles, forces, and 
conservation laws. That exploration has evolved into a search for further 
underlying structures, such as quarks. In this chapter, the fundamentals of 
nuclear radioactivity and the nucleus are explored. The following two 
chapters explore the more important applications of nuclear physics in the 
field of medicine. We will also explore the basics of what we know about 
quarks and other substructures smaller than nuclei. 


Nuclear Radioactivity 


e Explain nuclear radiation. 

e Explain the types of radiation—alpha emission, beta emission, and 
gamma emission. 

e Explain the ionization of radiation in an atom. 

Define the range of radiation. 


The discovery and study of nuclear radioactivity quickly revealed evidence 
of revolutionary new physics. In addition, uses for nuclear radiation also 
emerged quickly—for example, people such as Emest Rutherford used it to 
determine the size of the nucleus and devices were painted with radon- 
doped paint to make them glow in the dark (see [link]). We therefore begin 
our study of nuclear physics with the discovery and basic features of 
nuclear radioactivity. 
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The dials of this World 
War II aircraft glow in the 
dark, because they are 
painted with radium- 
doped phosphorescent 
paint. It is a poignant 
reminder of the dual 
nature of radiation. 
Although radium paint 
dials are conveniently 
visible day and night, 
they emit radon, a 
radioactive gas that is 
hazardous and is not 


directly sensed. (credit: 
U.S. Air Force Photo) 


Discovery of Nuclear Radioactivity 


In 1896, the French physicist Antoine Henri Becquerel (1852-1908) 
accidentally found that a uranium-rich mineral called pitchblende emits 
invisible, penetrating rays that can darken a photographic plate enclosed in 
an opaque envelope. The rays therefore carry energy; but amazingly, the 
pitchblende emits them continuously without any energy input. This is an 
apparent violation of the law of conservation of energy, one that we now 
understand is due to the conversion of a small amount of mass into energy, 
as related in Einstein’s famous equation E = mc’. It was soon evident that 
Becquerel’s rays originate in the nuclei of the atoms and have other unique 
characteristics. The emission of these rays is called nuclear radioactivity 
or simply radioactivity. The rays themselves are called nuclear radiation. 
A nucleus that spontaneously destroys part of its mass to emit radiation is 
said to decay (a term also used to describe the emission of radiation by 
atoms in excited states). A substance or object that emits nuclear radiation 
is said to be radioactive. 


Two types of experimental evidence imply that Becquerel’s rays originate 
deep in the heart (or nucleus) of an atom. First, the radiation is found to be 
associated with certain elements, such as uranium. Radiation does not vary 
with chemical state—that is, uranium is radioactive whether it is in the form 
of an element or compound. In addition, radiation does not vary with 
temperature, pressure, or ionization state of the uranium atom. Since all of 
these factors affect electrons in an atom, the radiation cannot come from 
electron transitions, as atomic spectra do. The huge energy emitted during 
each event is the second piece of evidence that the radiation cannot be 
atomic. Nuclear radiation has energies of the order of 10° eV per event, 
which is much greater than the typical atomic energies (a few eV), such as 
that observed in spectra and chemical reactions, and more than ten times as 
high as the most energetic characteristic x rays. Becquerel did not 
vigorously pursue his discovery for very long. In 1898, Marie Curie (1867— 


1934), then a graduate student married the already well-known French 
physicist Pierre Curie (1859-1906), began her doctoral study of Becquerel’s 
rays. She and her husband soon discovered two new radioactive elements, 
which she named polonium (after her native land) and radium (because it 
radiates). These two new elements filled holes in the periodic table and, 
further, displayed much higher levels of radioactivity per gram of material 
than uranium. Over a period of four years, working under poor conditions 
and spending their own funds, the Curies processed more than a ton of 
uranium ore to isolate a gram of radium salt. Radium became highly sought 
after, because it was about two million times as radioactive as uranium. 
Curie’s radium salt glowed visibly from the radiation that took its toll on 
them and other unaware researchers. Shortly after completing her Ph.D., 
both Curies and Becquerel shared the 1903 Nobel Prize in physics for their 
work on radioactivity. Pierre was killed in a horse cart accident in 1906, but 
Marie continued her study of radioactivity for nearly 30 more years. 
Awarded the 1911 Nobel Prize in chemistry for her discovery of two new 
elements, she remains the only person to win Nobel Prizes in physics and 
chemistry. Marie’s radioactive fingerprints on some pages of her notebooks 
can still expose film, and she suffered from radiation-induced lesions. She 
died of leukemia likely caused by radiation, but she was active in research 
almost until her death in 1934. The following year, her daughter and son-in- 
law, Irene and Frederic Joliot-Curie, were awarded the Nobel Prize in 
chemistry for their discovery of artificially induced radiation, adding to a 
remarkable family legacy. 


Alpha, Beta, and Gamma 


Research begun by people such as New Zealander Ernest Rutherford soon 
after the discovery of nuclear radiation indicated that different types of rays 
are emitted. Eventually, three types were distinguished and named alpha 
(a), beta(@), and gamma(v7), because, like x-rays, their identities were 
initially unknown. [link] shows what happens if the rays are passed through 
a magnetic field. The ys are unaffected, while the a s and fs are deflected 
in opposite directions, indicating the a s are positive, the @ s negative, and 
the y s uncharged. Rutherford used both magnetic and electric fields to 
show that a s have a positive charge twice the magnitude of an electron, or 
+2 | qe |. In the process, he found the a s charge to mass ratio to be several 


thousand times smaller than the electron’s. Later on, Rutherford collected a 
s from a radioactive source and passed an electric discharge through them, 
obtaining the spectrum of recently discovered helium gas. Among many 
important discoveries made by Rutherford and his collaborators was the 
proof that a radiation is the emission of a helium nucleus. Rutherford won 
the Nobel Prize in chemistry in 1908 for his early work. He continued to 
make important contributions until his death in 1934. 


Phosphorescent screen 
(viewed from above) 


Radioactive 
sources 


Alpha, beta, and gamma rays 
are passed through a magnetic 
field on the way to a 
phosphorescent screen. The a s 
and @ s bend in opposite 
directions, while the ys are 
unaffected, indicating a positive 
charge for a s, negative for 6 s, 
and neutral for 7 s. Consistent 
results are obtained with electric 
fields. Collection of the 
radiation offers further 


confirmation from the direct 
measurement of excess charge. 


Other researchers had already proved that ( s are negative and have the 
Same mass and same charge-to-mass ratio as the recently discovered 
electron. By 1902, it was recognized that @ radiation is the emission of an 
electron. Although { s are electrons, they do not exist in the nucleus before 
it decays and are not ejected atomic electrons—the electron is created in the 
nucleus at the instant of decay. 


Since yy s remain unaffected by electric and magnetic fields, it is natural to 
think they might be photons. Evidence for this grew, but it was not until 
1914 that this was proved by Rutherford and collaborators. By scattering y 
radiation from a crystal and observing interference, they demonstrated that 
y radiation is the emission of a high-energy photon by a nucleus. In fact, 
radiation comes from the de-excitation of a nucleus, just as an x ray comes 
from the de-excitation of an atom. The names "y ray" and "x ray" identify 
the source of the radiation. At the same energy, ‘yy rays and x rays are 
otherwise identical. 


Type of 
Radiation Range 
a A sheet of paper, a few cm of air, fractions of a 


; mm of tissue 
-Particles 


Type of 


Radiation Range 
B A thin aluminum plate, or tens of cm of tissue 
-Particles 
Y 
Several cm of lead or meters of concrete 
Rays 


Properties of Nuclear Radiation 


Ionization and Range 


Two of the most important characteristics of a, 8, and y rays were 
recognized very early. All three types of nuclear radiation produce 
ionization in materials, but they penetrate different distances in materials— 
that is, they have different ranges. Let us examine why they have these 
characteristics and what are some of the consequences. 


Like x rays, nuclear radiation in the form of as, 6s, and ys has enough 
energy per event to ionize atoms and molecules in any material. The energy 
emitted in various nuclear decays ranges from a few keV to more than 

10 MeV, while only a few eV are needed to produce ionization. The effects 
of x rays and nuclear radiation on biological tissues and other materials, 
such as solid state electronics, are directly related to the ionization they 
produce. All of them, for example, can damage electronics or kill cancer 
cells. In addition, methods for detecting x rays and nuclear radiation are 
based on ionization, directly or indirectly. All of them can ionize the air 
between the plates of a capacitor, for example, causing it to discharge. This 
is the basis of inexpensive personal radiation monitors, such as pictured in 
[link]. Apart from a, @, and +, there are other forms of nuclear radiation as 
well, and these also produce ionization with similar effects. We define 
ionizing radiation as any form of radiation that produces ionization 


whether nuclear in origin or not, since the effects and detection of the 
radiation are related to ionization. 


These dosimeters 
(literally, dose meters) are 
personal radiation 
monitors that detect the 
amount of radiation by 
the discharge of a 
rechargeable internal 
capacitor. The amount of 
discharge is related to the 
amount of ionizing 
radiation encountered, a 
measurement of dose. 
One dosimeter is shown 
in the charger. Its scale is 
read through an eyepiece 
on the top. (credit: L. 
Chang, Wikimedia 
Commons) 


The range of radiation is defined to be the distance it can travel through a 
material. Range is related to several factors, including the energy of the 


radiation, the material encountered, and the type of radiation (see [Link]). 
The higher the energy, the greater the range, all other factors being the 
same. This makes good sense, since radiation loses its energy in materials 
primarily by producing ionization in them, and each ionization of an atom 
or a molecule requires energy that is removed from the radiation. The 
amount of ionization is, thus, directly proportional to the energy of the 
particle of radiation, as is its range. 


higher E 


(c) 


The penetration or range of radiation depends on its energy, the 
material it encounters, and the type of radiation. (a) Greater 
energy means greater range. (b) Radiation has a smaller range 
in materials with high electron density. (c) Alphas have the 
smallest range, betas have a greater range, and gammas 
penetrate the farthest. 


Radiation can be absorbed or shielded by materials, such as the lead aprons 
dentists drape on us when taking x rays. Lead is a particularly effective 
shield compared with other materials, such as plastic or air. How does the 
range of radiation depend on material? Ionizing radiation interacts best with 
charged particles in a material. Since electrons have small masses, they 
most readily absorb the energy of the radiation in collisions. The greater the 


density of a material and, in particular, the greater the density of electrons 
within a material, the smaller the range of radiation. 


Note: 

Collisions 

Conservation of energy and momentum often results in energy transfer to a 
less massive object in a collision. This was discussed in detail in Work, 
Energy,_and Energy Resources, for example. 


Different types of radiation have different ranges when compared at the 
Same energy and in the same material. Alphas have the shortest range, betas 
penetrate farther, and gammas have the greatest range. This is directly 
related to charge and speed of the particle or type of radiation. At a given 
energy, each a, {, or -y will produce the same number of ionizations in a 
material (each ionization requires a certain amount of energy on average). 
The more readily the particle produces ionization, the more quickly it will 
lose its energy. The effect of charge is as follows: The a has a charge of 
+2q,, the G has a charge of —q, , and the y is uncharged. The 
electromagnetic force exerted by the a is thus twice as strong as that 
exerted by the 6 and it is more likely to produce ionization. Although 
chargeless, the y does interact weakly because it is an electromagnetic 
wave, but it is less likely to produce ionization in any encounter. More 
quantitatively, the change in momentum Ap given to a particle in the 
material is Ap = F'At, where F is the force the a, {, or y exerts over a 
time At. The smaller the charge, the smaller is F’ and the smaller is the 
momentum (and energy) lost. Since the speed of alphas is about 5% to 10% 
of the speed of light, classical (non-relativistic) formulas apply. 


The speed at which they travel is the other major factor affecting the range 
of as, Bs, and ys. The faster they move, the less time they spend in the 
vicinity of an atom or a molecule, and the less likely they are to interact. 
Since a s and (@ s are particles with mass (helium nuclei and electrons, 
respectively), their energy is kinetic, given classically by tmv’. The mass 


of the @ particle is thousands of times less than that of the a s, so that 6s 
must travel much faster than a s to have the same energy. Since 8 s move 
faster (most at relativistic speeds), they have less time to interact than a s. 
Gamma rays are photons, which must travel at the speed of light. They are 
even less likely to interact than a {, since they spend even less time near a 
given atom (and they have no charge). The range of y s is thus greater than 
the range of ( s. 


Alpha radiation from radioactive sources has a range much less than a 
millimeter of biological tissues, usually not enough to even penetrate the 
dead layers of our skin. On the other hand, the same a radiation can 
penetrate a few centimeters of air, so mere distance from a source prevents 
a radiation from reaching us. This makes a radiation relatively safe for our 
body compared to @ and ¥ radiation. Typical @ radiation can penetrate a few 
millimeters of tissue or about a meter of air. Beta radiation is thus 
hazardous even when not ingested. The range of ('s in lead is about a 
millimeter, and so it is easy to store @ sources in lead radiation-proof 
containers. Gamma rays have a much greater range than either as or (s. In 
fact, if a given thickness of material, like a lead brick, absorbs 90% of the y 
s, then a second lead brick will only absorb 90% of what got through the 
first. Thus, ys do not have a well-defined range; we can only cut down the 
amount that gets through. Typically, ys can penetrate many meters of air, go 
right through our bodies, and are effectively shielded (that is, reduced in 
intensity to acceptable levels) by many centimeters of lead. One benefit of 
s is that they can be used as radioactive tracers (see [Link]). 


This image of the 
concentration of a 
radioactive tracer in a 
patient’s body reveals 
where the most active 
bone cells are, an 
indication of bone cancer. 
A short-lived radioactive 
substance that locates 
itself selectively is given 
to the patient, and the 
radiation is measured 
with an external detector. 
The emitted -y radiation 
has a sufficient range to 
leave the body—the 
range of as and §s is too 
small for them to be 
observed outside the 
patient. (credit: Kieran 
Maher, Wikimedia 
Commons) 


Note: 

PhET Explorations: Beta Decay 

Build an atom out of protons, neutrons, and electrons, and see how the 
element, charge, and mass change. Then play a game to test your ideas! 


https://archive.cnx.org/specials/f0a27b96-f5c8-11e5-a22c- 
73f8c149bebf/beta-decay/#sim-multiple-atoms 


Section Summary 


e¢ Some nuclei are radioactive—they spontaneously decay destroying 
some part of their mass and emitting energetic rays, a process called 
nuclear radioactivity. 

e Nuclear radiation, like x rays, is ionizing radiation, because energy 
sufficient to ionize matter is emitted in each decay. 

e The range (or distance traveled in a material) of ionizing radiation is 
directly related to the charge of the emitted particle and its energy, with 
greater-charge and lower-energy particles having the shortest ranges. 

e Radiation detectors are based directly or indirectly upon the ionization 
created by radiation, as are the effects of radiation on living and inert 
materials. 


Conceptual Questions 


Exercise: 


Problem: 


Suppose the range for 5.0 MeVa ray is known to be 2.0 mm ina 
certain material. Does this mean that every 5.0 MeVa a ray that 
strikes this material travels 2.0 mm, or does the range have an average 
value with some statistical fluctuations in the distances traveled? 
Explain. 


Exercise: 


Problem: 


What is the difference between ¥ rays and characteristic x rays? Is 
either necessarily more energetic than the other? Which can be the 
most energetic? 


Exercise: 
Problem: 
Ionizing radiation interacts with matter by scattering from electrons 
and nuclei in the substance. Based on the law of conservation of 


momentum and energy, explain why electrons tend to absorb more 
energy than nuclei in these interactions. 


Exercise: 
Problem: 
What characteristics of radioactivity show it to be nuclear in origin and 
not atomic? 
Exercise: 
Problem: 
What is the source of the energy emitted in radioactive decay? Identify 


an earlier conservation law, and describe how it was modified to take 
such processes into account. 


Exercise: 
Problem: 
Consider [link]. If an electric field is substituted for the magnetic field 
with positive charge instead of the north pole and negative charge 


instead of the south pole, in which directions will the a, 6, and y rays 
bend? 


Exercise: 


Problem: 


Explain how an qa particle can have a larger range in air than a 3 
particle with the same energy in lead. 
Exercise: 
Problem: 
Arrange the following according to their ability to act as radiation 


shields, with the best first and worst last. Explain your ordering in 
terms of how radiation loses its energy in matter. 


(a) A solid material with low density composed of low-mass atoms. 
(b) A gas composed of high-mass atoms. 
(c) A gas composed of low-mass atoms. 


(d) A solid with high density composed of high-mass atoms. 
Exercise: 
Problem: 
Often, when people have to work around radioactive materials spills, 
we see them wearing white coveralls (usually a plastic material). What 


types of radiation (if any) do you think these suits protect the worker 
from, and how? 


Glossary 


alpha rays 
one of the types of rays emitted from the nucleus of an atom 


beta rays 
one of the types of rays emitted from the nucleus of an atom 


gamma rays 


one of the types of rays emitted from the nucleus of an atom 


ionizing radiation 
radiation (whether nuclear in origin or not) that produces ionization 
whether nuclear in origin or not 


nuclear radiation 
rays that originate in the nuclei of atoms, the first examples of which 
were discovered by Becquerel 


radioactivity 
the emission of rays from the nuclei of atoms 


radioactive 
a substance or object that emits nuclear radiation 


range of radiation 
the distance that the radiation can travel through a material 


Nuclear Decay and Conservation Laws 


e Define and discuss nuclear decay. 

e State the conservation laws. 

e Explain parent and daughter nucleus. 

e Calculate the energy emitted during nuclear decay. 


Nuclear decay has provided an amazing window into the realm of the very small. Nuclear 
decay gave the first indication of the connection between mass and energy, and it revealed the 
existence of two of the four basic forces in nature. In this section, we explore the major 
modes of nuclear decay; and, like those who first explored them, we will discover evidence 
of previously unknown particles and conservation laws. 


Some nuclides are stable, apparently living forever. Unstable nuclides decay (that is, they are 
radioactive), eventually producing a stable nuclide after many decays. We call the original 
nuclide the parent and its decay products the daughters. Some radioactive nuclides decay in 
a single step to a stable nucleus. For example, ©°Co is unstable and decays directly to ®°Ni, 
which is stable. Others, such as 7°°U, decay to another unstable nuclide, resulting in a decay 
series in which each subsequent nuclide decays until a stable nuclide is finally produced. The 
decay series that starts from 7°°U is of particular interest, since it produces the radioactive 
isotopes 2?6Ra and 7!°Po, which the Curies first discovered (see [link]). Radon gas is also 
produced (77*Rn in the series), an increasingly recognized naturally occurring hazard. Since 
radon is a noble gas, it emanates from materials, such as soil, containing even trace amounts 
of 73°U and can be inhaled. The decay of radon and its daughters produces internal damage. 
The 2°8U decay series ends with 2°6Pb, a stable isotope of lead. 
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The decay series produced by 2?8U, the 
most common uranium isotope. 
Nuclides are graphed in the same 
manner as in the chart of nuclides. The 
type of decay for each member of the 
series is shown, as well as the half- 
lives. Note that some nuclides decay by 
more than one mode. You can see why 
radium and polonium are found in 
uranium ore. A stable isotope of lead is 
the end product of the series. 


Note that the daughters of a decay shown in [link] always have two fewer protons and two 
fewer neutrons than the parent. This seems reasonable, since we know that a decay is the 
emission of a *He nucleus, which has two protons and two neutrons. The daughters of 8 
decay have one less neutron and one more proton than their parent. Beta decay is a little more 
subtle, as we shall see. No yy decays are shown in the figure, because they do not produce a 


daughter that differs from the parent. 


4.5 x 10°y 
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Alpha Decay 


In alpha decay, a He nucleus simply breaks away from the parent nucleus, leaving a 
daughter with two fewer protons and two fewer neutrons than the parent (see [link]). One 
example of a decay is shown in [link] for 7?°U. Another nuclide that undergoes a decay is 
239Py. The decay equations for these two nuclides are 

Equation: 


2387 aoe iad Wat Sf 4He 


and 
Equation: 


239Py — 25 + “He. 


Before After 


| B o- 


Parent Daughter 


Alpha decay is the 
separation of a *He 
nucleus from the 
parent. The 
daughter nucleus 
has two fewer 
protons and two 
fewer neutrons than 
the parent. Alpha 
decay occurs 
spontaneously only 
if the daughter and 
“He nucleus have 
less total mass than 
the parent. 


If you examine the periodic table of the elements, you will find that Th has Z = 90, two 
fewer than U, which has Z = 92. Similarly, in the second decay equation, we see that U has 
two fewer protons than Pu, which has Z = 94. The general rule for a decay is best written in 
the format ox n. If a certain nuclide is known to a decay (generally this information must be 
looked up in a table of isotopes, such as in Appendix B), its a decay equation is 


Equation: 


AX —> o3Yn-2 + He» (a decay) 


where Y is the nuclide that has two fewer protons than X, such as Th having two fewer than 
U. So if you were told that 7°?Pu a@ decays and were asked to write the complete decay 
equation, you would first look up which element has two fewer protons (an atomic number 
two lower) and find that this is uranium. Then since four nucleons have broken away from 
the original 239, its atomic mass would be 235. 


It is instructive to examine conservation laws related to a decay. You can see from the 
equation ox N-> oy N-2+ 5He2 that total charge is conserved. Linear and angular 
momentum are conserved, too. Although conserved angular momentum is not of great 
consequence in this type of decay, conservation of linear momentum has interesting 
consequences. If the nucleus is at rest when it decays, its momentum is zero. In that case, the 
fragments must fly in opposite directions with equal-magnitude momenta so that total 
momentum remains zero. This results in the a particle carrying away most of the energy, as a 
bullet from a heavy rifle carries away most of the energy of the powder burned to shoot it. 
Total mass—energy is also conserved: the energy produced in the decay comes from 
conversion of a fraction of the original mass. As discussed in Atomic Physics, the general 
relationship is 

Equation: 


E = (Am)c’. 


Here, & is the nuclear reaction energy (the reaction can be nuclear decay or any other 
reaction), and Am is the difference in mass between initial and final products. When the 
final products have less total mass, Am is positive, and the reaction releases energy (is 
exothermic). When the products have greater total mass, the reaction is endothermic (Am is 
negative) and must be induced with an energy input. For a decay to be spontaneous, the 
decay products must have smaller mass than the parent. 


Example: 

Alpha Decay Energy Found from Nuclear Masses 

Find the energy emitted in the a decay of 7?°Pu. 

Strategy 

Nuclear reaction energy, such as released in @ decay, can be found using the equation 

E = (Am)c?. We must first find Am, the difference in mass between the parent nucleus 
and the products of the decay. This is easily done using masses given in Appendix A. 
Solution 

The decay equation was given earlier for 2?°Pu ; it is 

Equation: 


Py > SU + “He. 


Thus the pertinent masses are those of 239Py, 255U, and the a particle or 4He, all of which 
are listed in Appendix A. The initial mass was m(?3°Pu) = 239.052157 u. The final mass 
is the sum m(2°U)-+m(*He)= 235.043924 u + 4.002602 u = 239.046526 u. Thus, 
Equation: 


Am = m(°Pu) — [m(?5U) + m(*He)] 
239.052157 u — 239.046526 u 
= 0.0005631 u. 


Now we can find F by entering Am into the equation: 
Equation: 


E = (Am)c? = (0.005631 u)c?. 


We know 1 u = 931.5 MeV/c?, and so 
Equation: 


E = (0.005631) (931.5 MeV/c”)(c?) = 5.25 MeV. 


Discussion 

The energy released in this a decay is in the MeV range, about 10° times as great as typical 
chemical reaction energies, consistent with many previous discussions. Most of this energy 
becomes kinetic energy of the a particle (or *He nucleus), which moves away at high speed. 
The energy carried away by the recoil of the 2°°U nucleus is much smaller in order to 
conserve momentum. The ?°U nucleus can be left in an excited state to later emit photons ( 
7 rays). This decay is spontaneous and releases energy, because the products have less mass 
than the parent nucleus. The question of why the products have less mass will be discussed 
in Binding Energy. Note that the masses given in Appendix A are atomic masses of neutral 
atoms, including their electrons. The mass of the electrons is the same before and after a 
decay, and so their masses subtract out when finding Am. In this case, there are 94 electrons 
before and after the decay. 


Beta Decay 


There are actually three types of beta decay. The first discovered was “ordinary” beta decay 
and is called 8~ decay or electron emission. The symbol 8 represents an electron emitted in 
nuclear beta decay. Cobalt-60 is a nuclide that 8~ decays in the following manner: 
Equation: 


6Co > ©Ni+ 8 + neutrino. 


The neutrino is a particle emitted in beta decay that was unanticipated and is of fundamental 
importance. The neutrino was not even proposed in theory until more than 20 years after beta 
decay was known to involve electron emissions. Neutrinos are so difficult to detect that the 
first direct evidence of them was not obtained until 1953. Neutrinos are nearly massless, have 
no charge, and do not interact with nucleons via the strong nuclear force. Traveling 
approximately at the speed of light, they have little time to affect any nucleus they encounter. 
This is, owing to the fact that they have no charge (and they are not EM waves), they do not 
interact through the EM force. They do interact via the relatively weak and very short range 
weak nuclear force. Consequently, neutrinos escape almost any detector and penetrate almost 
any shielding. However, neutrinos do carry energy, angular momentum (they are fermions 
with half-integral spin), and linear momentum away from a beta decay. When accurate 
measurements of beta decay were made, it became apparent that energy, angular momentum, 
and linear momentum were not accounted for by the daughter nucleus and electron alone. 
Either a previously unsuspected particle was carrying them away, or three conservation laws 
were being violated. Wolfgang Pauli made a formal proposal for the existence of neutrinos in 
1930. The Italian-born American physicist Enrico Fermi (1901-1954) gave neutrinos their 
name, meaning little neutral ones, when he developed a sophisticated theory of beta decay 
(see [link]). Part of Fermi’s theory was the identification of the weak nuclear force as being 
distinct from the strong nuclear force and in fact responsible for beta decay. 


Enrico Fermi was 
nearly unique 
among 20th- 
century physicists 
—he made 
significant 
contributions both 


as an 

experimentalist and 
a theorist. His 

many contributions 
to theoretical 


physics included 
the identification of 
the weak nuclear 
force. The fermi 
(fm) is named after 
him, as are an 
entire class of 
subatomic particles 
(fermions), an 
element (Fermium), 
and a major 
research laboratory 
(Fermilab). His 
experimental work 
included studies of 
radioactivity, for 
which he won the 
1938 Nobel Prize 
in physics, and 
creation of the first 
nuclear chain 
reaction. (credit: 
United States 
Department of 
Energy, Office of 
Public Affairs) 


The neutrino also reveals a new conservation law. There are various families of particles, one 
of which is the electron family. We propose that the number of members of the electron 
family is constant in any process or any closed system. In our example of beta decay, there 
are no members of the electron family present before the decay, but after, there is an electron 
and a neutrino. So electrons are given an electron family number of +1. The neutrino in 87 
decay is an electron’s antineutrino, given the symbol v,, where v is the Greek letter nu, and 
the subscript e means this neutrino is related to the electron. The bar indicates this is a 
particle of antimatter. (All particles have antimatter counterparts that are nearly identical 
except that they have the opposite charge. Antimatter is almost entirely absent on Earth, but it 
is found in nuclear decay and other nuclear and particle reactions as well as in outer space.) 
The electron’s antineutrino v,, being antimatter, has an electron family number of —1. The 
total is zero, before and after the decay. The new conservation law, obeyed in all 
circumstances, states that the total electron family number is constant. An electron cannot be 
created without also creating an antimatter family member. This law is analogous to the 
conservation of charge in a situation where total charge is originally zero, and equal amounts 
of positive and negative charge must be created in a reaction to keep the total zero. 


If a nuclide 4X y is known to B~ decay, then its 8~ decay equation is 
Equation: 


oXn + Pes a +B +v.(B decay), 


where Y is the nuclide having one more proton than X (see [link]). So if you know that a 
certain nuclide 8~ decays, you can find the daughter nucleus by first looking up Z for the 
parent and then determining which element has atomic number Z + 1. In the example of the 
B~ decay of ®°Co given earlier, we see that Z = 27 for Co and Z = 28 is Ni. It is as if one 
of the neutrons in the parent nucleus decays into a proton, electron, and neutrino. In fact, 
neutrons outside of nuclei do just that—they live only an average of a few minutes and B~ 
decay in the following manner: 

Equation: 


n>pt+P +H. 


Vv, 
Parent Daughter me 


In B~ decay, the 
parent nucleus 
emits an electron 
and an antineutrino. 
The daughter 
nucleus has one 
more proton and 
one less neutron 
than its parent. 
Neutrinos interact 
so weakly that they 
are almost never 
directly observed, 
but they play a 
fundamental role in 
particle physics. 


We see that charge is conserved in @~ decay, since the total charge is Z before and after the 
decay. For example, in ®°Co decay, total charge is 27 before decay, since cobalt has Z = 27. 
After decay, the daughter nucleus is Ni, which has Z = 28, and there is an electron, so that 
the total charge is also 28 + (—1) or 27. Angular momentum is conserved, but not obviously 
(you have to examine the spins and angular momenta of the final products in detail to verify 
this). Linear momentum is also conserved, again imparting most of the decay energy to the 
electron and the antineutrino, since they are of low and zero mass, respectively. Another new 
conservation law is obeyed here and elsewhere in nature. The total number of nucleons A is 
conserved. In ®°Co decay, for example, there are 60 nucleons before and after the decay. 
Note that total A is also conserved in @ decay. Also note that the total number of protons 
changes, as does the total number of neutrons, so that total Z and total N are not conserved 


in B~ decay, as they are in a decay. Energy released in G~ decay can be calculated given the 
masses of the parent and products. 


Example: 

B~ Decay Energy from Masses 

Find the energy emitted in the 8~ decay of ®°Co. 

Strategy and Concept 

As in the preceding example, we must first find Am, the difference in mass between the 
parent nucleus and the products of the decay, using masses given in Appendix A. Then the 
emitted energy is calculated as before, using EF = (Am)c?. The initial mass is just that of 
the parent nucleus, and the final mass is that of the daughter nucleus and the electron created 
in the decay. The neutrino is massless, or nearly so. However, since the masses given in 
Appendix A are for neutral atoms, the daughter nucleus has one more electron than the 


parent, and so the extra electron mass that corresponds to the § is included in the atomic 
mass of Ni. Thus, 


Equation: 

Am = m(®Co) — m(°Ni ). 
Solution 
The B~ decay equation for ®°Co is 
Equation: 

$°Co33 —-> Se Nize + B+ V.¢. 
As noticed, 
Equation: 


Am = m(Co) — m(°Ni ). 


Entering the masses found in Appendix A gives 
Equation: 


Am = 59.933820 u — 59.930789 u = 0.003031 u. 


Thus, 
Equation: 


E = (Am)c? = (0.003031 u)c?. 


Using 1 u = 931.5 MeV/c?, we obtain 
Equation: 


E = (0.003031)(931.5 MeV/c”)(c”) = 2.82 MeV. 


Discussion and Implications 

Perhaps the most difficult thing about this example is convincing yourself that the 8” mass 
is included in the atomic mass of © Ni. Beyond that are other implications. Again the decay 
energy is in the MeV range. This energy is shared by all of the products of the decay. In 
many ®°Co decays, the daughter nucleus °°Ni is left in an excited state and emits photons ( 
7 rays). Most of the remaining energy goes to the electron and neutrino, since the recoil 
kinetic energy of the daughter nucleus is small. One final note: the electron emitted in G~ 
decay is created in the nucleus at the time of decay. 


The second type of beta decay is less common than the first. It is 6* decay. Certain nuclides 
decay by the emission of a positive electron. This is antielectron or positron decay (see 


[link]). 


B* decay 
Before After 


S| ap 


Vv 
Parent Daughter Ne 


B* decay is the 
emission of a 
positron that 

eventually finds an 
electron to 
annihilate, 
characteristically 
producing gammas 
in opposite 
directions. 


The antielectron is often represented by the symbol e*, but in beta decay it is written as 8 
to indicate the antielectron was emitted in a nuclear decay. Antielectrons are the antimatter 
counterpart to electrons, being nearly identical, having the same mass, spin, and so on, but 
having a positive charge and an electron family number of —1. When a positron encounters 
an electron, there is a mutual annihilation in which all the mass of the antielectron-electron 
pair is converted into pure photon energy. (The reaction, ef + e~ — y +7, conserves 
electron family number as well as all other conserved quantities.) If a nuclide oh n is known 
to B* decay, then its 8* decay equation is 

Equation: 


oXw > 7 4¥wii + Bt + ve (Bt decay), 


where Y is the nuclide having one less proton than X (to conserve charge) and 1 is the 
symbol for the electron’s neutrino, which has an electron family number of +1. Since an 
antimatter member of the electron family (the G*) is created in the decay, a matter member of 
the family (here the v.) must also be created. Given, for example, that 22Na @* decays, you 
can write its full decay equation by first finding that Z = 11 for ?*Na, so that the daughter 
nuclide will have Z = 10, the atomic number for neon. Thus the 8* decay equation for 7*Na 
is 

Equation: 


22 22 oF 
11Nau —> i0Ne12 + B + Ve. 


In B* decay, it is as if one of the protons in the parent nucleus decays into a neutron, a 
positron, and a neutrino. Protons do not do this outside of the nucleus, and so the decay is due 
to the complexities of the nuclear force. Note again that the total number of nucleons is 
constant in this and any other reaction. To find the energy emitted in G* decay, you must 
again count the number of electrons in the neutral atoms, since atomic masses are used. The 
daughter has one less electron than the parent, and one electron mass is created in the decay. 
Thus, in @* decay, 

Equation: 


Am = m(parent) — {m(daughter) + 2m], 


since we use the masses of neutral atoms. 


Electron capture is the third type of beta decay. Here, a nucleus captures an inner-shell 
electron and undergoes a nuclear reaction that has the same effect as 8* decay. Electron 
capture is sometimes denoted by the letters EC. We know that electrons cannot reside in the 
nucleus, but this is a nuclear reaction that consumes the electron and occurs spontaneously 
only when the products have less mass than the parent plus the electron. If a nuclide ax Nn is 
known to undergo electron capture, then its electron capture equation is 

Equation: 


ax nte —- gk Ned + y.(electron capture, or EC). 


Any nuclide that can G* decay can also undergo electron capture (and often does both). The 
same conservation laws are obeyed for EC as for 3* decay. It is good practice to confirm 
these for yourself. 


All forms of beta decay occur because the parent nuclide is unstable and lies outside the 
region of stability in the chart of nuclides. Those nuclides that have relatively more neutrons 
than those in the region of stability will G~ decay to produce a daughter with fewer neutrons, 
producing a daughter nearer the region of stability. Similarly, those nuclides having relatively 
more protons than those in the region of stability will G~ decay or undergo electron capture 
to produce a daughter with fewer protons, nearer the region of stability. 


Gamma Decay 


Gamma decay is the simplest form of nuclear decay—it is the emission of energetic photons 
by nuclei left in an excited state by some earlier process. Protons and neutrons in an excited 
nucleus are in higher orbitals, and they fall to lower levels by photon emission (analogous to 
electrons in excited atoms). Nuclear excited states have lifetimes typically of only about 
10~*“ s, an indication of the great strength of the forces pulling the nucleons to lower states. 
The y decay equation is simply 

Equation: 


ex. > ak +1 +Y24+°°: (7 decay) 


where the asterisk indicates the nucleus is in an excited state. There may be one or more 7 s 
emitted, depending on how the nuclide de-excites. In radioactive decay, -y emission is 
common and is preceded by 7¥ or @ decay. For example, when ©°Co 8~ decays, it most often 
leaves the daughter nucleus in an excited state, written ©°Ni*. Then the nickel nucleus 
quickly y decays by the emission of two penetrating ¥ s: 

Equation: 


6ONi* —> 60Ni + V1 + Y2- 


These are called cobalt rays, although they come from nickel—they are used for cancer 
therapy, for example. It is again constructive to verify the conservation laws for gamma 
decay. Finally, since y decay does not change the nuclide to another species, it is not 
prominently featured in charts of decay series, such as that in [link]. 


There are other types of nuclear decay, but they occur less commonly than a, (, and + decay. 
Spontaneous fission is the most important of the other forms of nuclear decay because of its 
applications in nuclear power and weapons. It is covered in the next chapter. 


Section Summary 


e When a parent nucleus decays, it produces a daughter nucleus following rules and 
conservation laws. There are three major types of nuclear decay, called alpha (a), beta 
(8), and gamma (7). The a decay equation is 
Equation: 


A A-4 4 
ZXN — 7_o\N-2 + 9Heo. 


e Nuclear decay releases an amount of energy F related to the mass destroyed Am by 
Equation: 


E = (Am)c’. 


e There are three forms of beta decay. The G~ decay equation is 
Equation: 


4Xn > $44Yn-1+ 8 +e. 


e The 8* decay equation is 
Equation: 


gXn =a $4Ynu a oR + Ve. 


e The electron capture equation is 
Equation: 


A - _,A 
ZxXN +e > Z-1YN+41 + Ve. 
e 2 is anelectron, G* is an antielectron or positron, v, represents an electron’s neutrino, 
and v, is an electron’s antineutrino. In addition to all previously known conservation 
laws, two new ones arise— conservation of electron family number and conservation of 


the total number of nucleons. The y decay equation is 
Equation: 


eXny 7 oXwtntrete: 


7 is a high-energy photon originating in a nucleus. 


Conceptual Questions 


Exercise: 


Problem: 


Star Trek fans have often heard the term “antimatter drive.” Describe how you could use 
a magnetic field to trap antimatter, such as produced by nuclear decay, and later 
combine it with matter to produce energy. Be specific about the type of antimatter, the 
need for vacuum storage, and the fraction of matter converted into energy. 


Exercise: 
Problem: 
What conservation law requires an electron’s neutrino to be produced in electron 
capture? Note that the electron no longer exists after it is captured by the nucleus. 
Exercise: 
Problem: 
Neutrinos are experimentally determined to have an extremely small mass. Huge 
numbers of neutrinos are created in a supernova at the same time as massive amounts of 
light are first produced. When the 1987A supernova occurred in the Large Magellanic 
Cloud, visible primarily in the Southern Hemisphere and some 100,000 light-years away 
from Earth, neutrinos from the explosion were observed at about the same time as the 


light from the blast. How could the relative arrival times of neutrinos and light be used 
to place limits on the mass of neutrinos? 


Exercise: 
Problem: 


What do the three types of beta decay have in common that is distinctly different from 
alpha decay? 


Problems & Exercises 


In the following eight problems, write the complete decay equation for the given nuclide in 
the complete x N notation. Refer to the periodic table for values of Z. 
Exercise: 


Problem: 


G~ decay of *H (tritium), a manufactured isotope of hydrogen used in some digital 
watch displays, and manufactured primarily for use in hydrogen bombs. 


Solution: 
Equation: 


2H —> >He, + B- + Ve 


Exercise: 


Problem: 


B~ decay of “°K, a naturally occurring rare isotope of potassium responsible for some 
of our exposure to background radiation. 


Exercise: 


Problem: G* decay of °°Mn. 


Solution: 
Equation: 


50 50 + 
95 Mo5 — 54Cro6 + B" + Ve 
Exercise: 


Problem: {* decay of °*Fe. 


Exercise: 


Problem: Electron capture by Be. 


Solution: 
Equation: 


iBeg +e — 3Lig + ve 
Exercise: 
Problem: Electron capture by !In. 


Exercise: 
Problem: 
a decay of 7!°Po, the isotope of polonium in the decay series of ??°U that was 
discovered by the Curies. A favorite isotope in physics labs, since it has a short half-life 


and decays to a stable nuclide. 


Solution: 
Equation: 


210 206 4 
34 P0126 — gp Pbio4 + 9Hee 


Exercise: 
Problem: 
a decay of ??6Ra, another isotope in the decay series of 2°8U, first recognized as a new 


element by the Curies. Poses special problems because its daughter is a radioactive 
noble gas. 


In the following four problems, identify the parent nuclide and write the complete decay 
equation in the ox n notation. Refer to the periodic table for values of Z. 
Exercise: 


Problem: 
G~ decay producing !°’Ba. The parent nuclide is a major waste product of reactors and 


has chemistry similar to potassium and sodium, resulting in its concentration in your 
cells if ingested. 


Solution: 
Equation: 


137 137 = 
55 Csg2 > 56 Bagi + B +ve 


Exercise: 
Problem: 
B~ decay producing °°Y. The parent nuclide is a major waste product of reactors and 


has chemistry similar to calcium, so that it is concentrated in bones if ingested (°°Y is 
also radioactive.) 


Exercise: 


Problem: 
a: decay producing 2*°Ra. The parent nuclide is nearly 100% of the natural element and 
is found in gas lantern mantles and in metal alloys used in jets (7?°Ra is also 


radioactive). 


Solution: 
Equation: 


232 228 4 
90 Thyzo — 98 Ray4o + 9Heg 


Exercise: 


Problem: 
a: decay producing 2°°Pb. The parent nuclide is in the decay series produced by 2??Th, 
the only naturally occurring isotope of thorium. 

Exercise: 
Problem: 
When an electron and positron annihilate, both their masses are destroyed, creating two 
equal energy photons to preserve momentum. (a) Confirm that the annihilation equation 
et +e —y+/yconserves charge, electron family number, and total number of 
nucleons. To do this, identify the values of each before and after the annihilation. (b) 
Find the energy of each y ray, assuming the electron and positron are initially nearly at 


rest. (c) Explain why the two ¥ rays travel in exactly opposite directions if the center of 
mass of the electron-positron system is initially at rest. 


Solution: 


(a) 
charge:(+1) + (—1) =0; electron family number: (+1) + (—1) =0; A:0+0=0 


(b) 0.511 MeV 


(c) The two ¥y rays must travel in exactly opposite directions in order to conserve 
momentum, since initially there is zero momentum if the center of mass is initially at 
rest. 


Exercise: 
Problem: 
Confirm that charge, electron family number, and the total number of nucleons are all 


conserved by the rule for a decay given in the equation eps N—> oo N-2+ 5Hep. To 
do this, identify the values of each before and after the decay. 


Exercise: 
Problem: 
Confirm that charge, electron family number, and the total number of nucleons are all 
conserved by the rule for 8~ decay given in the equation 


ox Nn a ae NEE B- + v-. To do this, identify the values of each before and after 
the decay. 


Solution: 
Equation: 


Z=(Z+4+1)-1; A=A; efn:0=(41)+4+(-1) 


Exercise: 


Problem: 


Confirm that charge, electron family number, and the total number of nucleons are all 
conserved by the rule for @~ decay given in the equation ox N— aed n-1+8 + 
. To do this, identify the values of each before and after the decay. 


Exercise: 


Problem: 


Confirm that charge, electron family number, and the total number of nucleons are all 
conserved by the rule for electron capture given in the equation 

ox nte -> eee 4 N+1 + Ve. To do this, identify the values of each before and after 
the capture. 


Solution: 
Equation: 


Z-1=Z-1; A=A; efn:(+1) = (41) 
Exercise: 
Problem: 


A rare decay mode has been observed in which 2??Ra emits a '*C nucleus. (a) The 
decay equation is ??7Ra +4 X+'4C. Identify the nuclide 4X. (b) Find the energy 
emitted in the decay. The mass of 2?*Ra is 222.015353 u. 


Exercise: 
Problem: (a) Write the complete a decay equation for ??°Ra. 
(b) Find the energy released in the decay. 
Solution: 
(a) 22°Raizg > 32?Ruiz6 + 3He2 
(b) 4.87 MeV 
Exercise: 
Problem: (a) Write the complete a decay equation for 74°Cf. 


(b) Find the energy released in the decay. 


Exercise: 


Problem: 


(a) Write the complete G~ decay equation for the neutron. (b) Find the energy released 
in the decay. 


Solution: 
(a) > phe eve 


(b) ) 0.783 MeV 
Exercise: 


Problem: 


(a) Write the complete 8~ decay equation for 9°Sr, a major waste product of nuclear 
reactors. (b) Find the energy released in the decay. 


Exercise: 


Problem: 


Calculate the energy released in the G* decay of ?*Na, the equation for which is given 
in the text. The masses of 22Na and ?2Ne are 21.994434 and 21.991383 u, respectively. 


Solution: 
1.82 MeV 
Exercise: 
Problem: (a) Write the complete 6* decay equation for !!C. 


(b) Calculate the energy released in the decay. The masses of ''C and ''B are 
11.011433 and 11.009305 u, respectively. 


Exercise: 
Problem: (a) Calculate the energy released in the a decay of 7°°U. 


(b) What fraction of the mass of a single 2°°U is destroyed in the decay? The mass of 
34Th is 234.043593 u. 


(c) Although the fractional mass loss is large for a single nucleus, it is difficult to 
observe for an entire macroscopic sample of uranium. Why is this? 


Solution: 


(a) 4.274 MeV 
(b) 1.927 x 10°° 


(c) Since U-238 is a slowly decaying substance, only a very small number of nuclei 
decay on human timescales; therefore, although those nuclei that decay lose a noticeable 
fraction of their mass, the change in the total mass of the sample is not detectable for a 
macroscopic sample. 


Exercise: 
Problem: (a) Write the complete reaction equation for electron capture by “Be. 
(b) Calculate the energy released. 

Exercise: 
Problem: (a) Write the complete reaction equation for electron capture by 1°O. 
(b) Calculate the energy released. 
Solution: 
(a); O7 +e GS ENg +1 


(b) 2.754 MeV 


Glossary 


parent 
the original state of nucleus before decay 


daughter 
the nucleus obtained when parent nucleus decays and produces another nucleus 
following the rules and the conservation laws 


positron 
the particle that results from positive beta decay; also known as an antielectron 


decay 
the process by which an atomic nucleus of an unstable atom loses mass and energy by 
emitting ionizing particles 


alpha decay 
type of radioactive decay in which an atomic nucleus emits an alpha particle 


beta decay 
type of radioactive decay in which an atomic nucleus emits a beta particle 


gamma decay 
type of radioactive decay in which an atomic nucleus emits a gamma particle 


decay equation 
the equation to find out how much of a radioactive material is left after a given period of 
time 


nuclear reaction energy 
the energy created in a nuclear reaction 


neutrino 
an electrically neutral, weakly interacting elementary subatomic particle 


electron’s antineutrino 
antiparticle of electron’s neutrino 


positron decay 
type of beta decay in which a proton is converted to a neutron, releasing a positron and a 
neutrino 


antielectron 
another term for positron 


decay series 
process whereby subsequent nuclides decay until a stable nuclide is produced 


electron’s neutrino 
a subatomic elementary particle which has no net electric charge 


antimatter 
composed of antiparticles 


electron capture 
the process in which a proton-rich nuclide absorbs an inner atomic electron and 
simultaneously emits a neutrino 


electron capture equation 
equation representing the electron capture 


Half-Life and Activity 


¢ Define half-life. 
¢ Define dating. 
¢ Calculate age of old objects by radioactive dating. 


Unstable nuclei decay. However, some nuclides decay faster than others. 
For example, radium and polonium, discovered by the Curies, decay faster 
than uranium. This means they have shorter lifetimes, producing a greater 
rate of decay. In this section we explore half-life and activity, the 
quantitative terms for lifetime and rate of decay. 


Half-Life 


Why use a term like half-life rather than lifetime? The answer can be found 
by examining [link], which shows how the number of radioactive nuclei in 
a sample decreases with time. The time in which half of the original number 
of nuclei decay is defined as the half-life, ¢; /2. Half of the remaining nuclei 
decay in the next half-life. Further, half of that amount decays in the 
following half-life. Therefore, the number of radioactive nuclei decreases 
from N to N/2 in one half-life, then to N/4 in the next, and to N/8 in the 
next, and so on. If N is a large number, then many half-lives (not just two) 
pass before all of the nuclei decay. Nuclear decay is an example of a purely 
Statistical process. A more precise definition of half-life is that each nucleus 
has a 50% chance of living for a time equal to one half-life 1/2. Thus, if N 
is reasonably large, half of the original nuclei decay in a time of one half- 
life. If an individual nucleus makes it through that time, it still has a 50% 
chance of surviving through another half-life. Even if it happens to make it 
through hundreds of half-lives, it still has a 50% chance of surviving 
through one more. The probability of decay is the same no matter when you 
start counting. This is like random coin flipping. The chance of heads is 
50%, no matter what has happened before. 
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Radioactive decay reduces the 
number of radioactive nuclei 
over time. In one half-life 1/2, 
the number decreases to half of 
its original value. Half of what 
remains decay in the next half- 
life, and half of those in the 
next, and so on. This is an 
exponential decay, as seen in 
the graph of the number of 
nuclei present as a function of 
time. 


There is a tremendous range in the half-lives of various nuclides, from as 
short as 10~2° s for the most unstable, to more than 106 y for the least 
unstable, or about 46 orders of magnitude. Nuclides with the shortest half- 
lives are those for which the nuclear forces are least attractive, an indication 
of the extent to which the nuclear force can depend on the particular 
combination of neutrons and protons. The concept of half-life is applicable 
to other subatomic particles, as will be discussed in Particle Physics. It is 
also applicable to the decay of excited states in atoms and nuclei. The 
following equation gives the quantitative relationship between the original 


number of nuclei present at time zero (Vg) and the number (JV) at a later 
time ¢: 
Equation: 


N=Noe™, 


where e = 2.71828... is the base of the natural logarithm, and ) is the 
decay constant for the nuclide. The shorter the half-life, the larger is the 
value of A, and the faster the exponential e~*’ decreases with time. The 
relationship between the decay constant A and the half-life 1/2 is 
Equation: 


In(2) _ 0.693 


= a 
ti/2 t1/2 


To see how the number of nuclei declines to half its original value in one 
half-life, let ¢ = ¢1/2 in the exponential in the equation N = N, oe. This 
gives VN = Noe = Noe °°? = 0.500.No. For integral numbers of half- 
lives, you can just divide the original number by 2 over and over again, 
rather than using the exponential relationship. For example, if ten half-lives 
have passed, we divide N by 2 ten times. This reduces it to N/1024. For 
an arbitrary time, not just a multiple of the half-life, the exponential 
relationship must be used. 


Radioactive dating is a clever use of naturally occurring radioactivity. Its 
most famous application is carbon-14 dating. Carbon-14 has a half-life of 
5730 years and is produced in a nuclear reaction induced when solar 
neutrinos strike *N in the atmosphere. Radioactive carbon has the same 
chemistry as stable carbon, and so it mixes into the ecosphere, where it is 
consumed and becomes part of every living organism. Carbon-14 has an 
abundance of 1.3 parts per trillion of normal carbon. Thus, if you know the 
number of carbon nuclei in an object (perhaps determined by mass and 
Avogadro’s number), you multiply that number by 1.3107"? to find the 
number of *4C nuclei in the object. When an organism dies, carbon 
exchange with the environment ceases, and 14C is not replenished as it 


decays. By comparing the abundance of !4C in an artifact, such as mummy 
wrappings, with the normal abundance in living tissue, it is possible to 
determine the artifact’s age (or time since death). Carbon-14 dating can be 
used for biological tissues as old as 50 or 60 thousand years, but is most 
accurate for younger samples, since the abundance of ‘*C nuclei in them is 
greater. Very old biological materials contain no 4C at all. There are 
instances in which the date of an artifact can be determined by other means, 
such as historical knowledge or tree-ring counting. These cross-references 
have confirmed the validity of carbon-14 dating and permitted us to 
calibrate the technique as well. Carbon-14 dating revolutionized parts of 
archaeology and is of such importance that it earned the 1960 Nobel Prize 
in chemistry for its developer, the American chemist Willard Libby (1908— 
1980). 


One of the most famous cases of carbon-14 dating involves the Shroud of 
Turin, a long piece of fabric purported to be the burial shroud of Jesus (see 
[link]). This relic was first displayed in Turin in 1354 and was denounced as 
a fraud at that time by a French bishop. Its remarkable negative imprint of 
an apparently crucified body resembles the then-accepted image of Jesus, 
and so the shroud was never disregarded completely and remained 
controversial over the centuries. Carbon-14 dating was not performed on 
the shroud until 1988, when the process had been refined to the point where 
only a small amount of material needed to be destroyed. Samples were 
tested at three independent laboratories, each being given four pieces of 
cloth, with only one unidentified piece from the shroud, to avoid prejudice. 
All three laboratories found samples of the shroud contain 92% of the ‘4C 
found in living tissues, allowing the shroud to be dated (see [Link]). 


Part of the Shroud of 
Turin, which shows a 
remarkable negative 
imprint likeness of Jesus 
complete with evidence 
of crucifixion wounds. 
The shroud first surfaced 
in the 14th century and 
was only recently carbon- 
14 dated. It has not been 
determined how the 
image was placed on the 
material. (credit: Butko, 
Wikimedia Commons) 


Example: 

How Old Is the Shroud of Turin? 

Calculate the age of the Shroud of Turin given that the amount of 4C 
found in it is 92% of that in living tissue. 

Strategy 

Knowing that 92% of the ‘*C remains means that N/No = 0.92. 
Therefore, the equation N = Noe ~ can be used to find At. We also know 
that the half-life of '4C is 5730 y, and so once At is known, we can use the 


equation A = ae to find 4 and then find t as requested. Here, we 


postulate that the decrease in ‘4C is solely due to nuclear decay. 
Solution 

Solving the equation N = Noe~™ for N/ No gives 

Equation: 


Thus, 
Equation: 


O23 —=er. 


Taking the natural logarithm of both sides of the equation yields 
Equation: 


In 0.92 = -At 
so that 
Equation: 
—0.0834 = —At. 

Rearranging to isolate ¢ gives 
Equation: 

_ 0.0834 

= eae 


0.693 
ti/2 


and substituting the known half-life gives 


Now, the equation \ = can be used to find A for *C. Solving for » 


Equation: 

= 0.693 0.693 

ti = BT30y" 
We enter this value into the previous equation to find tf: 
Equation: 
0.0834 
SS oars as 690 y. 
5730 y 

Discussion 


This dates the material in the shroud to 1988-690 = a.d. 1300. Our 
calculation is only accurate to two digits, so that the year is rounded to 
1300. The values obtained at the three independent laboratories gave a 


weighted average date of a.d. 1320 + 60. The uncertainty is typical of 
carbon-14 dating and is due to the small amount of !*C in living tissues, 
the amount of material available, and experimental uncertainties (reduced 
by having three independent measurements). It is meaningful that the date 
of the shroud is consistent with the first record of its existence and 
inconsistent with the period in which Jesus lived. 


There are other forms of radioactive dating. Rocks, for example, can 
sometimes be dated based on the decay of 7°°U. The decay series for 72°U 
ends with 2°°Pb, so that the ratio of these nuclides in a rock is an indication 
of how long it has been since the rock solidified. The original composition 
of the rock, such as the absence of lead, must be known with some 
confidence. However, as with carbon-14 dating, the technique can be 
verified by a consistent body of knowledge. Since 738U has a half-life of 
4.5 x 10° y, it is useful for dating only very old materials, showing, for 
example, that the oldest rocks on Earth solidified about 3.5 x 10° years 
ago. 


Activity, the Rate of Decay 


What do we mean when we say a source is highly radioactive? Generally, 
this means the number of decays per unit time is very high. We define 
activity R to be the rate of decay expressed in decays per unit time. In 
equation form, this is 

Equation: 


AN 
R=—— 
At 


where AN is the number of decays that occur in time At. The SI unit for 
activity is one decay per second and is given the name becquerel (Bq) in 
honor of the discoverer of radioactivity. That is, 

Equation: 


1 Bq = 1 decay/s. 


Activity R is often expressed in other units, such as decays per minute or 
decays per year. One of the most common units for activity is the curie 
(Ci), defined to be the activity of 1 g of ?”°Ra, in honor of Marie Curie’s 
work with radium. The definition of curie is 

Equation: 


1 Ci = 3.70 x 10!” Bq, 


or 3.70 x 107° decays per second. A curie is a large unit of activity, while a 
becquerel is a relatively small unit. 1 MBq = 100 microcuries (Ci). In 
countries like Australia and New Zealand that adhere more to SI units, most 
radioactive sources, such as those used in medical diagnostics or in physics 
laboratories, are labeled in Bq or megabecquerel (MBq). 


Intuitively, you would expect the activity of a source to depend on two 
things: the amount of the radioactive substance present, and its half-life. 
The greater the number of radioactive nuclei present in the sample, the 
more will decay per unit of time. The shorter the half-life, the more decays 
per unit time, for a given number of nuclei. So activity R should be 
proportional to the number of radioactive nuclei, NV, and inversely 
proportional to their half-life, 1/2. In fact, your intuition is correct. It can be 
shown that the activity of a source is 

Equation: 


_ 0.693N 
t1/2 


R 


where JV is the number of radioactive nuclei present, having half-life ¢1 /2. 
This relationship is useful in a variety of calculations, as the next two 
examples illustrate. 


Example: 

How Great Is the '*C Activity in Living Tissue? 

Calculate the activity due to *C in 1.00 kg of carbon found in a living 
organism. Express the activity in units of Bq and Ci. 

Strategy 


To find the activity R using the equation R = 23 


t1/2 
and ¢1/2. The half-life of 14C can be found in Appendix B, and was stated 
above as 5730 y. To find N, we first find the number of !*C nuclei in 1.00 
kg of carbon using the concept of a mole. As indicated, we then multiply 
by 1.31071? (the abundance of !4C in a carbon sample from a living 
organism) to get the number of !*C nuclei in a living organism. 

Solution 

One mole of carbon has a mass of 12.0 g, since it is nearly pure 12C. (A 
mole has a mass in grams equal in magnitude to A found in the periodic 
table.) Thus the number of carbon nuclei in a kilogram is 

Equation: 


, we must know NV 


_ 6.02 x 103 mol * 


N(?C) = 1000 g) = 5.02 x 107°. 
©) 12.0 g/mol N g) ‘. 


So the number of !4C nuclei in 1 kg of carbon is 
Equation: 


N(**C) = (5.02 x 10°)(1.3 x 107'*) = 6.52 x 10°. 


Now the activity is found using the equation R = ane 
Entering known values gives 
Equation: 
0.693(6.52 10") aa 
= = (OU I, 


5730 y 


or 7.89 x 10° decays per year. To convert this to the unit Bq, we simply 
convert years to seconds. Thus, 
Equation: 


1.00 y 


R= (789.10? y 
( of | aiacanine 


= 250 Ba, 


or 250 decays per second. To express F in curies, we use the definition of 
a curie, 


Equation: 
250 B 
ral Gk 
3.7x 101° Bq/Ci 

Thus, 

Equation: 

R = 6.76 nCi. 

Discussion 


Our own bodies contain kilograms of carbon, and it is intriguing to think 
there are hundreds of *C decays per second taking place in us. Carbon-14 
and other naturally occurring radioactive substances in our bodies 
contribute to the background radiation we receive. The small number of 
decays per second found for a kilogram of carbon in this example gives 
you some idea of how difficult it is to detect 4C in a small sample of 
material. If there are 250 decays per second in a kilogram, then there are 
0.25 decays per second in a gram of carbon in living tissue. To observe 
this, you must be able to distinguish decays from other forms of radiation, 
in order to reduce background noise. This becomes more difficult with an 
old tissue sample, since it contains less 14C) and for samples more than 50 
thousand years old, it is impossible. 


Human-made (or artificial) radioactivity has been produced for decades and 
has many uses. Some of these include medical therapy for cancer, medical 
imaging and diagnostics, and food preservation by irradiation. Many 
applications as well as the biological effects of radiation are explored in 
Medical Applications of Nuclear Physics, but it is clear that radiation is 
hazardous. A number of tragic examples of this exist, one of the most 
disastrous being the meltdown and fire at the Chernobyl reactor complex in 


the Ukraine (see [link]). Several radioactive isotopes were released in huge 
quantities, contaminating many thousands of square kilometers and directly 
affecting hundreds of thousands of people. The most significant releases 
were of 131], Sr, 187Cg 289Py, 238U, and 22°U. Estimates are that the 


total amount of radiation released was about 100 million curies. 


Human and Medical Applications 


The Chernobyl reactor. 
More than 100 people 
died soon after its 
meltdown, and there will 
be thousands of deaths 
from radiation-induced 
cancer in the future. 
While the accident was 
due to a series of human 
errors, the cleanup efforts 
were heroic. Most of the 
immediate fatalities were 
firefighters and reactor 
personnel. (credit: Elena 
Filatova) 


Example: 

What Mass of !°’Cs Escaped Chernobyl? 

It is estimated that the Chernobyl disaster released 6.0 MCi of '°’Cs into 
the environment. Calculate the mass of !°’Cs released. 

Strategy 

We can calculate the mass released using Avogadro’s number and the 
concept of a mole if we can first find the number of nuclei N released. 
Since the activity R is given, and the half-life of 4°’Cs is found in 


Appendix B to be 30.2 y, we can use the equation R = or to find NV. 
Solution 
Solving the equation R = oa for N gives 
Equation: 
Rt 
i 
0.693 


Entering the given values yields 
Equation: 


re CE CE 


0.693 


Converting curies to becquerels and years to seconds, we get 
Equation: 


N = (6.0x10°® Ci)(3.7x10"° Bq/Ci) (30.2 y)(3.16 10’ s/y) 
0.693 


= 31x 1076. 


One mole of a nuclide 4“.X has a mass of A grams, so that one mole of 
137Cs has a mass of 137 g. A mole has 6.02 x 107° nuclei. Thus the mass 
of 137Cs released was 

Equation: 


137 
m = (<Htee J (3-1 x 10) = 70 x 103 g 


70 kg. 


Discussion 

While 70 kg of material may not be a very large mass compared to the 
amount of fuel in a power plant, it is extremely radioactive, since it only 
has a 30-year half-life. Six megacuries (6.0 MCi) is an extraordinary 
amount of activity but is only a fraction of what is produced in nuclear 
reactors. Similar amounts of the other isotopes were also released at 
Chernobyl. Although the chances of such a disaster may have seemed 
small, the consequences were extremely severe, requiring greater caution 
than was used. More will be said about safe reactor design in the next 
chapter, but it should be noted that Western reactors have a fundamentally 
safer design. 


Activity R decreases in time, going to half its original value in one half-life, 
then to one-fourth its original value in the next half-life, and so on. Since 


b= rare the activity decreases as the number of radioactive nuclei 


decreases. The equation for R as a function of time is found by combining 


the equations N = Noe~*“ and R = ee yielding 


Equation: 


R= Roe ™, 


where £po is the activity at t = 0. This equation shows exponential decay of 
radioactive nuclei. For example, if a source originally has a 1.00-mCi 
activity, it declines to 0.500 mCi in one half-life, to 0.250 mCi in two half- 
lives, to 0.125 mCi in three half-lives, and so on. For times other than 
whole half-lives, the equation R = Roe“ must be used to find R. 


Note: 

PhET Explorations: Alpha Decay 

Watch alpha particles escape from a polonium nucleus, causing radioactive 
alpha decay. See how random decay times relate to the half life. 


Section Summary 


e Half-life £1/2 is the time in which there is a 50% chance that a nucleus 
will decay. The number of nuclei N as a function of time is 
Equation: 


N=Nye, 


where Vo is the number present at t = O, and J is the decay constant, 
related to the half-life by 
Equation: 


0.693 
Ri 


tio 


¢ One of the applications of radioactive decay is radioactive dating, in 
which the age of a material is determined by the amount of radioactive 
decay that occurs. The rate of decay is called the activity R: 
Equation: 
_ AN 
At S 
e The SI unit for R is the becquerel (Bq), defined by 
Equation: 
1 Bq = 1 decay/s. 


e Fis also expressed in terms of curies (Ci), where 


Equation: 
1 Ci = 3.70 x 101° Bq. 


e The activity R of a source is related to N and fj /2 by 
Equation: 


0.693 
tio 


R 


¢ Since N has an exponential behavior as in the equation N = Noe > 
the activity also has an exponential behavior, given by 
Equation: 


R=R,e™, 


where fo is the activity at t = 0. 


Conceptual Questions 


Exercise: 
Problem: 
Ina3 x 10°-year-old rock that originally contained some 7°3U, which 
has a half-life of 4.5 x 10° years, we expect to find some 725U 
remaining in it. Why are 72°Ra, ???Rn, and ?!°Po also found in such a 


rock, even though they have much shorter half-lives (1600 years, 3.8 
days, and 138 days, respectively)? 


Exercise: 
Problem: 
Does the number of radioactive nuclei in a sample decrease to exactly 


half its original value in one half-life? Explain in terms of the 
Statistical nature of radioactive decay. 


Exercise: 


t 


b 


Problem: 


Radioactivity depends on the nucleus and not the atom or its chemical 
state. Why, then, is one kilogram of uranium more radioactive than one 
kilogram of uranium hexafluoride? 


Exercise: 
Problem: 
Explain how a bound system can have less mass than its components. 
Why is this not observed classically, say for a building made of bricks? 
Exercise: 
Problem: 
Spontaneous radioactive decay occurs only when the decay products 
have less mass than the parent, and it tends to produce a daughter that 
is more stable than the parent. Explain how this is related to the fact 


that more tightly bound nuclei are more stable. (Consider the binding 
energy per nucleon.) 


Exercise: 
Problem: 
To obtain the most precise value of BE from the equation 
BE= |ZM (1H) af Nm,| c= m(4X) c?, we should take into 
account the binding energy of the electrons in the neutral atoms. Will 


doing this produce a larger or smaller value for BE? Why is this effect 
usually negligible? 


Exercise: 


Problem: 


How does the finite range of the nuclear force relate to the fact that 
BE/A is greatest for A near 60? 


Problems & Exercises 


Data from the appendices and the periodic table may be needed for these 
problems. 
Exercise: 


Problem: 
An old campfire is uncovered during an archaeological dig. Its 
charcoal is found to contain less than 1/1000 the normal amount of 


14C, Estimate the minimum age of the charcoal, noting that 
2024, 


Solution: 


57,300 y 
Exercise: 
Problem: 
A ©°Co source is labeled 4.00 mCi, but its present activity is found to 


be 1.85 x 10’ Bq. (a) What is the present activity in mCi? (b) How 
long ago did it actually have a 4.00-mCi activity? 


Exercise: 


Problem: 


(a) Calculate the activity R in curies of 1.00 g of ??°Ra. (b) Discuss 
why your answer is not exactly 1.00 Ci, given that the curie was 
originally supposed to be exactly the activity of a gram of radium. 


Solution: 
(a) 0.988 Ci 


(b) The half-life of 27°Ra is now better known. 


Exercise: 


Problem: 
Show that the activity of the ‘*C in 1.00 g of !*C found in living tissue 
is 0.250 Ba. 

Exercise: 
Problem: 
Mantles for gas lanterns contain thorium, because it forms an oxide 
that can survive being heated to incandescence for long periods of 
time. Natural thorium is almost 100% 78?Th, with a half-life of 


1.405 x 107° y. If an average lantern mantle contains 300 mg of 
thorium, what is its activity? 


Solution: 


1.22 x 10° Bq 
Exercise: 
Problem: 
Cow’s milk produced near nuclear reactors can be tested for as little as 


1.00 pCi of !3"I per liter, to check for possible reactor leakage. What 
mass of !°"J has this activity? 


Exercise: 
Problem: 
(a) Natural potassium contains *°K, which has a half-life of 
1.277 x 10° y. What mass of “°K in a person would have a decay rate 
of 4140 Bq? (b) What is the fraction of *°K in natural potassium, 


given that the person has 140 g in his body? (These numbers are 
typical for a 70-kg adult.) 


Solution: 


(a) 16.0 mg 


(b) 0.0114% 
Exercise: 
Problem: 
There is more than one isotope of natural uranium. If a researcher 


isolates 1.00 mg of the relatively scarce ?*°U and finds this mass to 
have an activity of 80.0 Ba, what is its half-life in years? 


Exercise: 
Problem: 
5°77 has one of the longest known radioactive half-lives. In a difficult 


experiment, a researcher found that the activity of 1.00 kg of *°V is 
1.75 Bq. What is the half-life in years? 


Solution: 


1.48 x 10" y 

Exercise: 
Problem: 
You can sometimes find deep red crystal vases in antique stores, called 
uranium glass because their color was produced by doping the glass 
with uranium. Look up the natural isotopes of uranium and their half- 


lives, and calculate the activity of such a vase assuming it has 2.00 g of 
uranium in it. Neglect the activity of any daughter nuclides. 


Exercise: 


Problem: 


A tree falls in a forest. How many years must pass before the ‘4C 
activity in 1.00 g of the tree’s carbon drops to 1.00 decay per hour? 


Solution: 


5.6 x 104 y 


Exercise: 
Problem: 
What fraction of the “°K that was on Earth when it formed 4.5 x 10° 
years ago is left today? 
Exercise: 
Problem: 
A 5000-Ci ©°Co source used for cancer therapy is considered too weak 


to be useful when its activity falls to 3500 Ci. How long after its 
manufacture does this happen? 


Solution: 


2.71 y 
Exercise: 
Problem: 
Natural uranium is 0.7200% 28°U and 99.27% 28U. What were the 


percentages of 7°°U and 7°°U in natural uranium when Earth formed 
4.5 x 10° years ago? 


Exercise: 


Problem: 
The 8~ particles emitted in the decay of ?H (tritium) interact with 
matter to create light in a glow-in-the-dark exit sign. At the time of 


manufacture, such a sign contains 15.0 Ci of 3H. (a) What is the mass 
of the tritium? (b) What is its activity 5.00 y after manufacture? 


Solution: 
(a) 1.56 mg 


(b) 11.3 Ci 


Exercise: 


Problem: 


World War II aircraft had instruments with glowing radium-painted 
dials (see [link]). The activity of one such instrument was 1.0 x 10° 
Bg when new. (a) What mass of 2?6Ra, was present? (b) After some 
years, the phosphors on the dials deteriorated chemically, but the 
radium did not escape. What is the activity of this instrument 57.0 
years after it was made? 


Exercise: 


Problem: 


(a) The 7!°Po source used in a physics laboratory is labeled as having 
an activity of 1.0 Ci on the date it was prepared. A student measures 
the radioactivity of this source with a Geiger counter and observes 
1500 counts per minute. She notices that the source was prepared 120 
days before her lab. What fraction of the decays is she observing with 
her apparatus? (b) Identify some of the reasons that only a fraction of 
the a@ s emitted are observed by the detector. 


Solution: 
(a) 1,23 x10" 


(b) Only part of the emitted radiation goes in the direction of the 
detector. Only a fraction of that causes a response in the detector. 
Some of the emitted radiation (mostly a particles) is observed within 
the source. Some is absorbed within the source, some is absorbed by 
the detector, and some does not penetrate the detector. 


Exercise: 


Problem: 


Armor-piercing shells with depleted uranium cores are fired by aircraft 
at tanks. (The high density of the uranium makes them effective.) The 
uranium is called depleted because it has had its 22°U removed for 
reactor use and is nearly pure 7°°U. Depleted uranium has been 
erroneously called non-radioactive. To demonstrate that this is wrong: 
(a) Calculate the activity of 60.0 g of pure 7°5U. (b) Calculate the 
activity of 60.0 g of natural uranium, neglecting the 7°4U and all 
daughter nuclides. 


Exercise: 
Problem: 
The ceramic glaze on a red-orange Fiestaware plate is U,O3 and 
contains 50.0 grams of 7°°U , but very little 72°U. (a) What is the 
activity of the plate? (b) Calculate the total energy that will be released 
by the 2°8U decay. (c) If energy is worth 12.0 cents per kW - h, what 


is the monetary value of the energy emitted? (These plates went out of 
production some 30 years ago, but are still available as collectibles.) 


Solution: 
(a) 1.68 x 10° Ci 
(b) 8.65 x 101° J 


(c) $2.9 x 10° 


Exercise: 


Problem: 


Large amounts of depleted uranium (78U) are available as a by- 
product of uranium processing for reactor fuel and weapons. Uranium 
is very dense and makes good counter weights for aircraft. Suppose 
you have a 4000-kg block of 2°8U. (a) Find its activity. (b) How many 
calories per day are generated by thermalization of the decay energy? 
(c) Do you think you could detect this as heat? Explain. 


Exercise: 


Problem: 


The Galileo space probe was launched on its long journey past several 
planets in 1989, with an ultimate goal of Jupiter. Its power source is 
11.0 kg of 7°8Pu, a by-product of nuclear weapons plutonium 
production. Electrical energy is generated thermoelectrically from the 
heat produced when the 5.59-MeV a particles emitted in each decay 
crash to a halt inside the plutonium and its shielding. The half-life of 
238Pu is 87.7 years. (a) What was the original activity of the 7°8Pu in 
becquerel? (b) What power was emitted in kilowatts? (c) What power 
was emitted 12.0 y after launch? You may neglect any extra energy 
from daughter nuclides and any losses from escaping y rays. 


Solution: 
(a) 6.97 x 107° Bq 
(b) 6.24 kW 
(c) 5.67 kW 
Exercise: 
Problem: Construct Your Own Problem 
Consider the generation of electricity by a radioactive isotope in a 


space probe, such as described in [link]. Construct a problem in which 
you calculate the mass of a radioactive isotope you need in order to 


supply power for a long space flight. Among the things to consider are 
the isotope chosen, its half-life and decay energy, the power needs of 
the probe and the length of the flight. 


Exercise: 


Problem: Unreasonable Results 


A nuclear physicist finds 1.0 pg of 7°°U in a piece of uranium ore and 
assumes it is primordial since its half-life is 2.3 x 10’ y. (a) Calculate 
the amount of ??°Uthat would had to have been on Earth when it 
formed 4.5 x 10° y ago for 1.0 pug to be left today. (b) What is 
unreasonable about this result? (c) What assumption is responsible? 


Exercise: 


Problem: Unreasonable Results 


(a) Repeat [link] but include the 0.0055% natural abundance of 2°4U 
with its 2.45 x 10° y half-life. (b) What is unreasonable about this 
result? (c) What assumption is responsible? (d) Where does the 7°4U 
come from if it is not primordial? 


Exercise: 
Problem: Unreasonable Results 


The manufacturer of a smoke alarm decides that the smallest current of 
a radiation he can detect is 1.00 yA. (a) Find the activity in curies of 
an a emitter that produces a 1.00 wA current of a particles. (b) What is 
unreasonable about this result? (c) What assumption is responsible? 


Solution: 
(a) 84.5 Ci 


(b) An extremely large activity, many orders of magnitude greater than 
permitted for home use. 


(c) The assumption of 1.00 pA is unreasonably large. Other methods 
can detect much smaller decay rates. 


Glossary 


becquerel 
SI unit for rate of decay of a radioactive material 


half-life 
the time in which there is a 50% chance that a nucleus will decay 


radioactive dating 
an application of radioactive decay in which the age of a material is 
determined by the amount of radioactivity of a particular type that 
occurs 


decay constant 
quantity that is inversely proportional to the half-life and that is used in 
equation for number of nuclei as a function of time 


carbon-14 dating 
a radioactive dating technique based on the radioactivity of carbon-14 


activity 
the rate of decay for radioactive nuclides 


rate of decay 
the number of radioactive events per unit time 


curie 
the activity of 1g of ??®Ra, equal to 3.70 x 10° Bq 


Introduction to Applications of Nuclear Physics 
class="introduction" 


e Provide examples of various nuclear physics applications. 


Tori Randall, 
Ph.D., curator 
for the 
Department of 
Physical 
Anthropology 
at the San 
Diego Museum 
of Man, 
prepares a 550- 
year-old 
Peruvian child 
mummy fora 
CT scan at 
Naval Medical 
Center San 
Diego. (credit: 
U.S. Navy 
photo by Mass 
Communicatio 
n Specialist 3rd 
Class Samantha 
A. Lewis) 


Applications of nuclear physics have become an integral part of modern 
life. From the bone scan that detects a cancer to the radioiodine treatment 
that cures another, nuclear radiation has diagnostic and therapeutic effects 
on medicine. From the fission power reactor to the hope of controlled 
fusion, nuclear energy is now commonplace and is a part of our plans for 
the future. Yet, the destructive potential of nuclear weapons haunts us, as 
does the possibility of nuclear reactor accidents. Certainly, several 
applications of nuclear physics escape our view, as seen in [link]. Not only 
has nuclear physics revealed secrets of nature, it has an inevitable impact 
based on its applications, as they are intertwined with human values. 
Because of its potential for alleviation of suffering, and its power as an 
ultimate destructor of life, nuclear physics is often viewed with 
ambivalence. But it provides perhaps the best example that applications can 
be good or evil, while knowledge itself is neither. 
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Customs officers inspect vehicles using 
neutron irradiation. Cars and trucks pass 
through portable x-ray machines that reveal 
their contents. (credit: Gerald L. Nino, CBP, 
U.S. Dept. of Homeland Security) 


This image shows two stowaways caught 
illegally entering the United States from 
Canada. (credit: U.S. Customs and Border 
Protection) 


Medical Imaging and Diagnostics 


e Explain the working principle behind an anger camera. 
e Describe the SPECT and PET imaging techniques. 


A host of medical imaging techniques employ nuclear radiation. What 
makes nuclear radiation so useful? First, y radiation can easily penetrate 
tissue; hence, it is a useful probe to monitor conditions inside the body. 
Second, nuclear radiation depends on the nuclide and not on the chemical 
compound it is in, so that a radioactive nuclide can be put into a compound 
designed for specific purposes. The compound is said to be tagged. A 
tagged compound used for medical purposes is called a 
radiopharmaceutical. Radiation detectors external to the body can 
determine the location and concentration of a radiopharmaceutical to yield 
medically useful information. For example, certain drugs are concentrated 
in inflamed regions of the body, and this information can aid diagnosis and 
treatment as seen in [link]. Another application utilizes a 
radiopharmaceutical which the body sends to bone cells, particularly those 
that are most active, to detect cancerous tumors or healing points. Images 
can then be produced of such bone scans. Radioisotopes are also used to 
determine the functioning of body organs, such as blood flow, heart muscle 
activity, and iodine uptake in the thyroid gland. 


A 
radiopharmaceutica 
l is used to produce 
this brain image of 

a patient with 


Alzheimer’s 
disease. Certain 
features are 
computer enhanced. 
(credit: National 
Institutes of Health) 


Medical Application 


[link] lists certain medical diagnostic uses of radiopharmaceuticals, 
including isotopes and activities that are typically administered. Many 
organs can be imaged with a variety of nuclear isotopes replacing a stable 
element by a radioactive isotope. One common diagnostic employs iodine 
to image the thyroid, since iodine is concentrated in that organ. The most 
active thyroid cells, including cancerous cells, concentrate the most iodine 
and, therefore, emit the most radiation. Conversely, hypothyroidism is 
indicated by lack of iodine uptake. Note that there is more than one isotope 
that can be used for several types of scans. Another common nuclear 
diagnostic is the thallium scan for the cardiovascular system, particularly 
used to evaluate blockages in the coronary arteries and examine heart 
activity. The salt TICl can be used, because it acts like NaCl and follows the 
blood. Gallium-67 accumulates where there is rapid cell growth, such as in 
tumors and sites of infection. Hence, it is useful in cancer imaging. Usually, 
the patient receives the injection one day and has a whole body scan 3 or 4 
days later because it can take several days for the gallium to build up. 
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Diagnostic Uses of Radiopharmaceuticals 


Note that [link] lists many diagnostic uses for 9°"Tc, where “m” stands for 
a metastable state of the technetium nucleus. Perhaps 80 percent of all 
radiopharmaceutical procedures employ 9°™Tc because of its many 
advantages. One is that the decay of its metastable state produces a single, 
easily identified 0.142-MeV + ray. Additionally, the radiation dose to the 
patient is limited by the short 6.0-h half-life of 99" Tc. And, although its 
half-life is short, it is easily and continuously produced on site. The basic 
process for production is neutron activation of molybdenum, which quickly 
B decays into °°™Tc. Technetium-99m can be attached to many compounds 
to allow the imaging of the skeleton, heart, lungs, kidneys, etc. 


[link] shows one of the simpler methods of imaging the concentration of 
nuclear activity, employing a device called an Anger camera or gamma 
camera. A piece of lead with holes bored through it collimates y rays 
emerging from the patient, allowing detectors to receive rays from 
specific directions only. The computer analysis of detector signals produces 
an image. One of the disadvantages of this detection method is that there is 
no depth information (i.e., it provides a two-dimensional view of the tumor 
as opposed to a three-dimensional view), because radiation from any 
location under that detector produces a signal. 


Anger camera 


Electronic 
output to 
computer 
for image 
construction 


Lead collimator Photomultiplier 


_ tubes 
Scintillator 


An Anger or gamma camera consists of a 
lead collimator and an array of detectors. 
Gamma rays produce light flashes in the 


scintillators. The light output is converted to 
an electrical signal by the photomultipliers. 
A computer constructs an image from the 
detector output. 


Imaging techniques much like those in x-ray computed tomography (CT) 
scans use nuclear activity in patients to form three-dimensional images. 
[link] shows a patient in a circular array of detectors that may be stationary 
or rotated, with detector output used by a computer to construct a detailed 
image. This technique is called single-photon-emission computed 
tomography(SPECT) or sometimes simply SPET. The spatial resolution of 
this technique is poor, about 1 cm, but the contrast (i.e. the difference in 
visual properties that makes an object distinguishable from other objects 
and the background) is good. 


SPECT uses a geometry 
similar to a CT scanner to 
form an image of the 
concentration of a 
radiopharmaceutical 
compound. (credit: 
Woldo, Wikimedia 
Commons) 


Images produced by 8* emitters have become important in recent years. 
When the emitted positron ( 8*) encounters an electron, mutual 
annihilation occurs, producing two y rays. These ¥ rays have identical 
0.511-MeV energies (the energy comes from the destruction of an electron 
or positron mass) and they move directly away from one another, allowing 
detectors to determine their point of origin accurately, as shown in [link]. 
The system is called positron emission tomography (PET). It requires 
detectors on opposite sides to simultaneously (i.e., at the same time) detect 
photons of 0.511-MeV energy and utilizes computer imaging techniques 
similar to those in SPECT and CT scans. Examples of 8* -emitting 
isotopes used in PET are NC, 8N, 150, and !8F, as seen in [link]. This list 
includes C, N, and O, and so they have the advantage of being able to 
function as tags for natural body compounds. Its resolution of 0.5 cm is 
better than that of SPECT; the accuracy and sensitivity of PET scans make 
them useful for examining the brain’s anatomy and function. The brain’s 
use of oxygen and water can be monitored with °O. PET is used 
extensively for diagnosing brain disorders. It can note decreased 
metabolism in certain regions prior to a confirmation of Alzheimer’s 
disease. PET can locate regions in the brain that become active when a 
person carries out specific activities, such as speaking, closing their eyes, 
and so on. 


e+e 


A PET system takes 


advantage of the two 
identical y-ray 
photons produced by 
positron-electron 
annihilation. These 
rays are emitted in 
opposite directions, so 
that the line along 
which each pair is 
emitted is determined. 
Various events 
detected by several 
pairs of detectors are 
then analyzed by the 
computer to form an 
accurate image. 


Note: 
PhET Explorations: Simplified MRI 

Is it a tumor? Magnetic Resonance Imaging (MRI) can tell. Your head is 
full of tiny radio transmitters (the nuclear spins of the hydrogen nuclei of 
your water molecules). In an MRI unit, these little radios can be made to 
broadcast their positions, giving a detailed picture of the inside of your 
head. 


Simplifie 
d MRI 


Section Summary 


e Radiopharmaceuticals are compounds that are used for medical 
imaging and therapeutics. 

e The process of attaching a radioactive substance is called tagging. 

e [link] lists certain diagnostic uses of radiopharmaceuticals including 
the isotope and activity typically used in diagnostics. 

e One common imaging device is the Anger camera, which consists of a 
lead collimator, radiation detectors, and an analysis computer. 

¢ Tomography performed with y-emitting radiopharmaceuticals is called 
SPECT and has the advantages of x-ray CT scans coupled with organ- 
and function-specific drugs. 

e PET is a similar technique that uses G+ emitters and detects the two 
annihilation y rays, which aid to localize the source. 


Conceptual Questions 


Exercise: 
Problem: 
In terms of radiation dose, what is the major difference between 
medical diagnostic uses of radiation and medical therapeutic uses? 
Exercise: 
Problem: 
One of the methods used to limit radiation dose to the patient in 


medical imaging is to employ isotopes with short half-lives. How 
would this limit the dose? 


Problems & Exercises 


Exercise: 


Problem: 


A neutron generator uses an @ source, such as radium, to bombard 
beryllium, inducing the reaction “He + 9Be > C+ n. Such 
neutron sources are called RaBe sources, or PuBe sources if they use 
plutonium to get the a s. Calculate the energy output of the reaction in 
MeV. 


Solution: 


5.701 MeV 

Exercise: 
Problem: 
Neutrons from a source (perhaps the one discussed in the preceding 
problem) bombard natural molybdenum, which is 24 percent °°Mo. 
What is the energy output of the reaction Mo + n > 9Mo+ y? 


The mass of °°Mo is given in Appendix A: Atomic Masses, and that of 
°°Mo is 98.907711 u. 


Exercise: 
Problem: 
The purpose of producing °?Mo (usually by neutron activation of 
natural molybdenum, as in the preceding problem) is to produce 
99mT'c, Using the rules, verify that the 8 decay of 9°Mo produces 


99mT'c, (Most 9°™Tc nuclei produced in this decay are left in a 
metastable excited state denoted 9°™Tc.) 


Solution: 


99 99 — 
49Mos7 =] 13 C56 +p = Ug 


Exercise: 


Problem: 


(a) Two annihilation y rays in a PET scan originate at the same point 
and travel to detectors on either side of the patient. If the point of 
origin is 9.00 cm closer to one of the detectors, what is the difference 
in arrival times of the photons? (This could be used to give position 
information, but the time difference is small enough to make it 
difficult.) 


(b) How accurately would you need to be able to measure arrival time 
differences to get a position resolution of 1.00 mm? 

Exercise: 
Problem: 


[link] indicates that 7.50 mCi of 9°™Tc is used in a brain scan. What is 
the mass of technetium? 


Solution: 


1.43 x 107° g 
Exercise: 


Problem: 


The activities of !8!I and !?°1 used in thyroid scans are given in [link] 
to be 50 and 70 Ci, respectively. Find and compare the masses of !°!I 
and !81 in such scans, given their respective half-lives are 8.04 d and 
13.2 h. The masses are so small that the radioiodine is usually mixed 
with stable iodine as a carrier to ensure normal chemistry and 
distribution in the body. 


Exercise: 


Problem: 


(a) Neutron activation of sodium, which is 100%*Na, produces 24Na, 
which is used in some heart scans, as seen in [link]. The equation for 
the reaction is *7Na + n — 74Na + 4. Find its energy output, given 
the mass of 24Na is 23.990962 u. 


(b) What mass of 24Na produces the needed 5.0-mCi activity, given its 
half-life is 15.0 h? 


Solution: 
(a) 6.958 MeV 


(b)5.7x 10° g 


Glossary 


Anger camera 
a common medical imaging device that uses a scintillator connected to 
a series of photomultipliers 


gamma camera 
another name for an Anger camera 


positron emission tomography (PET) 
tomography technique that uses 8* emitters and detects the two 
annihilation 7 rays, aiding in source localization 


radiopharmaceutical 
compound used for medical imaging 


single-photon-emission computed tomography (SPECT) 
tomography performed with y-emitting radiopharmaceuticals 


tagged 
process of attaching a radioactive substance to a chemical compound 


Biological Effects of Ionizing Radiation 


e Define various units of radiation. 
e Describe RBE. 


We hear many seemingly contradictory things about the biological effects of ionizing 
radiation. It can cause cancer, burns, and hair loss, yet it is used to treat and even cure 
cancer. How do we understand these effects? Once again, there is an underlying simplicity 
in nature, even in complicated biological organisms. All the effects of ionizing radiation 
on biological tissue can be understood by knowing that ionizing radiation affects 
molecules within cells, particularly DNA molecules. 


Let us take a brief look at molecules within cells and how cells operate. Cells have long, 
double-helical DNA molecules containing chemical codes called genetic codes that 
govern the function and processes undertaken by the cell. It is for unraveling the double- 
helical structure of DNA that James Watson, Francis Crick, and Maurice Wilkins received 
the Nobel Prize. Damage to DNA consists of breaks in chemical bonds or other changes in 
the structural features of the DNA chain, leading to changes in the genetic code. In human 
cells, we can have as many as a million individual instances of damage to DNA per cell 
per day. It is remarkable that DNA contains codes that check whether the DNA is 
damaged or can repair itself. It is like an auto check and repair mechanism. This repair 
ability of DNA is vital for maintaining the integrity of the genetic code and for the normal 
functioning of the entire organism. It should be constantly active and needs to respond 
rapidly. The rate of DNA repair depends on various factors such as the cell type and age 
of the cell. A cell with a damaged ability to repair DNA, which could have been induced 
by ionizing radiation, can do one of the following: 


e The cell can go into an irreversible state of dormancy, known as senescence. 
e The cell can commit suicide, known as programmed cell death. 
e The cell can go into unregulated cell division leading to tumors and cancers. 


Since ionizing radiation damages the DNA, which is critical in cell reproduction, it has its 
greatest effect on cells that rapidly reproduce, including most types of cancer. Thus, 
cancer cells are more sensitive to radiation than normal cells and can be killed by it easily. 
Cancer is characterized by a malfunction of cell reproduction, and can also be caused by 
ionizing radiation. Without contradiction, ionizing radiation can be both a cure anda 
cause. 


To discuss quantitatively the biological effects of ionizing radiation, we need a radiation 
dose unit that is directly related to those effects. All effects of radiation are assumed to be 
directly proportional to the amount of ionization produced in the biological organism. The 
amount of ionization is in turn proportional to the amount of deposited energy. Therefore, 
we define a radiation dose unit called the rad, as 1/100 of a joule of ionizing energy 
deposited per kilogram of tissue, which is 

Equation: 


lrad = 0.01 J/kg. 


For example, if a 50.0-kg person is exposed to ionizing radiation over her entire body and 
she absorbs 1.00 J, then her whole-body radiation dose is 
Equation: 


(1.00 J)/(50.0 kg) = 0.0200 J/kg = 2.00 rad. 


If the same 1.00 J of ionizing energy were absorbed in her 2.00-kg forearm alone, then the 
dose to the forearm would be 
Equation: 


(1.00 J) /(2.00 kg) = 0.500 J/kg = 50.0 rad, 


and the unaffected tissue would have a zero rad dose. While calculating radiation doses, 
you divide the energy absorbed by the mass of affected tissue. You must specify the 
affected region, such as the whole body or forearm in addition to giving the numerical 
dose in rads. The SI unit for radiation dose is the gray (Gy), which is defined to be 
Equation: 


1 Gy = 1 J/kg = 100 rad. 


However, the rad is still commonly used. Although the energy per kilogram in 1 rad is 
small, it has significant effects since the energy causes ionization. The energy needed for a 
single ionization is a few eV, or less than 10~!8 J. Thus, 0.01 J of ionizing energy can 
create a huge number of ion pairs and have an effect at the cellular level. 


The effects of ionizing radiation may be directly proportional to the dose in rads, but they 
also depend on the type of radiation and the type of tissue. That is, for a given dose in 
rads, the effects depend on whether the radiation is a, 8, y, x-ray, or some other type of 
ionizing radiation. In the earlier discussion of the range of ionizing radiation, it was noted 
that energy is deposited in a series of ionizations and not in a single interaction. Each ion 
pair or ionization requires a certain amount of energy, so that the number of ion pairs is 
directly proportional to the amount of the deposited ionizing energy. But, if the range of 
the radiation is small, as it is for a s, then the ionization and the damage created is more 
concentrated and harder for the organism to repair, as seen in [link]. Concentrated damage 
is more difficult for biological organisms to repair than damage that is spread out, so 
short-range particles have greater biological effects. The relative biological effectiveness 
(RBE) or quality factor (QF) is given in [link] for several types of ionizing radiation— 
the effect of the radiation is directly proportional to the RBE. A dose unit more closely 
related to effects in biological tissue is called the roentgen equivalent man or rem and is 
defined to be the dose in rads multiplied by the relative biological effectiveness. 


Equation: 


rem = rad x RBE 
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The image shows ionization created in cells by a and y 
radiation. Because of its shorter range, the ionization and 
damage created by a is more concentrated and harder for the 
organism to repair. Thus, the RBE for as is greater than the 
RBE for ys, even though they create the same amount of 
ionization at the same energy. 


So, if a person had a whole-body dose of 2.00 rad of y radiation, the dose in rem would be 
(2.00 rad)(1) = 2.00 rem whole body. If the person had a whole-body dose of 2.00 rad 
of a radiation, then the dose in rem would be (2.00 rad)(20) = 40.0 rem whole body. 
The a s would have 20 times the effect on the person than the y s for the same deposited 
energy. The SI equivalent of the rem is the sievert (Sv), defined to be Sv = Gy x RBE, 
so that 

Equation: 


1 Sv = 1 Gy x RBE = 100 rem. 


The RBEs given in [link] are approximate, but they yield certain insights. For example, 
the eyes are more sensitive to radiation, because the cells of the lens do not repair 
themselves. Neutrons cause more damage than ¥ rays, although both are neutral and have 
large ranges, because neutrons often cause secondary radiation when they are captured. 
Note that the RBEs are 1 for higher-energy { s, y s, and x-rays, three of the most common 
types of radiation. For those types of radiation, the numerical values of the dose in rem 
and rad are identical. For example, 1 rad of y radiation is also 1 rem. For that reason, rads 
are still widely quoted rather than rem. [link] summarizes the units that are used for 
radiation. 


Note: 


Misconception Alert: Activity vs. Dose 


“Activity” refers to the radioactive source while “dose” refers to the amount of energy 
from the radiation that is deposited in a person or object. 


A high level of activity doesn’t mean much if a person is far away from the source. The 
activity R of a source depends upon the quantity of material (kg) as well as the half-life. A 


short half-life will produce many more disintegrations per second. Recall that R = 


. Also, the activity decreases exponentially, which is seen in the equation R = Roe 


Type and energy of radiation 
X-rays 

y rays 

B rays greater than 32 keV 
Brays less than 32 keV 


Neutrons, thermal to slow (<20 
keV) 


Neutrons, fast (1-10 MeV) 
Protons (1-10 MeV) 

a rays from radioactive decay 
Heavy ions from accelerators 


Relative Biological Effectiveness 


RBE[footnote] 

Values approximate, difficult to 
determine. 

1 

1 

1 


1.7 


2-5 


10 (body), 32 (eyes) 
10 (body), 32 (eyes) 
10-20 


10-20 


0.693.N 
t1/2 
—At 


SI unit Former 
Quantity name Definition unit Conversion 
Activity oa decay/sec a 1 Bq = 2.7 x 10 "Gi 
etna man 1 J/kg rad Gy = 100 rad 
= aed a a ° a ov Een 


Units for Radiation 


The large-scale effects of radiation on humans can be divided into two categories: 
immediate effects and long-term effects. [link] gives the immediate effects of whole-body 
exposures received in less than one day. If the radiation exposure is spread out over more 
time, greater doses are needed to cause the effects listed. This is due to the body’s ability 
to partially repair the damage. Any dose less than 100 mSv (10 rem) is called a low dose, 
0.1 Sv to 1 Sv (10 to 100 rem) is called a moderate dose, and anything greater than 1 Sv 
(100 rem) is called a high dose. There is no known way to determine after the fact if a 
person has been exposed to less than 10 mSv. 


Dose in Sv [footnote] 
Multiply by 100 to 


obtain dose in rem. Effect 


0-0.10 No observable effect. 

0.1-1 Slight to moderate decrease in white blood cell counts. 

0.5 Temporary sterility; 0.35 for women, 0.50 for men. 

12 Significant reduction in blood cell counts, brief nausea 
and vomiting. Rarely fatal. 

95 Nausea, vomiting, hair loss, severe blood damage, 


hemorrhage, fatalities. 


Dose in Sv [footnote] 


Multiply by 100 to 
obtain dose in rem. Effect 
AS LD50/32. Lethal to 50% of the population within 32 
; days after exposure if not treated. 
Worst effects due to malfunction of small intestine and 
5 — 20 bee ; 
blood systems. Limited survival. 
$90 Fatal within hours due to collapse of central nervous 


system. 


Immediate Effects of Radiation (Adults, Whole Body, Single Exposure) 


Immediate effects are explained by the effects of radiation on cells and the sensitivity of 
rapidly reproducing cells to radiation. The first clue that a person has been exposed to 
radiation is a change in blood count, which is not surprising since blood cells are the most 
rapidly reproducing cells in the body. At higher doses, nausea and hair loss are observed, 
which may be due to interference with cell reproduction. Cells in the lining of the 
digestive system also rapidly reproduce, and their destruction causes nausea. When the 
growth of hair cells slows, the hair follicles become thin and break off. High doses cause 
significant cell death in all systems, but the lowest doses that cause fatalities do so by 
weakening the immune system through the loss of white blood cells. 


The two known long-term effects of radiation are cancer and genetic defects. Both are 
directly attributable to the interference of radiation with cell reproduction. For high doses 
of radiation, the risk of cancer is reasonably well known from studies of exposed groups. 
Hiroshima and Nagasaki survivors and a smaller number of people exposed by their 
occupation, such as radium dial painters, have been fully documented. Chernobyl victims 
will be studied for many decades, with some data already available. For example, a 
significant increase in childhood thyroid cancer has been observed. The risk of a 
radiation-induced cancer for low and moderate doses is generally assumed to be 
proportional to the risk known for high doses. Under this assumption, any dose of 
radiation, no matter how small, involves a risk to human health. This is called the linear 
hypothesis and it may be prudent, but it is controversial. There is some evidence that, 
unlike the immediate effects of radiation, the long-term effects are cumulative and there is 
little self-repair. This is analogous to the risk of skin cancer from UV exposure, which is 
known to be cumulative. 


There is a latency period for the onset of radiation-induced cancer of about 2 years for 
leukemia and 15 years for most other forms. The person is at risk for at least 30 years after 
the latency period. Omitting many details, the overall risk of a radiation-induced cancer 


death per year per rem of exposure is about 10 in a million, which can be written as 
10/10° rem - y. 


If a person receives a dose of 1 rem, his risk each year of dying from radiation-induced 
cancer is 10 in a million and that risk continues for about 30 years. The lifetime risk is 
thus 300 in a million, or 0.03 percent. Since about 20 percent of all worldwide deaths are 
from cancer, the increase due to a 1 rem exposure is impossible to detect demographically. 
But 100 rem (1 Sv), which was the dose received by the average Hiroshima and Nagasaki 
survivor, Causes a 3 percent risk, which can be observed in the presence of a 20 percent 
normal or natural incidence rate. 


The incidence of genetic defects induced by radiation is about one-third that of cancer 
deaths, but is much more poorly known. The lifetime risk of a genetic defect due to a 1 
rem exposure is about 100 in a million or 3.3/10° rem - y, but the normal incidence is 
60,000 in a million. Evidence of such a small increase, tragic as it is, is nearly impossible 
to obtain. For example, there is no evidence of increased genetic defects among the 
offspring of Hiroshima and Nagasaki survivors. Animal studies do not seem to correlate 
well with effects on humans and are not very helpful. For both cancer and genetic defects, 
the approach to safety has been to use the linear hypothesis, which is likely to be an 
overestimate of the risks of low doses. Certain researchers even claim that low doses are 
beneficial. Hormesis is a term used to describe generally favorable biological responses 
to low exposures of toxins or radiation. Such low levels may help certain repair 
mechanisms to develop or enable cells to adapt to the effects of the low exposures. 
Positive effects may occur at low doses that could be a problem at high doses. 


Even the linear hypothesis estimates of the risks are relatively small, and the average 
person is not exposed to large amounts of radiation. [link] lists average annual background 
radiation doses from natural and artificial sources for Australia, the United States, 
Germany, and world-wide averages. Cosmic rays are partially shielded by the atmosphere, 
and the dose depends upon altitude and latitude, but the average is about 0.40 mSv/y. A 
good example of the variation of cosmic radiation dose with altitude comes from the 
airline industry. Monitored personnel show an average of 2 mSv/y. A 12-hour flight might 
give you an exposure of 0.02 to 0.03 mSv. 


Doses from the Earth itself are mainly due to the isotopes of uranium, thorium, and 
potassium, and vary greatly by location. Some places have great natural concentrations of 
uranium and thorium, yielding doses ten times as high as the average value. Internal doses 
come from foods and liquids that we ingest. Fertilizers containing phosphates have 
potassium and uranium. So we are all a little radioactive. Carbon-14 has about 66 Bq/kg 
radioactivity whereas fertilizers may have more than 3000 Bq/kg radioactivity. Medical 
and dental diagnostic exposures are mostly from x-rays. It should be noted that x-ray 
doses tend to be localized and are becoming much smaller with improved techniques. 
[link] shows typical doses received during various diagnostic x-ray examinations. Note 
the large dose from a CT scan. While CT scans only account for less than 20 percent of 


the x-ray procedures done today, they account for about 50 percent of the annual dose 
received. 


Radon is usually more pronounced underground and in buildings with low air exchange 
with the outside world. Almost all soil contains some ??6Ra and 222Rn, but radon is lower 
in mainly sedimentary soils and higher in granite soils. Thus, the exposure to the public 
can vary greatly, even within short distances. Radon can diffuse from the soil into homes, 
especially basements. The estimated exposure for 2”*Rn is controversial. Recent studies 
indicate there is more radon in homes than had been realized, and it is speculated that 
radon may be responsible for 20 percent of lung cancers, being particularly hazardous to 
those who also smoke. Many countries have introduced limits on allowable radon 
concentrations in indoor air, often requiring the measurement of radon concentrations in a 
house prior to its sale. Ironically, it could be argued that the higher levels of radon 
exposure and their geographic variability, taken with the lack of demographic evidence of 
any effects, means that low-level radiation is less dangerous than previously thought. 


Radiation Protection 


Laws regulate radiation doses to which people can be exposed. The greatest occupational 
whole-body dose that is allowed depends upon the country and is about 20 to 50 mSv/y 
and is rarely reached by medical and nuclear power workers. Higher doses are allowed for 
the hands. Much lower doses are permitted for the reproductive organs and the fetuses of 
pregnant women. Inadvertent doses to the public are limited to 1/10 of occupational 
doses, except for those caused by nuclear power, which cannot legally expose the public 
to more than 1/1000 of the occupational limit or 0.05 mSv/y (5 mrem/y). This has been 
exceeded in the United States only at the time of the Three Mile Island (TMI) accident in 
1979. Chernoby] is another story. Extensive monitoring with a variety of radiation 
detectors is performed to assure radiation safety. Increased ventilation in uranium mines 
has lowered the dose there to about 1 mSv/y. 


Dose (mSv/y)|footnote] 


Source Multiply by 100 to obtain dose in mrem/y. 
Source Australia Germany Umee World 
States 


Natural Radiation - 
external 


Dose (mSv/y)[footnote] 


Source Multiply by 100 to obtain dose in mrem/y. 
Cosmic Rays 0.30 0.28 0.30 0.39 
Soil, building materials 0.40 0.40 0.30 0.48 
Radon gas 0.90 1.1 2.0 1.2 
Natural Radiation - 
internal 

HK, 2 Ra 0.24 0.28 0.40 0.29 
Medical & Dental 0.80 0.90 0.53 0.40 
TOTAL 2.6 3.0 25 2.8 


Background Radiation Sources and Average Doses 


To physically limit radiation doses, we use shielding, increase the distance from a source, 
and limit the time of exposure. 


[link] illustrates how these are used to protect both the patient and the dental technician 
when an x-ray is taken. Shielding absorbs radiation and can be provided by any material, 
including sufficient air. The greater the distance from the source, the more the radiation 
spreads out. The less time a person is exposed to a given source, the smaller is the dose 
received by the person. Doses from most medical diagnostics have decreased in recent 
years due to faster films that require less exposure time. 


A lead apron is placed 
over the dental patient 
and shielding surrounds 
the x-ray tube to limit 
exposure to tissue other 
than the tissue that is 
being imaged. Fast films 
limit the time needed to 
obtain images, reducing 
exposure to the imaged 
tissue. The technician 
stands a few meters away 
behind a lead-lined door 
with a lead glass window, 
reducing her occupational 
exposure. 


Procedure 
Chest 

Dental 

Skull 

Leg 
Mammogram 
Barium enema 
Upper GI 

CT head 


CT abdomen 


Effective dose (mSv) 


0.40 


10.0 


Typical Doses Received During Diagnostic X-ray Exams 


Problem-Solving Strategy 
You need to follow certain steps for dose calculations, which are 
Step 1. Examine the situation to determine that a person is exposed to ionizing radiation. 


Step 2. Identify exactly what needs to be determined in the problem (identify the 
unknowns). The most straightforward problems ask for a dose calculation. 


Step 3. Make a list of what is given or can be inferred from the problem as stated (identify 
the knowns). Look for information on the type of radiation, the energy per event, the 
activity, and the mass of tissue affected. 


Step 4. For dose calculations, you need to determine the energy deposited. This may take 
one or more steps, depending on the given information. 


Step 5. Divide the deposited energy by the mass of the affected tissue. Use units of joules 
for energy and kilograms for mass. If a dose in Sv is involved, use the definition that 
1Sv =1 J/kg. 


Step 6. If a dose in mSv is involved, determine the RBE (QF) of the radiation. Recall that 
1 mSv = 1 mGy x RBE (or 1 rem = 1 rad x RBE). 


Step 7. Check the answer to see if it is reasonable: Does it make sense? The dose should 
be consistent with the numbers given in the text for diagnostic, occupational, and 
therapeutic exposures. 


Example: 

Dose from Inhaled Plutonium 

Calculate the dose in rem/y for the lungs of a weapons plant employee who inhales and 
retains an activity of 1.00 Ci of 7°°Pu in an accident. The mass of affected lung tissue 
is 2.00 kg, the plutonium decays by emission of a 5.23-MeV a particle, and you may 
assume the higher value of the RBE for a s from [link]. 

Strategy 

Dose in rem is defined by 1 rad = 0.01 J/kg and rem = rad x RBE. The energy 
deposited is divided by the mass of tissue affected and then multiplied by the RBE. The 
latter two quantities are given, and so the main task in this example will be to find the 
energy deposited in one year. Since the activity of the source is given, we can calculate 
the number of decays, multiply by the energy per decay, and convert MeV to joules to get 
the total energy. 

Solution 


The activity R = 1.00 pCi = 3.70 x 104 Bq = 3.70 x 10* decays/s. So, the number of 
decays per year is obtained by multiplying by the number of seconds in a year: 
Equation: 


(3.70 x 104 decays/s) (3.16 x 10’ s) = 1.17 x 10" decays. 


Thus, the ionizing energy deposited per year is 


Equation: 
ip 1.60 x 10° 8 J 
E = (1.17 x 10 decays) (5.23 MeV/decay) x career (PT ae Nae 0.978 J. 
Dividing by the mass of the affected tissue gives 
Equation: 
E 978 J 
= gl = 0.489 J/kg. 

mass 2.00 kg 
One Gray is 1.00 J/kg, and so the dose in Gy is 
Equation: 

0.489 J/k 
fiesenn ey = [ke _ 9.489 Gy. 


1.00 (J/kg) /Gy 


Now, the dose in Sv is 


Equation: 
dose in Sv = Gy x RBE 
Equation: 
= (0.489 Gy) (20) = 9.8 Sv. 
Discussion 


First note that the dose is given to two digits, because the RBE is (at best) known only to 
two digits. By any standard, this yearly radiation dose is high and will have a devastating 
effect on the health of the worker. Worse yet, plutonium has a long radioactive half-life 
and is not readily eliminated by the body, and so it will remain in the lungs. Being an a 
emitter makes the effects 10 to 20 times worse than the same ionization produced by £ s, 
7 rays, or x-rays. An activity of 1.00 pCi is created by only 16 pg of 7°°Pu (left as an 
end-of-chapter problem to verify), partly justifying claims that plutonium is the most 
toxic substance known. Its actual hazard depends on how likely it is to be spread out 
among a large population and then ingested. The Chernoby] disaster’s deadly legacy, for 
example, has nothing to do with the plutonium it put into the environment. 


Risk versus Benefit 


Medical doses of radiation are also limited. Diagnostic doses are generally low and have 


further lowered with improved techniques and faster films. With the possible exception of 


routine dental x-rays, radiation is used diagnostically only when needed so that the low 


risk is justified by the benefit of the diagnosis. Chest x-rays give the lowest doses—about 


0.1 mSv to the tissue affected, with less than 5 percent scattering into tissues that are not 


directly imaged. Other x-ray procedures range upward to about 10 mSv ina CT scan, and 


about 5 mSv (0.5 rem) per dental x-ray, again both only affecting the tissue imaged. 
Medical images with radiopharmaceuticals give doses ranging from 1 to 5 mSv, usually 


localized. One exception is the thyroid scan using ‘*4I. Because of its relatively long half- 


life, it exposes the thyroid to about 0.75 Sv. The isotope !”°I is more difficult to produce, 
but its short half-life limits thyroid exposure to about 15 mSv. 


Note: 

PhET Explorations: Alpha Decay 
Watch alpha particles escape from a polonium nucleus, causing radioactive alpha decay. 
See how random decay times relate to the half life. 


Alpha 


Deca 


ME 


Section Summary 


The biological effects of ionizing radiation are due to two effects it has on cells: 
interference with cell reproduction, and destruction of cell function. 

A radiation dose unit called the rad is defined in terms of the ionizing energy 
deposited per kilogram of tissue: 

Equation: 


1 rad = 0.01 J/kg. 


The SI unit for radiation dose is the gray (Gy), which is defined to be 

1 Gy = 1 J/kg = 100 rad. 

To account for the effect of the type of particle creating the ionization, we use the 
relative biological effectiveness (RBE) or quality factor (QF) given in [link] and 
define a unit called the roentgen equivalent man (rem) as 


Equation: 
rem = rad x RBE. 


e Particles that have short ranges or create large ionization densities have RBEs greater 
than unity. The SI equivalent of the rem is the sievert (Sv), defined to be 
Equation: 


Sv = Gy x RBE and 1 Sv = 100 rem. 


¢ Whole-body, single-exposure doses of 0.1 Sv or less are low doses while those of 0.1 
to 1 Sv are moderate, and those over 1 Sv are high doses. Some immediate radiation 
effects are given in [link]. Effects due to low doses are not observed, but their risk is 
assumed to be directly proportional to those of high doses, an assumption known as 
the linear hypothesis. Long-term effects are cancer deaths at the rate of 
10/10° rem-yand genetic defects at roughly one-third this rate. Background 
radiation doses and sources are given in [link]. World-wide average radiation 
exposure from natural sources, including radon, is about 3 mSv, or 300 mrem. 
Radiation protection utilizes shielding, distance, and time to limit exposure. 


Conceptual Questions 


Exercise: 
Problem: 
Isotopes that emit a radiation are relatively safe outside the body and exceptionally 


hazardous inside. Yet those that emit y radiation are hazardous outside and inside. 
Explain why. 


Exercise: 
Problem: 
Why is radon more closely associated with inducing lung cancer than other types of 
cancer? 

Exercise: 
Problem: 
The RBE for low-energy (s is 1.7, whereas that for higher-energy (s is only 1. 
Explain why, considering how the range of radiation depends on its energy. 


Exercise: 


Problem: 


Which methods of radiation protection were used in the device shown in the first 
photo in [link]? Which were used in the situation shown in the second photo? 


(a) This x-ray 
fluorescence machine is 
one of the thousands used 
in shoe stores to produce 
images of feet as a check 
on the fit of shoes. They 
are unshielded and 
remain on as long as the 
feet are in them, 
producing doses much 
greater than medical 
images. Children were 
fascinated with them. 
These machines were 
used in shoe stores until 
laws preventing such 
unwarranted radiation 
exposure were enacted in 
the 1950s. (credit: 
Andrew Kuchling ) (b) 
Now that we know the 
effects of exposure to 


radioactive material, 
safety is a priority. 
(credit: U.S. Navy) 


Exercise: 
Problem: 
What radioisotope could be a problem in homes built of cinder blocks made from 


uranium mine tailings? (This is true of homes and schools in certain regions near 
uranium mines.) 


Exercise: 


Problem: 


Are some types of cancer more sensitive to radiation than others? If so, what makes 
them more sensitive? 


Exercise: 


Problem: 
Suppose a person swallows some radioactive material by accident. What information 
is needed to be able to assess possible damage? 

Problems & Exercises 


Exercise: 


Problem: 


What is the dose in mSv for: (a) a 0.1 Gy x-ray? (b) 2.5 mGy of neutron exposure to 
the eye? (c) 1.5 mGy of a exposure? 


Solution: 
(a) 100 mSv 
(b) 80 mSv 


(c) ~30 mSv 


Exercise: 


Problem: 


Find the radiation dose in Gy for: (a) A 10-mSv fluoroscopic x-ray series. (b) 50 mSv 
of skin exposure by an a emitter. (c) 160 mSv of B and ¥ rays from the #°K in your 
body. 


Exercise: 
Problem: 


How many Gy of exposure is needed to give a cancerous tumor a dose of 40 Sv if it 
is exposed to a@ activity? 


Solution: 


a2GY 
Exercise: 
Problem: 
What is the dose in Sv in a cancer treatment that exposes the patient to 200 Gy of + 
rays? 
Exercise: 
Problem: 
One half the y rays from °°" Tc are absorbed by a 0.170-mm-thick lead shielding. 


Half of the + rays that pass through the first layer of lead are absorbed in a second 


layer of equal thickness. What thickness of lead will absorb all but one in 1000 of 
these -y rays? 


Solution: 


1.69 mm 
Exercise: 
Problem: 
A plumber at a nuclear power plant receives a whole-body dose of 30 mSv in 15 
minutes while repairing a crucial valve. Find the radiation-induced yearly risk of 


death from cancer and the chance of genetic defect from this maximum allowable 
exposure. 


Exercise: 


Problem: 


In the 1980s, the term picowave was used to describe food irradiation in order to 
overcome public resistance by playing on the well-known safety of microwave 
radiation. Find the energy in MeV of a photon having a wavelength of a picometer. 


Solution: 
1.24 MeV 


Exercise: 


Problem: Find the mass of 7°°Pu that has an activity of 1.00 pCi. 


Glossary 


gray (Gy) 
the SI unit for radiation dose which is defined to be 1 Gy = 1 J/kg = 100 rad 


linear hypothesis 
assumption that risk is directly proportional to risk from high doses 


rad 
the ionizing energy deposited per kilogram of tissue 


sievert 
the SI equivalent of the rem 


relative biological effectiveness (RBE) 
a number that expresses the relative amount of damage that a fixed amount of 
ionizing radiation of a given type can inflict on biological tissues 


quality factor 
same as relative biological effectiveness 


roentgen equivalent man (rem) 
a dose unit more closely related to effects in biological tissue 


low dose 
a dose less than 100 mSv (10 rem) 


moderate dose 
a dose from 0.1 Sv to 1 Sv (10 to 100 rem) 


high dose 
a dose greater than 1 Sv (100 rem) 


hormesis 
a term used to describe generally favorable biological responses to low exposures of 
toxins or radiation 


shielding 
a technique to limit radiation exposure 


Therapeutic Uses of Ionizing Radiation 


e Explain the concept of radiotherapy and list typical doses for cancer 
therapy. 


Therapeutic applications of ionizing radiation, called radiation therapy or 
radiotherapy, have existed since the discovery of x-rays and nuclear 
radioactivity. Today, radiotherapy is used almost exclusively for cancer 
therapy, where it saves thousands of lives and improves the quality of life 
and longevity of many it cannot save. Radiotherapy may be used alone or in 
combination with surgery and chemotherapy (drug treatment) depending on 
the type of cancer and the response of the patient. A careful examination of 
all available data has established that radiotherapy’s beneficial effects far 
outweigh its long-term risks. 


Medical Application 


The earliest uses of ionizing radiation on humans were mostly harmful, 
with many at the level of snake oil as seen in [link]. Radium-doped 
cosmetics that glowed in the dark were used around the time of World War 
I. As recently as the 1950s, radon mine tours were promoted as healthful 
and rejuvenating—those who toured were exposed but gained no benefits. 
Radium salts were sold as health elixirs for many years. The gruesome 
death of a wealthy industrialist, who became psychologically addicted to 
the brew, alerted the unsuspecting to the dangers of radium salt elixirs. 
Most abuses finally ended after the legislation in the 1950s. 
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is radium, combined in exactly the proper manner with 
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Shines in the Dark 


The properties of radiation 
were once touted for far 
more than its modern use in 
cancer therapy. Until 1932, 
radium was advertised for a 
variety of uses, often with 
tragic results. (credit: 
Struthious Bandersnatch.) 


Radiotherapy is effective against cancer because cancer cells reproduce 
rapidly and, consequently, are more sensitive to radiation. The central 
problem in radiotherapy is to make the dose for cancer cells as high as 
possible while limiting the dose for normal cells. The ratio of abnormal 
cells killed to normal cells killed is called the therapeutic ratio, and all 
radiotherapy techniques are designed to enhance this ratio. Radiation can be 
concentrated in cancerous tissue by a number of techniques. One of the 
most prevalent techniques for well-defined tumors is a geometric technique 


shown in [link]. A narrow beam of radiation is passed through the patient 
from a variety of directions with a common crossing point in the tumor. 
This concentrates the dose in the tumor while spreading it out over a large 
volume of normal tissue. The external radiation can be x-rays, °°Co ¥ rays, 
or ionizing-particle beams produced by accelerators. Accelerator-produced 
beams of neutrons, m-mesons, and heavy ions such as nitrogen nuclei have 
been employed, and these can be quite effective. These particles have larger 
QFs or RBEs and sometimes can be better localized, producing a greater 
therapeutic ratio. But accelerator radiotherapy is much more expensive and 
less frequently employed than other forms. 


The ©°Co source of y-radiation 
is rotated around the patient so 
that the common crossing point 
is in the tumor, concentrating 
the dose there. This geometric 
technique works for well- 
defined tumors. 


Another form of radiotherapy uses chemically inert radioactive implants. 
One use is for prostate cancer. Radioactive seeds (about 40 to 100 and the 
size of a grain of rice) are placed in the prostate region. The isotopes used 


are usually '8°I (6-month half life) or !°8Pd (3-month half life). Alpha 
emitters have the dual advantages of a large QF and a small range for better 
localization. 


Radiopharmaceuticals are used for cancer therapy when they can be 
localized well enough to produce a favorable therapeutic ratio. Thyroid 
cancer is commonly treated utilizing radioactive iodine. Thyroid cells 
concentrate iodine, and cancerous thyroid cells are more aggressive in 
doing this. An ingenious use of radiopharmaceuticals in cancer therapy tags 
antibodies with radioisotopes. Antibodies produced by a patient to combat 
his cancer are extracted, cultured, loaded with a radioisotope, and then 
returned to the patient. The antibodies are concentrated almost entirely in 
the tissue they developed to fight, thus localizing the radiation in abnormal 
tissue. The therapeutic ratio can be quite high for short-range radiation. 
There is, however, a significant dose for organs that eliminate 
radiopharmaceuticals from the body, such as the liver, kidneys, and bladder. 
As with most radiotherapy, the technique is limited by the tolerable amount 
of damage to the normal tissue. 


[link] lists typical therapeutic doses of radiation used against certain 
cancers. The doses are large, but not fatal because they are localized and 
spread out in time. Protocols for treatment vary with the type of cancer and 
the condition and response of the patient. Three to five 200-rem treatments 
per week for a period of several weeks is typical. Time between treatments 
allows the body to repair normal tissue. This effect occurs because damage 
is concentrated in the abnormal tissue, and the abnormal tissue is more 
sensitive to radiation. Damage to normal tissue limits the doses. You will 
note that the greatest doses are given to any tissue that is not rapidly 
reproducing, such as in the adult brain. Lung cancer, on the other end of the 
scale, cannot ordinarily be cured with radiation because of the sensitivity of 
lung tissue and blood to radiation. But radiotherapy for lung cancer does 
alleviate symptoms and prolong life and is therefore justified in some cases. 


Type of Cancer Typical dose (Sv) 
Lung 10-20 
Hodgkin’s disease 40-45 
Skin 40—50 
Ovarian 50-75 
Breast 50—80+ 
Brain 80+ 
Neck 80+ 
Bone 80+ 
Soft tissue 80+ 
Thyroid 80+ 


Cancer Radiotherapy 


Finally, it is interesting to note that chemotherapy employs drugs that 
interfere with cell division and is, thus, also effective against cancer. It also 
has almost the same side effects, such as nausea and hair loss, and risks, 
such as the inducement of another cancer. 


Section Summary 


e Radiotherapy is the use of ionizing radiation to treat ailments, now 
limited to cancer therapy. 

e The sensitivity of cancer cells to radiation enhances the ratio of cancer 
cells killed to normal cells killed, which is called the therapeutic ratio. 


¢ Doses for various organs are limited by the tolerance of normal tissue 
for radiation. Treatment is localized in one region of the body and 
spread out in time. 


Conceptual Questions 


Exercise: 
Problem: 
Radiotherapy is more likely to be used to treat cancer in elderly 


patients than in young ones. Explain why. Why is radiotherapy used to 
treat young people at all? 


Problems & Exercises 


Exercise: 


Problem: 


A beam of 168-MeV nitrogen nuclei is used for cancer therapy. If this 
beam is directed onto a 0.200-kg tumor and gives it a 2.00-Sv dose, 
how many nitrogen nuclei were stopped? (Use an RBE of 20 for heavy 
ions.) 


Solution: 


7.44 x 108 
Exercise: 


Problem: 


(a) If the average molecular mass of compounds in food is 50.0 g, how 
many molecules are there in 1.00 kg of food? (b) How many ion pairs 
are created in 1.00 kg of food, if it is exposed to 1000 Sv and it takes 
32.0 eV to create an ion pair? (c) Find the ratio of ion pairs to 
molecules. (d) If these ion pairs recombine into a distribution of 2000 
new compounds, how many parts per billion is each? 


Exercise: 
Problem: 
Calculate the dose in Sv to the chest of a patient given an x-ray under 
the following conditions. The x-ray beam intensity is 1.50 W/ m’, the 


area of the chest exposed is 0.0750 m2, 35.0% of the x-rays are 
absorbed in 20.0 kg of tissue, and the exposure time is 0.250 s. 


Solution: 


4.92 x 10-4 Sv 

Exercise: 
Problem: 
(a) A cancer patient is exposed to ¥ rays from a 5000-Ci °°Co 
transillumination unit for 32.0 s. The y rays are collimated in such a 
manner that only 1.00% of them strike the patient. Of those, 20.0% are 
absorbed in a tumor having a mass of 1.50 kg. What is the dose in rem 
to the tumor, if the average yy energy per decay is 1.25 MeV? None of 


the 6s from the decay reach the patient. (b) Is the dose consistent with 
stated therapeutic doses? 


Exercise: 


Problem: 


What is the mass of ®©°Co in a cancer therapy transillumination unit 
containing 5.00 kCi of ®°Co? 


Solution: 


4.43 g 


Exercise: 


Problem: 


Large amounts of ®°Zn are produced in copper exposed to accelerator 
beams. While machining contaminated copper, a physicist ingests 

50.0 Ci of Zn. Each ©°Zn decay emits an average y-ray energy of 
0.550 MeV, 40.0% of which is absorbed in the scientist’s 75.0-kg body. 
What dose in mSv is caused by this in one day? 


Exercise: 


Problem: 


Naturally occurring *°K is listed as responsible for 16 mrem/y of 
background radiation. Calculate the mass of *°K that must be inside 
the 55-kg body of a woman to produce this dose. Each *°K decay 
emits a 1.32-MeV £, and 50% of the energy is absorbed inside the 
body. 


Solution: 


0.010 g 
Exercise: 


Problem: 


(a) Background radiation due to 2*°Ra averages only 0.01 mSv/y, but 
it can range upward depending on where a person lives. Find the mass 
of 2?6Ra in the 80.0-kg body of a man who receives a dose of 2.50- 
mSv/y from it, noting that each ?2°Ra decay emits a 4.80-MeV a 
particle. You may neglect dose due to daughters and assume a constant 
amount, evenly distributed due to balanced ingestion and bodily 
elimination. (b) Is it surprising that such a small mass could cause a 
measurable radiation dose? Explain. 


Exercise: 


Problem: 


The annual radiation dose from '4C in our bodies is 0.01 mSv/y. Each 
M4C decay emits a 6 averaging 0.0750 MeV. Taking the fraction of 
'4C to be 1.3 x 10°! N of normal !2C, and assuming the body is 13% 
carbon, estimate the fraction of the decay energy absorbed. (The rest 
escapes, exposing those close to you.) 


Solution: 


95% 
Exercise: 


Problem: 


If everyone in Australia received an extra 0.05 mSv per year of 
radiation, what would be the increase in the number of cancer deaths 
per year? (Assume that time had elapsed for the effects to become 
apparent.) Assume that there are 200 x 10°“ deaths per Sv of 
radiation per year. What percent of the actual number of cancer deaths 
recorded is this? 


Glossary 


radiotherapy 
the use of ionizing radiation to treat ailments 


therapeutic ratio 
the ratio of abnormal cells killed to normal cells killed 


Food Irradiation 
e Define food irradiation low dose, and free radicals. 


Ionizing radiation is widely used to sterilize medical supplies, such as 
bandages, and consumer products, such as tampons. Worldwide, it is also 
used to irradiate food, an application that promises to grow in the future. 
Food irradiation is the treatment of food with ionizing radiation. It is used 
to reduce pest infestation and to delay spoilage and prevent illness caused 
by microorganisms. Food irradiation is controversial. Proponents see it as 
superior to pasteurization, preservatives, and insecticides, supplanting 
dangerous chemicals with a more effective process. Opponents see its 
safety as unproven, perhaps leaving worse toxic residues as well as 
presenting an environmental hazard at treatment sites. In developing 
countries, food irradiation might increase crop production by 25.0% or 
more, and reduce food spoilage by a similar amount. It is used chiefly to 
treat spices and some fruits, and in some countries, red meat, poultry, and 
vegetables. Over 40 countries have approved food irradiation at some level. 


Food irradiation exposes food to large doses of -y rays, x-rays, or electrons. 
These photons and electrons induce no nuclear reactions and thus create no 
residual radioactivity. (Some forms of ionizing radiation, such as neutron 
irradiation, cause residual radioactivity. These are not used for food 
irradiation.) The yy source is usually ®°Co or °’Cs, the latter isotope being a 
major by-product of nuclear power. Cobalt-60 + rays average 1.25 MeV, 
while those of !°’Cs are 0.67 MeV and are less penetrating. X-rays used for 
food irradiation are created with voltages of up to 5 million volts and, thus, 
have photon energies up to 5 MeV. Electrons used for food irradiation are 
accelerated to energies up to 10 MeV. The higher the energy per particle, 
the more penetrating the radiation is and the more ionization it can create. 
[link] shows a typical y-irradiation plant. 


Irradiation room 


“Conveyor system 


Control console 


Radiation source rack 


A food irradiation plant has a 
conveyor system to pass items 
through an intense radiation field 
behind thick shielding walls. The y 
source is lowered into a deep pool of 
water for safe storage when not in use. 
Exposure times of up to an hour 
expose food to doses up to 104 Gy. 


Owing to the fact that food irradiation seeks to destroy organisms such as 
insects and bacteria, much larger doses than those fatal to humans must be 
applied. Generally, the simpler the organism, the more radiation it can 
tolerate. (Cancer cells are a partial exception, because they are rapidly 
reproducing and, thus, more sensitive.) Current licensing allows up to 1000 
Gy to be applied to fresh fruits and vegetables, called a low dose in food 
irradiation. Such a dose is enough to prevent or reduce the growth of many 
microorganisms, but about 10,000 Gy is needed to kill salmonella, and even 
more is needed to kill fungi. Doses greater than 10,000 Gy are considered to 
be high doses in food irradiation and product sterilization. 


The effectiveness of food irradiation varies with the type of food. Spices 
and many fruits and vegetables have dramatically longer shelf lives. These 
also show no degradation in taste and no loss of food value or vitamins. If 
not for the mandatory labeling, such foods subjected to low-level irradiation 
(up to 1000 Gy) could not be distinguished from untreated foods in quality. 


However, some foods actually spoil faster after irradiation, particularly 
those with high water content like lettuce and peaches. Others, such as milk, 
are given a noticeably unpleasant taste. High-level irradiation produces 
significant and chemically measurable changes in foods. It produces about a 
15% loss of nutrients and a 25% loss of vitamins, as well as some change in 
taste. Such losses are similar to those that occur in ordinary freezing and 
cooking. 


How does food irradiation work? Ionization produces a random assortment 
of broken molecules and ions, some with unstable oxygen- or hydrogen- 
containing molecules known as free radicals. These undergo rapid 
chemical reactions, producing perhaps four or five thousand different 
compounds called radiolytic products, some of which make cell function 
impossible by breaking cell membranes, fracturing DNA, and so on. How 
safe is the food afterward? Critics argue that the radiolytic products present 
a lasting hazard, perhaps being carcinogenic. However, the safety of 
irradiated food is not known precisely. We do know that low-level food 
irradiation produces no compounds in amounts that can be measured 
chemically. This is not surprising, since trace amounts of several thousand 
compounds may be created. We also know that there have been no 
observable negative short-term effects on consumers. Long-term effects 
may show up if large number of people consume large quantities of 
irradiated food, but no effects have appeared due to the small amounts of 
irradiated food that are consumed regularly. The case for safety is supported 
by testing of animal diets that were irradiated; no transmitted genetic effects 
have been observed. Food irradiation (at least up to a million rad) has been 
endorsed by the World Health Organization and the UN Food and 
Agricultural Organization. Finally, the hazard to consumers, if it exists, 
must be weighed against the benefits in food production and preservation. It 
must also be weighed against the very real hazards of existing insecticides 
and food preservatives. 


Section Summary 


¢ Food irradiation is the treatment of food with ionizing radiation. 
e Irradiating food can destroy insects and bacteria by creating free 
radicals and radiolytic products that can break apart cell membranes. 


e Food irradiation has produced no observable negative short-term 
effects for humans, but its long-term effects are unknown. 


Conceptual Questions 


Exercise: 
Problem: 
Does food irradiation leave the food radioactive? To what extent is the 
food altered chemically for low and high doses in food irradiation? 
Exercise: 
Problem: 
Compare a low dose of radiation to a human with a low dose of 
radiation used in food treatment. 
Exercise: 
Problem: 
Suppose one food irradiation plant uses a !°’Cs source while another 
uses an equal activity of °°Co. Assuming equal fractions of the + rays 


from the sources are absorbed, why is more time needed to get the 
same dose using the !8’Cs source? 


Glossary 


food irradiation 
treatment of food with ionizing radiation 


free radicals 
ions with unstable oxygen- or hydrogen-containing molecules 


radiolytic products 
compounds produced due to chemical reactions of free radicals 


Fusion 


e Define nuclear fusion. 
e Discuss processes to achieve practical fusion energy generation. 


While basking in the warmth of the summer sun, a student reads of the latest 
breakthrough in achieving sustained thermonuclear power and vaguely recalls hearing 
about the cold fusion controversy. The three are connected. The Sun’s energy is produced 
by nuclear fusion (see [link]). Thermonuclear power is the name given to the use of 
controlled nuclear fusion as an energy source. While research in the area of thermonuclear 
power is progressing, high temperatures and containment difficulties remain. The cold 
fusion controversy centered around unsubstantiated claims of practical fusion power at 
room temperatures. 


The Sun’s energy is 
produced by nuclear fusion. 
(credit: Spiralz) 


Nuclear fusion is a reaction in which two nuclei are combined, or fused, to form a larger 
nucleus. We know that all nuclei have less mass than the sum of the masses of the protons 
and neutrons that form them. The missing mass times c? equals the binding energy of the 
nucleus—the greater the binding energy, the greater the missing mass. We also know that 
BE/A, the binding energy per nucleon, is greater for medium-mass nuclei and has a 
maximum at Fe (iron). This means that if two low-mass nuclei can be fused together to 
form a larger nucleus, energy can be released. The larger nucleus has a greater binding 
energy and less mass per nucleon than the two that combined. Thus mass is destroyed in 
the fusion reaction, and energy is released (see [link]). On average, fusion of low-mass 
nuclei releases energy, but the details depend on the actual nuclides involved. 


Fusion 
produces 
energy 


Binding energy per nucleon (MeV per nucleon) 


Atomic mass 


Fusion of light nuclei to form medium-mass nuclei 
destroys mass, because BE/A is greater for the 
product nuclei. The larger BE/A is, the less mass 
per nucleon, and so mass is converted to energy 
and released in these fusion reactions. 


The major obstruction to fusion is the Coulomb repulsion between nuclei. Since the 
attractive nuclear force that can fuse nuclei together is short ranged, the repulsion of like 
positive charges must be overcome to get nuclei close enough to induce fusion. [link] 
shows an approximate graph of the potential energy between two nuclei as a function of 
the distance between their centers. The graph is analogous to a hill with a well in its 
center. A ball rolled from the right must have enough kinetic energy to get over the hump 
before it falls into the deeper well with a net gain in energy. So it is with fusion. If the 
nuclei are given enough kinetic energy to overcome the electric potential energy due to 
repulsion, then they can combine, release energy, and fall into a deep well. One way to 
accomplish this is to heat fusion fuel to high temperatures so that the kinetic energy of 


thermal motion is sufficient to get the nuclei together. 
PE io 


Pulled 


together Repelled 


Repulsive 
Coulomb 


Attractive nuclear 


Potential energy between 
two light nuclei graphed as a 
function of distance between 

them. If the nuclei have 
enough kinetic energy to get 
over the Coulomb repulsion 
hump, they combine, release 
energy, and drop into a deep 
attractive well. Tunneling 
through the barrier is 
important in practice. The 
greater the kinetic energy 
and the higher the particles 
get up the barrier (or the 
lower the barrier), the more 
likely the tunneling. 


You might think that, in the core of our Sun, nuclei are coming into contact and fusing. 
However, in fact, temperatures on the order of 10°K are needed to actually get the nuclei 
in contact, exceeding the core temperature of the Sun. Quantum mechanical tunneling is 
what makes fusion in the Sun possible, and tunneling is an important process in most 
other practical applications of fusion, too. Since the probability of tunneling is extremely 
sensitive to barrier height and width, increasing the temperature greatly increases the rate 
of fusion. The closer reactants get to one another, the more likely they are to fuse (see 
[link]). Thus most fusion in the Sun and other stars takes place at their centers, where 
temperatures are highest. Moreover, high temperature is needed for thermonuclear power 
to be a practical source of energy. 


CR -~ Bw Gk ~—Bso 


(a) (b) 


(a) Two nuclei heading toward each 
other slow down, then stop, and then 
fly away without touching or fusing. 
(b) At higher energies, the two nuclei 
approach close enough for fusion via 
tunneling. The probability of 
tunneling increases as they approach, 


but they do not have to touch for the 
reaction to occur. 


The Sun produces energy by fusing protons or hydrogen nuclei 'H (by far the Sun’s most 
abundant nuclide) into helium nuclei “He. The principal sequence of fusion reactions 
forms what is called the proton-proton cycle: 


Equation: 

"H+ 1H > 7H + et + v6 (0.42 MeV) 
Equation: 

*H +°H > He + 7 (5.49 MeV) 
Equation: 


°He + "He > “He + ’H+'+H (12.86 MeV) 


where e* stands for a positron and ve is an electron neutrino. (The energy in parentheses 
is released by the reaction.) Note that the first two reactions must occur twice for the third 
to be possible, so that the cycle consumes six protons (1H) but gives back two. 
Furthermore, the two positrons produced will find two electrons and annihilate to form 
four more 7¥ rays, for a total of six. The overall effect of the cycle is thus 

Equation: 


2e + 4'H — “He + 2ve + 6y (26.7 MeV) 


where the 26.7 MeV includes the annihilation energy of the positrons and electrons and is 
distributed among all the reaction products. The solar interior is dense, and the reactions 
occur deep in the Sun where temperatures are highest. It takes about 32,000 years for the 
energy to diffuse to the surface and radiate away. However, the neutrinos escape the Sun 
in less than two seconds, carrying their energy with them, because they interact so weakly 
that the Sun is transparent to them. Negative feedback in the Sun acts as a thermostat to 
regulate the overall energy output. For instance, if the interior of the Sun becomes hotter 
than normal, the reaction rate increases, producing energy that expands the interior. This 
cools it and lowers the reaction rate. Conversely, if the interior becomes too cool, it 
contracts, increasing the temperature and reaction rate (see [link]). Stars like the Sun are 
stable for billions of years, until a significant fraction of their hydrogen has been depleted. 
What happens then is discussed in Introduction to Frontiers of Physics . 
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Photon 


Nuclear fusion in the Sun 
converts hydrogen nuclei 
into helium; fusion occurs 
primarily at the boundary 
of the helium core, where 
temperature is highest 
and sufficient hydrogen 
remains. Energy released 
diffuses slowly to the 
surface, with the 
exception of neutrinos, 
which escape 
immediately. Energy 
production remains stable 
because of negative 
feedback effects. 


Theories of the proton-proton cycle (and other energy-producing cycles in stars) were 
pioneered by the German-born, American physicist Hans Bethe (1906-2005), starting in 
1938. He was awarded the 1967 Nobel Prize in physics for this work, and he has made 
many other contributions to physics and society. Neutrinos produced in these cycles 
escape so readily that they provide us an excellent means to test these theories and study 
stellar interiors. Detectors have been constructed and operated for more than four decades 
now to measure solar neutrinos (see [link]). Although solar neutrinos are detected and 
neutrinos were observed from Supernova 1987A ([link]), too few solar neutrinos were 
observed to be consistent with predictions of solar energy production. After many years, 
this solar neutrino problem was resolved with a blend of theory and experiment that 
showed that the neutrino does indeed have mass. It was also found that there are three 
types of neutrinos, each associated with a different type of nuclear decay. 


This array of 
photomultiplier tubes is 
part of the large solar 
neutrino detector at the 
Fermi National 
Accelerator Laboratory in 
Illinois. In these 
experiments, the 
neutrinos interact with 
heavy water and produce 
flashes of light, which are 
detected by the 
photomultiplier tubes. In 
spite of its size and the 
huge flux of neutrinos 
that strike it, very few are 
detected each day since 
they interact so weakly. 
This, of course, is the 
same reason they escape 
the Sun so readily. 
(credit: Fred Ullrich) 


Supernovas are the source 
of elements heavier than 


iron. Energy released 
powers nucleosynthesis. 
Spectroscopic analysis of 
the ring of material 
ejected by Supernova 
1987A observable in the 
southern hemisphere, 
shows evidence of heavy 
elements. The study of 
this supernova also 
provided indications that 
neutrinos might have 
mass. (credit: NASA, 
ESA, and P. Challis) 


The proton-proton cycle is not a practical source of energy on Earth, in spite of the great 
abundance of hydrogen (+H). The reaction 'H + 'H — 2H + e* + v, has a very low 
probability of occurring. (This is why our Sun will last for about ten billion years.) 
However, a number of other fusion reactions are easier to induce. Among them are: 
Equation: 


2H +°H > 3H+1H = (4.03 MeV) 


Equation: 

7H + 7H — 27He +n (3.27 MeV) 
Equation: 

°H+°H > “*He+n (17.59 MeV) 
Equation: 


°H + 7H > “He+¥ (23.85 MeV). 


Deuterium (7H) is about 0.015% of natural hydrogen, so there is an immense amount of it 
in sea water alone. In addition to an abundance of deuterium fuel, these fusion reactions 
produce large energies per reaction (in parentheses), but they do not produce much 
radioactive waste. Tritium (?H) is radioactive, but it is consumed as a fuel (the reaction 
7H + 3H — “He + n), and the neutrons and ys can be shielded. The neutrons produced 
can also be used to create more energy and fuel in reactions like 

Equation: 


n+‘H>?H+y7 = (20.68 MeV) 


and 
Equation: 


n+‘H—>?H+y7 (2.22 MeV). 


Note that these last two reactions, and 7H + 7H — 4He + 4, put most of their energy 
output into the y ray, and such energy is difficult to utilize. 


The three keys to practical fusion energy generation are to achieve the temperatures 
necessary to make the reactions likely, to raise the density of the fuel, and to confine it 
long enough to produce large amounts of energy. These three factors—temperature, 
density, and time—complement one another, and so a deficiency in one can be 
compensated for by the others. Ignition is defined to occur when the reactions produce 
enough energy to be self-sustaining after external energy input is cut off. This goal, which 
must be reached before commercial plants can be a reality, has not been achieved. 
Another milestone, called break-even, occurs when the fusion power produced equals the 
heating power input. Break-even has nearly been reached and gives hope that ignition and 
commercial plants may become a reality in a few decades. 


Two techniques have shown considerable promise. The first of these is called magnetic 
confinement and uses the property that charged particles have difficulty crossing 
magnetic field lines. The tokamak, shown in [link], has shown particular promise. The 
tokamak’s toroidal coil confines charged particles into a circular path with a helical twist 
due to the circulating ions themselves. In 1995, the Tokamak Fusion Test Reactor at 
Princeton in the US achieved world-record plasma temperatures as high as 500 million 
degrees Celsius. This facility operated between 1982 and 1997. A joint international effort 
is underway in France to build a tokamak-type reactor that will be the stepping stone to 
commercial power. ITER, as it is called, will be a full-scale device that aims to 
demonstrate the feasibility of fusion energy. It will generate 500 MW of power for 
extended periods of time and will achieve break-even conditions. It will study plasmas in 
conditions similar to those expected in a fusion power plant. Completion is scheduled for 
2018. 


(a) Artist’s rendition of 
ITER, a tokamak-type 
fusion reactor being built 
in southern France. It is 
hoped that this gigantic 
machine will reach the 
break-even point. 
Completion is scheduled 
for 2018. (credit: Stephan 
Mosel, Flickr) 


The second promising technique aims multiple lasers at tiny fuel pellets filled with a 
mixture of deuterium and tritium. Huge power input heats the fuel, evaporating the 
confining pellet and crushing the fuel to high density with the expanding hot plasma 
produced. This technique is called inertial confinement, because the fuel’s inertia 
prevents it from escaping before significant fusion can take place. Higher densities have 
been reached than with tokamaks, but with smaller confinement times. In 2009, the 
Lawrence Livermore Laboratory (CA) completed a laser fusion device with 192 
ultraviolet laser beams that are focused upon a D-T pellet (see [link]). 
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National Ignition Facility 
(CA). This image shows a 
laser bay where 192 laser 
beams will focus onto a 
small D-T target, 
producing fusion. (credit: 
Lawrence Livermore 
National Laboratory, 
Lawrence Livermore 
National Security, LLC, 
and the Department of 
Energy) 


Example: 

Calculating Energy and Power from Fusion 

(a) Calculate the energy released by the fusion of a 1.00-kg mixture of deuterium and 
tritium, which produces helium. There are equal numbers of deuterium and tritium nuclei 
in the mixture. 

(b) If this takes place continuously over a period of a year, what is the average power 
output? 

Strategy 

According to 7H + °H — *He + n, the energy per reaction is 17.59 MeV. To find the 
total energy released, we must find the number of deuterium and tritium atoms in a 
kilogram. Deuterium has an atomic mass of about 2 and tritium has an atomic mass of 
about 3, for a total of about 5 g per mole of reactants or about 200 mol in 1.00 kg. To get 
a more precise figure, we will use the atomic masses from Appendix A. The power 
output is best expressed in watts, and so the energy output needs to be calculated in 
joules and then divided by the number of seconds in a year. 

Solution for (a) 

The atomic mass of deuterium (7H) is 2.014102 u, while that of tritium (*H) is 3.016049 
u, for a total of 5.032151 u per reaction. So a mole of reactants has a mass of 5.03 g, and 
in 1.00 kg there are (1000 g)/(5.03 g/mol)=198.8 mol of reactants. The number of 
reactions that take place is therefore 

Equation: 


(198.8 mol) (6.02 x 107 mol") = 1.20 x 10°° reactions. 


The total energy output is the number of reactions times the energy per reaction: 
Equation: 


E = (1.20 x 10° reactions) (17.59 MeV/reaction) (1.602 x 10°’ J/MeV) 
= eek 


Solution for (b) 
Power is energy per unit time. One year has 3.16 x 10’ s, so 


Equation: 
=) ESS S3rxl0 I, 
co t —-3.16x10"s 
1.07 x 10’ W = 10.7 MW. 
Discussion 


By now we expect nuclear processes to yield large amounts of energy, and we are not 
disappointed here. The energy output of 3.37 x 10'* J from fusing 1.00 kg of deuterium 


and tritium is equivalent to 2.6 million gallons of gasoline and about eight times the 
energy output of the bomb that destroyed Hiroshima. Yet the average backyard 
swimming pool has about 6 kg of deuterium in it, so that fuel is plentiful if it can be 
utilized in a controlled manner. The average power output over a year is more than 10 
MW, impressive but a bit small for a commercial power plant. About 32 times this power 
output would allow generation of 100 MW of electricity, assuming an efficiency of one- 
third in converting the fusion energy to electrical energy. 


Section Summary 


e Nuclear fusion is a reaction in which two nuclei are combined to form a larger 
nucleus. It releases energy when light nuclei are fused to form medium-mass nuclei. 
e Fusion is the source of energy in stars, with the proton-proton cycle, 


Equation: 
H+ 4H > H+ et +4, (0.42 MeV) 
Equation: 
4H + °H > °He + (5.49 MeV) 
Equation: 


3He + °He — “He + 1H + /H (12.86 MeV) 


being the principal sequence of energy-producing reactions in our Sun. 
e The overall effect of the proton-proton cycle is 
Equation: 


2e + 4'H — *He + 2u, + 6y (26.7 MeV), 


where the 26.7 MeV includes the energy of the positrons emitted and annihilated. 

e Attempts to utilize controlled fusion as an energy source on Earth are related to 
deuterium and tritium, and the reactions play important roles. 

e Ignition is the condition under which controlled fusion is self-sustaining; it has not 
yet been achieved. Break-even, in which the fusion energy output is as great as the 
external energy input, has nearly been achieved. 

e Magnetic confinement and inertial confinement are the two methods being 
developed for heating fuel to sufficiently high temperatures, at sufficient density, and 
for sufficiently long times to achieve ignition. The first method uses magnetic fields 


and the second method uses the momentum of impinging laser beams for 
confinement. 


Conceptual Questions 


Exercise: 


Problem: Why does the fusion of light nuclei into heavier nuclei release energy? 
Exercise: 
Problem: 
Energy input is required to fuse medium-mass nuclei, such as iron or cobalt, into 
more massive nuclei. Explain why. 
Exercise: 
Problem: 
In considering potential fusion reactions, what is the advantage of the reaction 
2H + 39H — 4He + n over the reaction 7H + 27H > *He+ n? 
Exercise: 
Problem: 


Give reasons justifying the contention made in the text that energy from the fusion 
reaction 7H + 7H — *He + 7 is relatively difficult to capture and utilize. 


Problems & Exercises 


Exercise: 
Problem: 
Verify that the total number of nucleons, total charge, and electron family number are 


conserved for each of the fusion reactions in the proton-proton cycle in 
Equation: 


1H + 1H > "H+ et + v,, 


Equation: 


'H + 7H — *He + 7, 


and 
Equation: 


3He + °He — 4He + !H + 1H. 


(List the value of each of the conserved quantities before and after each of the 
reactions.) 


Solution: 


(a) A=141=2, Z=1+1=141, efn =0=-141 


(b) A=14+2=3, Z=1+1=2, efn=0=0 


(c) A=34+3=4+4141, Z=2+2=2+1+1, efn=0=0 
Exercise: 
Problem: 
Calculate the energy output in each of the fusion reactions in the proton-proton 
cycle, and verify the values given in the above summary. 
Exercise: 
Problem: 
Show that the total energy released in the proton-proton cycle is 26.7 MeV, 
considering the overall effect in tH + 1H > 7H+ e++%,'H+?H —> *He +4, 
and “He + He > *He + 'H + !H and being certain to include the annihilation 
energy. 
Solution: 
E = (m,—me)c? 
= [4m (*H) — m(“He) | e 
= [4(1.007825) — 4.002603](931.5 MeV) 
26.73 MeV 
Exercise: 
Problem: 
Verify by listing the number of nucleons, total charge, and electron family number 


before and after the cycle that these quantities are conserved in the overall proton- 
proton cycle in 2e” + 44H —> “He + 2u, + 6y. 


Exercise: 


Problem: 


The energy produced by the fusion of a 1.00-kg mixture of deuterium and tritium 
was found in Example Calculating Energy_and Power from Fusion. Approximately 
how many kilograms would be required to supply the annual energy use in the 
United States? 


Solution: 


3.12 x 10° kg (about 200 tons) 
Exercise: 


Problem: 


Tritium is naturally rare, but can be produced by the reaction n + 7H — ?H + 4. 
How much energy in MeV is released in this neutron capture? 


Exercise: 
Problem: Two fusion reactions mentioned in the text are 
n+ He > *He + y 
and 
n+'H>?H+4. 
Both reactions release energy, but the second also creates more fuel. Confirm that the 


energies produced in the reactions are 20.58 and 2.22 MeV, respectively. Comment 
on which product nuclide is most tightly bound, “He or 7H. 


Solution: 

E = (m—me)c? 

E, = (1.008665 + 3.016030 — 4.002603) (931.5 MeV) 
= 20.58 MeV 

E, = (1.008665 + 1.007825 — 2.014102)(931.5 MeV) 
= 2.224 MeV 


“He is more tightly bound, since this reaction gives off more energy per nucleon. 


Exercise: 


Problem: 


(a) Calculate the number of grams of deuterium in an 80,000-L swimming pool, 
given deuterium is 0.0150% of natural hydrogen. 


(b) Find the energy released in joules if this deuterium is fused via the reaction 
7H + 7H > 2He+ n. 


(c) Could the neutrons be used to create more energy? 


(d) Discuss the amount of this type of energy in a swimming pool as compared to 
that in, say, a gallon of gasoline, also taking into consideration that water is far more 
abundant. 


Exercise: 
Problem: 


How many kilograms of water are needed to obtain the 198.8 mol of deuterium, 
assuming that deuterium is 0.01500% (by number) of natural hydrogen? 


Solution: 


1.19 x 10*kg 


Exercise: 


Problem: The power output of the Sun is 4 x 107° W. 


(a) If 90% of this is supplied by the proton-proton cycle, how many protons are 
consumed per second? 


(b) How many neutrinos per second should there be per square meter at the Earth 
from this process? This huge number is indicative of how rarely a neutrino interacts, 
since large detectors observe very few per day. 


Exercise: 
Problem: 
Another set of reactions that result in the fusing of hydrogen into helium in the Sun 


and especially in hotter stars is called the carbon cycle. It is 
Equation: 


PC+IH +> BN+y, 

13 = BO tert y,, 
BOLI + MNS Y; 
MN+1H >= 4O+%, 

15O > PN+et++w, 
BN+1H ~~ 2C+4He 


Write down the overall effect of the carbon cycle (as was done for the proton-proton 
cycle in 2e~ + 44H — *He + 2v, + 6y). Note the number of protons ( 'H) required 
and assume that the positrons ( e*) annihilate electrons to form more 7¥ rays. 


Solution: 


2e- + 4H — “He + 7y + 2, 
Exercise: 
Problem: 
(a) Find the total energy released in MeV in each carbon cycle (elaborated in the 
above problem) including the annihilation energy. 
(b) How does this compare with the proton-proton cycle output? 
Exercise: 
Problem: 
Verify that the total number of nucleons, total charge, and electron family number are 
conserved for each of the fusion reactions in the carbon cycle given in the above 


problem. (List the value of each of the conserved quantities before and after each of 
the reactions.) 


Solution: 


(a) A=12+1=13, Z=6+1=7, efn = 0 =0 


(b) A=13=13, Z=7=6+41, efn =0 =-1+1 
(c) A=13 + 1=14, Z=6+1=7, efn = 0 =0 


(d) A=14 + 1=15, Z=7+1=8, efn = 0 =0 


(e) A=15=15, Z=8=7+1, efn =0 = —-1+41 


(f) A=15 + 1=12 + 4, Z=7-+1=6 + 2, efn =0 =0 


Exercise: 


Problem: Integrated Concepts 


The laser system tested for inertial confinement can produce a 100-kJ pulse only 
1.00 ns in duration. (a) What is the power output of the laser system during the brief 
pulse? 


(b) How many photons are in the pulse, given their wavelength is 1.06 m? 
(c) What is the total momentum of all these photons? 


(d) How does the total photon momentum compare with that of a single 1.00 MeV 
deuterium nucleus? 


Exercise: 


Problem: Integrated Concepts 


Find the amount of energy given to the *He nucleus and to the ¥ ray in the reaction 
n +? He —>* He + 4, using the conservation of momentum principle and taking the 
reactants to be initially at rest. This should confirm the contention that most of the 
energy goes to the ¥ ray. 


Solution: 
E, = 20.6 MeV 


Es. = 5.68 x 10°? MeV 


Exercise: 


Problem: Integrated Concepts 


(a) What temperature gas would have atoms moving fast enough to bring two *He 
nuclei into contact? Note that, because both are moving, the average kinetic energy 
only needs to be half the electric potential energy of these doubly charged nuclei 
when just in contact with one another. 


(b) Does this high temperature imply practical difficulties for doing this in controlled 
fusion? 


Exercise: 


Problem: Integrated Concepts 


(a) Estimate the years that the deuterium fuel in the oceans could supply the energy 
needs of the world. Assume world energy consumption to be ten times that of the 
United States which is 8 x 10! J/y and that the deuterium in the oceans could be 
converted to energy with an efficiency of 32%. You must estimate or look up the 
amount of water in the oceans and take the deuterium content to be 0.015% of 
natural hydrogen to find the mass of deuterium available. Note that approximate 
energy yield of deuterium is 3.37 x 10‘ J/kg. 


(b) Comment on how much time this is by any human measure. (It is not an 
unreasonable result, only an impressive one.) 


Solution: 
(a)3 x 10° y 


(b) This is approximately half the lifetime of the Earth. 


Glossary 


break-even 
when fusion power produced equals the heating power input 


ignition 


when a fusion reaction produces enough energy to be self-sustaining after external 


energy input is cut off 


inertial confinement 
a technique that aims multiple lasers at tiny fuel pellets evaporating and crushing 
them to high density 


magnetic confinement 
a technique in which charged particles are trapped in a small region because of 
difficulty in crossing magnetic field lines 


nuclear fusion 
a reaction in which two nuclei are combined, or fused, to form a larger nucleus 


proton-proton cycle 
the combined reactions 'H+'H > *H+e*+v,, H+*H > °Het+y, and 
3He+°He — *He+!H+'H 


Fission 


e Define nuclear fission. 
e Discuss how fission fuel reacts and describe what it produces. 
e Describe controlled and uncontrolled chain reactions. 


Nuclear fission is a reaction in which a nucleus is split (or fissured). Controlled 
fission is a reality, whereas controlled fusion is a hope for the future. Hundreds of 
nuclear fission power plants around the world attest to the fact that controlled 
fission is practical and, at least in the short term, economical, as seen in [Link]. 
Whereas nuclear power was of little interest for decades following TMI and 
Chermoby1 (and now Fukushima Daiichi), growing concerns over global warming 
has brought nuclear power back on the table as a viable energy alternative. By the 
end of 2009, there were 442 reactors operating in 30 countries, providing 15% of 
the world’s electricity. France provides over 75% of its electricity with nuclear 
power, while the US has 104 operating reactors providing 20% of its electricity. 
Australia and New Zealand have none. China is building nuclear power plants at 
the rate of one start every month. 


The people living near 
this nuclear power plant 
have no measurable 
exposure to radiation that 
is traceable to the plant. 
About 16% of the world’s 
electrical power is 
generated by controlled 
nuclear fission in such 
plants. The cooling 
towers are the most 
prominent features but 
are not unique to nuclear 


power. The reactor is in 
the small domed building 
to the left of the towers. 
(credit: Kalmthouts) 


Fission is the opposite of fusion and releases energy only when heavy nuclei are 
split. As noted in Fusion, energy is released if the products of a nuclear reaction 
have a greater binding energy per nucleon (BE/ A) than the parent nuclei. [link] 
shows that BE/A is greater for medium-mass nuclei than heavy nuclei, implying 
that when a heavy nucleus is split, the products have less mass per nucleon, so 
that mass is destroyed and energy is released in the reaction. The amount of 
energy per fission reaction can be large, even by nuclear standards. The graph in 
[link] shows BE/A to be about 7.6 MeV/nucleon for the heaviest nuclei (A 
about 240), while BE/A is about 8.6 MeV/nucleon for nuclei having A about 
120. Thus, if a heavy nucleus splits in half, then about 1 MeV per nucleon, or 
approximately 240 MeV per fission, is released. This is about 10 times the energy 
per fusion reaction, and about 100 times the energy of the average a, 8, or y 
decay. 


Example: 

Calculating Energy Released by Fission 

Calculate the energy released in the following spontaneous fission reaction: 
Equation: 


PE a Oe Ge to Bp 


given the atomic masses to be m(7?°U) = 238.050784 u, 

m(®°Sr) = 94.919388 u, m(14°Xe) = 139.921610 u, and 

m(n) = 1.008665 u. 

Strategy 

As always, the energy released is equal to the mass destroyed times c” , so we 
must find the difference in mass between the parent 7°°U and the fission 
products. 

Solution 

The products have a total mass of 

Equation: 


Mproducts = 94.919388 u + 139.921610 u + 3(1.008665 u) 
237.866993 u. 


The mass lost is the mass of 2°°U minus Mproducts, OF 
Equation: 


Am = 238.050784 u — 237.8669933 u = 0.183791 u, 


so the energy released is 
Equation: 


E = (Am)c? 
(0.183791 w) SSMeW/e 2 _ 171.2 MeV. 


Discussion 

A number of important things arise in this example. The 171-MeV energy 
released is large, but a little less than the earlier estimated 240 MeV. This is 
because this fission reaction produces neutrons and does not split the nucleus 
into two equal parts. Fission of a given nuclide, such as 7°8U , does not always 
produce the same products. Fission is a statistical process in which an entire 
range of products are produced with various probabilities. Most fission produces 
neutrons, although the number varies with each fission. This is an extremely 
important aspect of fission, because neutrons can induce more fission, enabling 
self-sustaining chain reactions. 


Spontaneous fission can occur, but this is usually not the most common decay 
mode for a given nuclide. For example, 7°°U can spontaneously fission, but it 
decays mostly by a emission. Neutron-induced fission is crucial as seen in [link]. 
Being chargeless, even low-energy neutrons can strike a nucleus and be absorbed 
once they feel the attractive nuclear force. Large nuclei are described by a liquid 
drop model with surface tension and oscillation modes, because the large 
number of nucleons act like atoms in a drop. The neutron is attracted and thus, 
deposits energy, causing the nucleus to deform as a liquid drop. If stretched 
enough, the nucleus narrows in the middle. The number of nucleons in contact 
and the strength of the nuclear force binding the nucleus together are reduced. 
Coulomb repulsion between the two ends then succeeds in fissioning the nucleus, 
which pops like a water drop into two large pieces and a few neutrons. Neutron- 
induced fission can be written as 


Equation: 


n+ 4X > FF, + FF) + xn, 


where FF and FF are the two daughter nuclei, called fission fragments, and x 
is the number of neutrons produced. Most often, the masses of the fission 
fragments are not the same. Most of the released energy goes into the kinetic 
energy of the fission fragments, with the remainder going into the neutrons and 
excited states of the fragments. Since neutrons can induce fission, a self- 
sustaining chain reaction is possible, provided more than one neutron is produced 
on average — that is, if x > linn+ AX + FF, + FF, + xn. This can also be 
seen in [link]. 


An example of a typical neutron-induced fission reaction is 
Equation: 


n+ aU = a Ba ++ ogKr + 3n. 
Note that in this equation, the total charge remains the same (is conserved): 
92 +0 = 56+ 36. Also, as far as whole numbers are concerned, the mass is 


constant: 1 + 235 = 142 + 91+ 3. This is not true when we consider the 
masses out to 6 or 7 significant places, as in the previous example. 


~~ 


~~ 


8 BB 
(d) FF; a * 


Neutron-induced 


235) 
ape oJ nucleus 3 


fission is shown. First, 
energy is put into this 
large nucleus when it 
absorbs a neutron. 
Acting like a struck 
liquid drop, the 
nucleus deforms and 
begins to narrow in the 
middle. Since fewer 
nucleons are in 
contact, the repulsive 
Coulomb force is able 
to break the nucleus 
into two parts with 
some neutrons also 
flying away. 


~~ Neutron 


32 Fission fragment 
nuclei 


A chain reaction can produce self- 
sustained fission if each fission 


produces enough neutrons to induce at 
least one more fission. This depends 
on several factors, including how 
many neutrons are produced in an 
average fission and how easy it is to 
make a particular type of nuclide 
fission. 


Not every neutron produced by fission induces fission. Some neutrons escape the 
fissionable material, while others interact with a nucleus without making it 
fission. We can enhance the number of fissions produced by neutrons by having a 
large amount of fissionable material. The minimum amount necessary for self- 
sustained fission of a given nuclide is called its critical mass. Some nuclides, 
such as 7?9Pu , produce more neutrons per fission than others, such as 2°°U . 
Additionally, some nuclides are easier to make fission than others. In particular, 
23517 and 78°Pu are easier to fission than the much more abundant 7°°U . Both 
factors affect critical mass, which is smallest for 2°9Pu . 


The reason 7°°U and 7°?Pu are easier to fission than 7°°U is that the nuclear 
force is more attractive for an even number of neutrons in a nucleus than for an 
odd number. Consider that U3 has 143 neutrons, and rel 145 has 145 
neutrons, whereas SU 146 has 146. When a neutron encounters a nucleus with an 
odd number of neutrons, the nuclear force is more attractive, because the 
additional neutron will make the number even. About 2-MeV more energy is 
deposited in the resulting nucleus than would be the case if the number of 
neutrons was already even. This extra energy produces greater deformation, 
making fission more likely. Thus, 2°°U and 7°?Pu are superior fission fuels. The 
isotope 89 U)15 only 0.72 % of natural uranium, while 238TJ is 99.27%, and °Pu 
does not exist in nature. Australia has the largest deposits of uranium in the 
world, standing at 28% of the total. This is followed by Kazakhstan and Canada. 
The US has only 3% of global reserves. 


Most fission reactors utilize 2°°U , which is separated from 2°°U at some 
expense. This is called enrichment. The most common separation method is 
gaseous diffusion of uranium hexafluoride (UF’g) through membranes. Since 
23517 has less mass than 7°°U , its UFg molecules have higher average velocity at 
the same temperature and diffuse faster. Another interesting characteristic of 
2351] is that it preferentially absorbs very slow moving neutrons (with energies a 


fraction of an eV), whereas fission reactions produce fast neutrons with energies 
in the order of an MeV. To make a self-sustained fission reactor with 2°°U , it is 
thus necessary to slow down (“thermalize”) the neutrons. Water is very effective, 
since neutrons collide with protons in water molecules and lose energy. [link] 
shows a schematic of a reactor design, called the pressurized water reactor. 


Primary system Secondary system 


Electric 
Hot water generator 
Steam turbine 


Control rods 
exchanger 
Fuel rods 


Containment 
vessel 
(shielding) 


Condenser 


(fuel and moderator) 
Neutrons not 


thermalized, 
reaction stops. 


Shielding Cooling water 


A pressurized water reactor is cleverly designed to control the 
fission of large amounts of 7°°U , while using the heat 
produced in the fission reaction to create steam for generating 
electrical energy. Control rods adjust neutron flux so that 
criticality is obtained, but not exceeded. In case the reactor 
overheats and boils the water away, the chain reaction 
terminates, because water is needed to thermalize the neutrons. 
This inherent safety feature can be overwhelmed in extreme 
circumstances. 


Control rods containing nuclides that very strongly absorb neutrons are used to 
adjust neutron flux. To produce large power, reactors contain hundreds to 
thousands of critical masses, and the chain reaction easily becomes self- 
sustaining, a condition called criticality. Neutron flux should be carefully 
regulated to avoid an exponential increase in fissions, a condition called 
supercriticality. Control rods help prevent overheating, perhaps even a 
meltdown or explosive disassembly. The water that is used to thermalize 


neutrons, necessary to get them to induce fission in 2®°U , and achieve criticality, 
provides a negative feedback for temperature increases. In case the reactor 
overheats and boils the water to steam or is breached, the absence of water kills 
the chain reaction. Considerable heat, however, can still be generated by the 
reactor’s radioactive fission products. Other safety features, thus, need to be 
incorporated in the event of a loss of coolant accident, including auxiliary cooling 
water and pumps. 


Example: 

Calculating Energy from a Kilogram of Fissionable Fuel 

Calculate the amount of energy produced by the fission of 1.00 kg of 7°°U , 
given the average fission reaction of 7?°U produces 200 MeV. 

Strategy 

The total energy produced is the number of 7°°U atoms times the given energy 
per 2°°U fission. We should therefore find the number of 72°U atoms in 1.00 kg. 
Solution 

The number of 7?°U atoms in 1.00 kg is Avogadro’s number times the number of 
moles. One mole of 7°°U has a mass of 235.04 g; thus, there are 

(1000 g)/(235.04 g/mol) = 4.25 mol. The number of 7°U atoms is therefore, 
Equation: 


(4.25 mol) (6.02 x 107° °U/mol) = 2.56 x 10°* 7° U. 


So the total energy released is 
Equation: 


E 


(2.56 x 1074 235[)) ( 200MeV ) (.00g9 "S| 


8.21 x 101° J. 


Discussion 

This is another impressively large amount of energy, equivalent to about 14,000 
barrels of crude oil or 600,000 gallons of gasoline. But, it is only one-fourth the 
energy produced by the fusion of a kilogram mixture of deuterium and tritium as 
seen in [link]. Even though each fission reaction yields about ten times the 
energy of a fusion reaction, the energy per kilogram of fission fuel is less, 
because there are far fewer moles per kilogram of the heavy nuclides. Fission 


fuel is also much more scarce than fusion fuel, and less than 1% of uranium 
(the 73°U) is readily usable. 


One nuclide already mentioned is 239Py , which has a 24,120-y half-life and does 
not exist in nature. Plutonium-239 is manufactured from 2?°U in reactors, and it 
provides an opportunity to utilize the other 99% of natural uranium as an energy 
source. The following reaction sequence, called breeding, produces 7°9Pu . 
Breeding begins with neutron capture by 7°°U : 

Equation: 


PU sige ee ae 


Uranium-239 then 8 decays: 
Equation: 


23977 a 39ND +p + Ve(t1/2 = 23 min). 


Neptunium-239 also B decays: 
Equation: 


39ND a 239Dy + Bp + Ve(t12 24 d). 


Plutonium-239 builds up in reactor fuel at a rate that depends on the probability 
of neutron capture by 7°8U (all reactor fuel contains more 7°°U than 7*°U ). 
Reactors designed specifically to make plutonium are called breeder reactors. 
They seem to be inherently more hazardous than conventional reactors, but it 
remains unknown whether their hazards can be made economically acceptable. 
The four reactors at Chernobyl, including the one that was destroyed, were built 
to breed plutonium and produce electricity. These reactors had a design that was 
significantly different from the pressurized water reactor illustrated above. 


Plutonium-239 has advantages over 2*°U as a reactor fuel — it produces more 
neutrons per fission on average, and it is easier for a thermal neutron to cause it 
to fission. It is also chemically different from uranium, so it is inherently easier to 
separate from uranium ore. This means 2°?Pu has a particularly small critical 
mass, an advantage for nuclear weapons. 


Note: 

PhET Explorations: Nuclear Fission 

Start a chain reaction, or introduce non-radioactive isotopes to prevent one. 
Control energy production in a nuclear reactor! 


https://archive.cnx.org/specials/O1caf0d0-116f-11e6-b891- 
abfdaa77b03b/nuclear-fission/#sim-one-nucleus 


Section Summary 


e Nuclear fission is a reaction in which a nucleus is split. 

e Fission releases energy when heavy nuclei are split into medium-mass 
nuclei. 

e Self-sustained fission is possible, because neutron-induced fission also 
produces neutrons that can induce other fissions, 
a. FF, + FF», + xn, where FF; and FF» are the two daughter 
nuclei, or fission fragments, and x is the number of neutrons produced. 

e¢ A minimum mass, called the critical mass, should be present to achieve 
criticality. 

e More than a critical mass can produce supercriticality. 

e The production of new or different isotopes (especially 2°9Pu ) by nuclear 
transformation is called breeding, and reactors designed for this purpose are 
called breeder reactors. 


Conceptual Questions 


Exercise: 
Problem: 
Explain why the fission of heavy nuclei releases energy. Similarly, why is it 
that energy input is required to fission light nuclei? 


Exercise: 


Problem: 


Explain, in terms of conservation of momentum and energy, why collisions 
of neutrons with protons will thermalize neutrons better than collisions with 
oxygen. 


Exercise: 
Problem: 
The ruins of the Chernobyl reactor are enclosed in a huge concrete structure 
built around it after the accident. Some rain penetrates the building in winter, 


and radioactivity from the building increases. What does this imply is 
happening inside? 


Exercise: 
Problem: 
Since the uranium or plutonium nucleus fissions into several fission 


fragments whose mass distribution covers a wide range of pieces, would you 
expect more residual radioactivity from fission than fusion? Explain. 


Exercise: 
Problem: 
The core of a nuclear reactor generates a large amount of thermal energy 
from the decay of fission products, even when the power-producing fission 
chain reaction is turned off. Would this residual heat be greatest after the 


reactor has run for a long time or short time? What if the reactor has been 
shut down for months? 


Exercise: 
Problem: 
How can a nuclear reactor contain many critical masses and not go 
supercritical? What methods are used to control the fission in the reactor? 


Exercise: 


Problem: 


Why can heavy nuclei with odd numbers of neutrons be induced to fission 
with thermal neutrons, whereas those with even numbers of neutrons require 
more energy input to induce fission? 


Exercise: 


Problem: 


Why is a conventional fission nuclear reactor not able to explode as a bomb? 


Problem Exercises 


Exercise: 


Problem: 


(a) Calculate the energy released in the neutron-induced fission (similar to 
the spontaneous fission in [link]) 
Equation: 


n+ 38y — gr + “Xe + 3n, 


given m(%Sr) = 95.921750 u and m(14°Xe) = 139.92164. (b) This result 
is about 6 MeV greater than the result for spontaneous fission. Why? (c) 
Confirm that the total number of nucleons and total charge are conserved in 
this reaction. 


Solution: 
(a) 177.1 MeV 


(b) Because the gain of an external neutron yields about 6 MeV, which is the 
average BE/A for heavy nuclei. 


(c) 
A=14238 = 964+ 140+1+1+4+1, Z=92 = 38+ 53, efn=0=0 


Exercise: 


Problem: 


(a) Calculate the energy released in the neutron-induced fission reaction 
Equation: 


n+7U > ?Kr+!Ba + 2n, 


given m(92Kr) = 91.926269 u and m(1**Ba) = 141.916361 u. 


(b) Confirm that the total number of nucleons and total charge are conserved 
in this reaction. 


Exercise: 


Problem: 


(a) Calculate the energy released in the neutron-induced fission reaction 
Equation: 


n+?%Py > Sr + Ba + An, 


given m(%6Sr) = 95.921750 u and m(1*°Ba) = 139.910581 u. 


(b) Confirm that the total number of nucleons and total charge are conserved 
in this reaction. 


Solution: 


(a) 180.6 MeV 


(b) 
A=1+ 239 = 96+ 140+1+1+1+41, Z=94 = 384+ 56, efn =0=0 


Exercise: 


Problem: 


Confirm that each of the reactions listed for plutonium breeding just 
following [link] conserves the total number of nucleons, the total charge, 
and electron family number. 


Exercise: 


Problem: 


Breeding plutonium produces energy even before any plutonium is 
fissioned. (The primary purpose of the four nuclear reactors at Chernobyl 
was breeding plutonium for weapons. Electrical power was a by-product 
used by the civilian population.) Calculate the energy produced in each of 
the reactions listed for plutonium breeding just following [link]. The 
pertinent masses are m(?3°U) = 239.054289 u, 

m(?°Np) = 239.052932 u, and m(7°?Pu) = 239.052157 u. 


Solution: 
28 +n > 8U +7481 MeV 
239T] 5 39Np + B~ + v- 0.753 MeV 


239Np > 789Pu + B- + ve 0.211 MeV 
Exercise: 
Problem: 
The naturally occurring radioactive isotope 7??Th does not make good 


fission fuel, because it has an even number of neutrons; however, it can be 
bred into a suitable fuel (much as 7?°U is bred into 7°°P). 


(a) What are Z and NN for 2°?Th? 


(b) Write the reaction equation for neutron captured by 2°?Th and identify 
the nuclide 4X produced in n + Th > 4X + 4. 


(c) The product nucleus @~ decays, as does its daughter. Write the decay 
equations for each, and identify the final nucleus. 


(d) Confirm that the final nucleus has an odd number of neutrons, making it 
a better fission fuel. 


(e) Look up the half-life of the final nucleus to see if it lives long enough to 
be a useful fuel. 


Exercise: 


Problem: 


The electrical power output of a large nuclear reactor facility is 900 MW. It 
has a 35.0% efficiency in converting nuclear power to electrical. 


(a) What is the thermal nuclear power output in megawatts? 


(b) How many 2*°U nuclei fission each second, assuming the average 
fission produces 200 MeV? 


(c) What mass of 7°°U is fissioned in one year of full-power operation? 
Solution: 

(a) 2.57 x 10° MW 

(b) 8.03 x 107° fission/s 


(c) 991 kg 
Exercise: 


Problem: 


A large power reactor that has been in operation for some months is turned 
off, but residual activity in the core still produces 150 MW of power. If the 
average energy per decay of the fission products is 1.00 MeV, what is the 
core activity in curies? 


Glossary 


breeder reactors 
reactors that are designed specifically to make plutonium 


breeding 
reaction process that produces 7°°Pu 


criticality 
condition in which a chain reaction easily becomes self-sustaining 


critical mass 


minimum amount necessary for self-sustained fission of a given nuclide 


fission fragments 
a daughter nuclei 


liquid drop model 
a model of nucleus (only to understand some of its features) in which 
nucleons in a nucleus act like atoms in a drop 


nuclear fission 
reaction in which a nucleus splits 


neutron-induced fission 
fission that is initiated after the absorption of neutron 


supercriticality 
an exponential increase in fissions 


Nuclear Weapons 


e Discuss different types of fission and thermonuclear bombs. 
e Explain the ill effects of nuclear explosion. 


The world was in turmoil when fission was discovered in 1938. The 
discovery of fission, made by two German physicists, Otto Hahn and Fritz 
Strassman, was quickly verified by two Jewish refugees from Nazi 
Germany, Lise Meitner and her nephew Otto Frisch. Fermi, among others, 
soon found that not only did neutrons induce fission; more neutrons were 
produced during fission. The possibility of a self-sustained chain reaction 
was immediately recognized by leading scientists the world over. The 
enormous energy known to be in nuclei, but considered inaccessible, now 
seemed to be available on a large scale. 


Within months after the announcement of the discovery of fission, Adolf 
Hitler banned the export of uranium from newly occupied Czechoslovakia. 
It seemed that the military value of uranium had been recognized in Nazi 
Germany, and that a serious effort to build a nuclear bomb had begun. 


Alarmed scientists, many of them who fled Nazi Germany, decided to take 
action. None was more famous or revered than Einstein. It was felt that his 
help was needed to get the American government to make a serious effort at 
nuclear weapons as a matter of survival. Leo Szilard, an escaped Hungarian 
physicist, took a draft of a letter to Einstein, who, although pacifistic, 
signed the final version. The letter was for President Franklin Roosevelt, 
warning of the German potential to build extremely powerful bombs of a 
new type. It was sent in August of 1939, just before the German invasion of 
Poland that marked the start of World War II. 


It was not until December 6, 1941, the day before the Japanese attack on 
Pearl Harbor, that the United States made a massive commitment to 
building a nuclear bomb. The top secret Manhattan Project was a crash 
program aimed at beating the Germans. It was carried out in remote 
locations, such as Los Alamos, New Mexico, whenever possible, and 
eventually came to cost billions of dollars and employ the efforts of more 
than 100,000 people. J. Robert Oppenheimer (1904-1967), whose talent 
and ambitions made him ideal, was chosen to head the project. The first 


major step was made by Enrico Fermi and his group in December 1942, 
when they achieved the first self-sustained nuclear reactor. This first 
“atomic pile”, built in a squash court at the University of Chicago, used 
carbon blocks to thermalize neutrons. It not only proved that the chain 
reaction was possible, it began the era of nuclear reactors. Glenn Seaborg, 
an American chemist and physicist, received the Nobel Prize in physics in 
1951 for discovery of several transuranic elements, including plutonium. 
Carbon-moderated reactors are relatively inexpensive and simple in design 
and are still used for breeding plutonium, such as at Chernobyl, where two 
such reactors remain in operation. 


Plutonium was recognized as easier to fission with neutrons and, hence, a 
superior fission material very early in the Manhattan Project. Plutonium 
availability was uncertain, and so a uranium bomb was developed 
simultaneously. [link] shows a gun-type bomb, which takes two subcritical 
uranium masses and blows them together. To get an appreciable yield, the 
critical mass must be held together by the explosive charges inside the 
cannon barrel for a few microseconds. Since the buildup of the uranium 
chain reaction is relatively slow, the device to hold the critical mass 
together can be relatively simple. Owing to the fact that the rate of 
spontaneous fission is low, a neutron source is triggered at the same time 


the critical mass is assembled. 


“Gun barrel” 7 
Supercritical 
mass 


Explosive 
propellant *°U/ **U 


mass mass 
Neutron source initiator ' 
(Immediately 
(Before firing) after firing) 


A gun-type fission bomb for 2*°U utilizes 
two subcritical masses forced together by 
explosive charges inside a cannon barrel. 
The energy yield depends on the amount 
of uranium and the time it can be held 
together before it disassembles itself. 


Plutonium’s special properties necessitated a more sophisticated critical 
mass assembly, shown schematically in [link]. A spherical mass of 
plutonium is surrounded by shape charges (high explosives that release 
most of their blast in one direction) that implode the plutonium, crushing it 
into a smaller volume to form a critical mass. The implosion technique is 
faster and more effective, because it compresses three-dimensionally rather 
than one-dimensionally as in the gun-type bomb. Again, a neutron source 


must be triggered at just the correct time to initiate the chain reaction. 
High explosive 
lenses 


Neutron source 
initiator 


An implosion 
created by high 
explosives 
compresses a 
sphere of 7?°Pu 
into a critical mass. 
The superior 
fissionability of 
plutonium has 
made it the 


universal bomb 
material. 


Owing to its complexity, the plutonium bomb needed to be tested before 
there could be any attempt to use it. On July 16, 1945, the test named 
Trinity was conducted in the isolated Alamogordo Desert about 200 miles 
south of Los Alamos (see [link]). A new age had begun. The yield of this 
device was about 10 kilotons (kT), the equivalent of 5000 of the largest 
conventional bombs. 


Trinity test (1945), the 
first nuclear bomb (credit: 
United States Department 

of Energy) 


Although Germany surrendered on May 7, 1945, Japan had been steadfastly 
refusing to surrender for many months, forcing large casualties. Invasion 
plans by the Allies estimated a million casualties of their own and untold 
losses of Japanese lives. The bomb was viewed as a way to end the war. 
The first was a uranium bomb dropped on Hiroshima on August 6. Its yield 
of about 15 kT destroyed the city and killed an estimated 80,000 people, 
with 100,000 more being seriously injured (see [link]). The second was a 
plutonium bomb dropped on Nagasaki only three days later, on August 9. 
Its 20 kT yield killed at least 50,000 people, something less than Hiroshima 
because of the hilly terrain and the fact that it was a few kilometers off 
target. The Japanese were told that one bomb a week would be dropped 


until they surrendered unconditionally, which they did on August 14. In 
actuality, the United States had only enough plutonium for one more and as 
yet unassembled bomb. 


Destruction in Hiroshima 
(credit: United States Federal 
Government) 


Knowing that fusion produces several times more energy per kilogram of 
fuel than fission, some scientists pushed the idea of a fusion bomb starting 
very early on. Calling this bomb the Super, they realized that it could have 
another advantage over fission—high-energy neutrons would aid fusion, 
while they are ineffective in 7°°Pu fission. Thus the fusion bomb could be 
virtually unlimited in energy release. The first such bomb was detonated by 
the United States on October 31, 1952, at Eniwetok Atoll with a yield of 10 
megatons (MT), about 670 times that of the fission bomb that destroyed 
Hiroshima. The Soviets followed with a fusion device of their own in 
August 1953, and a weapons race, beyond the aim of this text to discuss, 
continued until the end of the Cold War. 


[link] shows a simple diagram of how a thermonuclear bomb is constructed. 
A fission bomb is exploded next to fusion fuel in the solid form of lithium 
deuteride. Before the shock wave blows it apart, y rays heat and compress 
the fuel, and neutrons create tritium through the reaction 

n+° Li >? H +4 He. Additional fusion and fission fuels are enclosed in a 
dense shell of 7°°U. The shell reflects some of the neutrons back into the 
fuel to enhance its fusion, but at high internal temperatures fast neutrons are 


created that also cause the plentiful and inexpensive 7°8U to fission, part of 


what allows thermonuclear bombs to be so large. 
Reflector and fission 
material (fast ns) 


Beryllium 
neutron reflector 


Shape 
charges 


Lithium 
deuteride 


239Py and U 


Styrofoam 
with y absorbers 


This schematic of a 
fusion bomb (H- 
bomb) gives some idea 
of how the 7°°Pu 
fission trigger is used 
to ignite fusion fuel. 
Neutrons and ¥ rays 
transmit energy to the 
fusion fuel, create 
tritium from 
deuterium, and heat 
and compress the 
fusion fuel. The outer 
shell of 738U serves to 
reflect some neutrons 
back into the fuel, 
causing more fusion, 


and it boosts the 

energy output by 
fissioning itself when 

neutron energies 
become high enough. 


The energy yield and the types of energy produced by nuclear bombs can be 
varied. Energy yields in current arsenals range from about 0.1 kT to 20 MT, 
although the Soviets once detonated a 67 MT device. Nuclear bombs differ 
from conventional explosives in more than size. [link] shows the 
approximate fraction of energy output in various forms for conventional 
explosives and for two types of nuclear bombs. Nuclear bombs put a much 
larger fraction of their output into thermal energy than do conventional 
bombs, which tend to concentrate the energy in blast. Another difference is 
the immediate and residual radiation energy from nuclear weapons. This 
can be adjusted to put more energy into radiation (the so-called neutron 
bomb) so that the bomb can be used to irradiate advancing troops without 
killing friendly troops with blast and heat. 


(a) Conventional 
chemical bomb 


Thermal 10% 


(b) Conventional 
nuclear bomb 


Delayed 


ae 5 
Prompt radiation 10% 


radiation 5% 


(Cc) Radiation-enhanced 
nuclear bomb 
(neutron bomb) 


radiation 5% 


Approximate 
fractions of energy 
output by 
conventional and 
two types of 
nuclear weapons. 
In addition to 
yielding more 
energy than 
conventional 
weapons, nuclear 


bombs put a much 
larger fraction into 
thermal energy. 
This can be 
adjusted to enhance 
the radiation output 
to be more 
effective against 
troops. An 
enhanced radiation 
bomb is also called 
a neutron bomb. 


At its peak in 1986, the combined arsenals of the United States and the 
Soviet Union totaled about 60,000 nuclear warheads. In addition, the 
British, French, and Chinese each have several hundred bombs of various 
sizes, and a few other countries have a small number. Nuclear weapons are 
generally divided into two categories. Strategic nuclear weapons are those 
intended for military targets, such as bases and missile complexes, and 
moderate to large cities. There were about 20,000 strategic weapons in 
1988. Tactical weapons are intended for use in smaller battles. Since the 
collapse of the Soviet Union and the end of the Cold War in 1989, most of 
the 32,000 tactical weapons (including Cruise missiles, artillery shells, land 
mines, torpedoes, depth charges, and backpacks) have been demobilized, 
and parts of the strategic weapon systems are being dismantled with 
warheads and missiles being disassembled. According to the Treaty of 
Moscow of 2002, Russia and the United States have been required to reduce 
their strategic nuclear arsenal down to about 2000 warheads each. 


A few small countries have built or are capable of building nuclear bombs, 
as are some terrorist groups. Two things are needed—a minimum level of 
technical expertise and sufficient fissionable material. The first is easy. 
Fissionable material is controlled but is also available. There are 
international agreements and organizations that attempt to control nuclear 
proliferation, but it is increasingly difficult given the availability of 


fissionable material and the small amount needed for a crude bomb. The 
production of fissionable fuel itself is technologically difficult. However, 
the presence of large amounts of such material worldwide, though in the 
hands of a few, makes control and accountability crucial. 


Section Summary 


e There are two types of nuclear weapons—fission bombs use fission 
alone, whereas thermonuclear bombs use fission to ignite fusion. 

e Both types of weapons produce huge numbers of nuclear reactions in a 
very short time. 

e Energy yields are measured in kilotons or megatons of equivalent 
conventional explosives and range from 0.1 kT to more than 20 MT. 

e Nuclear bombs are characterized by far more thermal output and 
nuclear radiation output than conventional explosives. 


Conceptual Questions 


Exercise: 
Problem: 
What are some of the reasons that plutonium rather than uranium is 
used in all fission bombs and as the trigger in all fusion bombs? 
Exercise: 
Problem: 
Use the laws of conservation of momentum and energy to explain how 
a shape charge can direct most of the energy released in an explosion 


in a specific direction. (Note that this is similar to the situation in guns 
and cannons—most of the energy goes into the bullet.) 


Exercise: 
Problem: 


How does the lithium deuteride in the thermonuclear bomb shown in 
[link] supply tritium (?H) as well as deuterium (7H)? 


Exercise: 
Problem: 
Fallout from nuclear weapons tests in the atmosphere is mainly 9°Sr 
and !37Cs, which have 28.6- and 32.2-y half-lives, respectively. 
Atmospheric tests were terminated in most countries in 1963, although 
China only did so in 1980. It has been found that environmental 


activities of these two isotopes are decreasing faster than their half- 
lives. Why might this be? 


Problems & Exercises 


Exercise: 


Problem: Find the mass converted into energy by a 12.0-kT bomb. 


Solution: 


0.56 g 


Exercise: 


Problem: What mass is converted into energy by a 1.00-MT bomb? 
Exercise: 

Problem: 

Fusion bombs use neutrons from their fission trigger to create tritium 


fuel in the reaction n +° Li +° H +* He. What is the energy released 
by this reaction in MeV? 


Solution: 


4.781 MeV 


Exercise: 


Problem: 


It is estimated that the total explosive yield of all the nuclear bombs in 
existence currently is about 4,000 MT. 


(a) Convert this amount of energy to kilowatt-hours, noting that 
1kW-h=3.60 x 10° J. 


(b) What would the monetary value of this energy be if it could be 
converted to electricity costing 10 cents per kW-h? 

Exercise: 
Problem: 
A radiation-enhanced nuclear weapon (or neutron bomb) can have a 
smaller total yield and still produce more prompt radiation than a 
conventional nuclear bomb. This allows the use of neutron bombs to 
kill nearby advancing enemy forces with radiation without blowing up 
your own forces with the blast. For a 0.500-kT radiation-enhanced 


weapon and a 1.00-kT conventional nuclear bomb: (a) Compare the 
blast yields. (b) Compare the prompt radiation yields. 


Solution: 


(a) Blast yields 2.1 x 10’* J to 8.4 x 10" J, or 2.5 to 1, conventional 
to radiation enhanced. 


(b) Prompt radiation yields 6.3 x 101! J to 2.1 x 10" J, or 3 to 1, 
radiation enhanced to conventional. 
Exercise: 
Problem: 
(a) How many 7°°Pu nuclei must fission to produce a 20.0-kT yield, 


assuming 200 MeV per fission? (b) What is the mass of this much 
AYP? 


Exercise: 


Problem: 


Assume one-fourth of the yield of a typical 320-kT strategic bomb 
comes from fission reactions averaging 200 MeV and the remainder 
from fusion reactions averaging 20 MeV. 


(a) Calculate the number of fissions and the approximate mass of 
uranium and plutonium fissioned, taking the average atomic mass to be 
238, 


(b) Find the number of fusions and calculate the approximate mass of 
fusion fuel, assuming an average total atomic mass of the two nuclei in 
each reaction to be 5. 


(c) Considering the masses found, does it seem reasonable that some 
missiles could carry 10 warheads? Discuss, noting that the nuclear fuel 
is only a part of the mass of a warhead. 


Solution: 
(a) 1.1 x 10”? fissions , 4.4 kg 
(b) 3.2 x 10° fusions , 2.7 kg 


(c) The nuclear fuel totals only 6 kg, so it is quite reasonable that some 
missiles carry 10 overheads. The mass of the fuel would only be 60 kg 
and therefore the mass of the 10 warheads, weighing about 10 times 
the nuclear fuel, would be only 1500 lbs. If the fuel for the missiles 
weighs 5 times the total weight of the warheads, the missile would 
weigh about 9000 lbs or 4.5 tons. This is not an unreasonable weight 
for a missile. 


Exercise: 


Problem: 


This problem gives some idea of the magnitude of the energy yield of 
a small tactical bomb. Assume that half the energy of a 1.00-kT 
nuclear depth charge set off under an aircraft carrier goes into lifting it 
out of the water—that is, into gravitational potential energy. How high 
is the carrier lifted if its mass is 90,000 tons? 


Exercise: 
Problem: 
It is estimated that weapons tests in the atmosphere have deposited 


approximately 9 MCi of °Sr on the surface of the earth. Find the mass 
of this amount of °Sr. 


Solution: 


7 x 10* g 
Exercise: 
Problem: 


A 1.00-MT bomb exploded a few kilometers above the ground 
deposits 25.0% of its energy into radiant heat. 


(a) Find the calories per cm? at a distance of 10.0 km by assuming a 
uniform distribution over a spherical surface of that radius. 


(b) If this heat falls on a person’s body, what temperature increase does 
it cause in the affected tissue, assuming it is absorbed in a layer 1.00- 
cm deep? 


Exercise: 
Problem: Integrated Concepts 


One scheme to put nuclear weapons to nonmilitary use is to explode 
them underground in a geologically stable region and extract the 


geothermal energy for electricity production. There was a total yield of 
about 4,000 MT in the combined arsenals in 2006. If 1.00 MT per day 
could be converted to electricity with an efficiency of 10.0%: 

(a) What would the average electrical power output be? 

(b) How many years would the arsenal last at this rate? 

Solution: 


(a) 4.86 x 10° W 


(b) 11.0 y 


Introduction to Particle Physics 
class="introduction" 


e Explore the substructures of matter. 
e Define particle physics. 


Part of the 
Large 
Hadron 
Collider at 
CERN, on 
the border 
of 
Switzerland 
and France. 
The LHC is 
a particle 
accelerator, 
designed to 
study 
fundamenta 
| particles. 
(credit: 
Image 
Editor, 
Flickr) 


Following ideas remarkably similar to those of the ancient Greeks, we 
continue to look for smaller and smaller structures in nature, hoping 
ultimately to find and understand the most fundamental building blocks that 
exist. Atomic physics deals with the smallest units of elements and 
compounds. In its study, we have found a relatively small number of atoms 
with systematic properties that explained a tremendous range of 
phenomena. Nuclear physics is concerned with the nuclei of atoms and their 
substructures. Here, a smaller number of components—the proton and 
neutron—make up all nuclei. Exploring the systematic behavior of their 
interactions has revealed even more about matter, forces, and energy. 
Particle physics deals with the substructures of atoms and nuclei and is 
particularly aimed at finding those truly fundamental particles that have no 
further substructure. Just as in atomic and nuclear physics, we have found a 
complex array of particles and properties with systematic characteristics 
analogous to the periodic table and the chart of nuclides. An underlying 
structure is apparent, and there is some reason to think that we are finding 
particles that have no substructure. Of course, we have been in similar 
situations before. For example, atoms were once thought to be the ultimate 
substructure. Perhaps we will find deeper and deeper structures and never 
come to an ultimate substructure. We may never really know, as indicated in 
[link]. 
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The properties of matter are based on substructures called 
molecules and atoms. Atoms have the substructure of a nucleus 
with orbiting electrons, the interactions of which explain atomic 

properties. Protons and neutrons, the interactions of which explain 

the stability and abundance of elements, form the substructure of 
nuclei. Protons and neutrons are not fundamental—they are 

composed of quarks. Like electrons and a few other particles, 
quarks may be the fundamental building blocks of all there is, 
lacking any further substructure. But the story is not complete, 

because quarks and electrons may have substructure smaller than 

details that are presently observable. 


This chapter covers the basics of particle physics as we know it today. An 
amazing convergence of topics is evolving in particle physics. We find that 
some particles are intimately related to forces, and that nature on the 
smallest scale may have its greatest influence on the large-scale character of 
the universe. It is an adventure exceeding the best science fiction because it 
is not only fantastic, it is real. 


Summary 


e Particle physics is the study of and the quest for those truly 
fundamental particles having no substructure. 


Glossary 


particle physics 


the study of and the quest for those truly fundamental particles having 
no substructure 


The Yukawa Particle and the Heisenberg Uncertainty Principle Revisited 


e Define Yukawa particle. 

e State the Heisenberg uncertainty principle. 
e Describe pion. 

e Estimate the mass of a pion. 

e Explain meson. 


Particle physics as we know it today began with the ideas of Hideki Yukawa 
in 1935. Physicists had long been concerned with how forces are 
transmitted, finding the concept of fields, such as electric and magnetic 
fields to be very useful. A field surrounds an object and carries the force 
exerted by the object through space. Yukawa was interested in the strong 
nuclear force in particular and found an ingenious way to explain its short 
range. His idea is a blend of particles, forces, relativity, and quantum 
mechanics that is applicable to all forces. Yukawa proposed that force is 
transmitted by the exchange of particles (called carrier particles). The field 
consists of these carrier particles. 
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The strong 
nuclear 
force is 

transmitted 

between a 

proton and 

neutron by 

the creation 
and 

exchange of 

a pion. The 


pion is 
created 
through a 
temporary 
violation of 
conservatio 
n of mass- 
energy and 
travels from 
the proton 
to the 
neutron and 
is 
recaptured. 
It is not 
directly 
observable 
and is called 
a virtual 
particle. 
Note that 
the proton 
and neutron 
change 
identity in 
the process. 
The range 
of the force 
is limited by 
the fact that 
the pion can 
only exist 
for the short 
time 
allowed by 
the 
Heisenberg 


uncertainty 
principle. 
Yukawa 
used the 
finite range 
of the strong 
nuclear 
force to 
estimate the 
mass of the 
pion; the 
shorter the 
range, the 
larger the 
mass of the 
carrier 
particle. 


Specifically for the strong nuclear force, Yukawa proposed that a previously 
unknown particle, now called a pion, is exchanged between nucleons, 
transmitting the force between them. [link] illustrates how a pion would 
carry a force between a proton and a neutron. The pion has mass and can 
only be created by violating the conservation of mass-energy. This is 
allowed by the Heisenberg uncertainty principle if it occurs for a 
sufficiently short period of time. As discussed in Probability: The 
Heisenberg Uncertainty Principle the Heisenberg uncertainty principle 
relates the uncertainties AF in energy and At in time by 

Equation: 


AEAt > mM 
Art 


where hi is Planck’s constant. Therefore, conservation of mass-energy can 


be violated by an amount AF for a time At = wis in which time no 


process can detect the violation. This allows the temporary creation of a 
particle of mass m, where AE = mc?. The larger the mass and the greater 
the AF, the shorter is the time it can exist. This means the range of the 
force is limited, because the particle can only travel a limited distance in a 
finite amount of time. In fact, the maximum distance is d ~ cAt, where c is 
the speed of light. The pion must then be captured and, thus, cannot be 
directly observed because that would amount to a permanent violation of 
mass-energy conservation. Such particles (like the pion above) are called 
virtual particles, because they cannot be directly observed but their effects 
can be directly observed. Realizing all this, Yukawa used the information on 
the range of the strong nuclear force to estimate the mass of the pion, the 
particle that carries it. The steps of his reasoning are approximately retraced 
in the following worked example: 


Example: 

Calculating the Mass of a Pion 

Taking the range of the strong nuclear force to be about 1 fermi (10°! m), 
calculate the approximate mass of the pion carrying the force, assuming it 
moves at nearly the speed of light. 

Strategy 

The calculation is approximate because of the assumptions made about the 
range of the force and the speed of the pion, but also because a more 
accurate calculation would require the sophisticated mathematics of 
quantum mechanics. Here, we use the Heisenberg uncertainty principle in 
the simple form stated above, as developed in Probability: The Heisenberg 
Uncertainty Principle. First, we must calculate the time Az that the pion 
exists, given that the distance it travels at nearly the speed of light is about 
1 fermi. Then, the Heisenberg uncertainty principle can be solved for the 
energy AF, and from that the mass of the pion can be determined. We will 
use the units of MeV/ c? for mass, which are convenient since we are often 
considering converting mass to energy and vice versa. 

Solution 

The distance the pion travels is d ~ cAt, and so the time during which it 
exists is approximately 

Equation: 


a nlOe 
At c —  3.0x108 m/s 


3.3 x 10°*4 s. 


2 


Now, solving the Heisenberg uncertainty principle for AF gives 
Equation: 

h _ 663x 10 “J-s 
4nAt — 4n(3.3 x 10°74 s) © 


~~ 
~~ 


Solving this and converting the energy to MeV gives 
Equation: 


1 MeV 
AE = (1.6 x 10-4 J) ———_ = 100 Mev. 
16x10 ¥ J 


Mass is related to energy by AE’ = mc’, so that the mass of the pion is 
AE) C7 Or 
Equation: 


m = 100 MeV/c?. 


Discussion 

This is about 200 times the mass of an electron and about one-tenth the 
mass of a nucleon. No such particles were known at the time Yukawa made 
his bold proposal. 


Yukawa’s proposal of particle exchange as the method of force transfer is 
intriguing. But how can we verify his proposal if we cannot observe the 
virtual pion directly? If sufficient energy is in a nucleus, it would be 
possible to free the pion—that is, to create its mass from external energy 
input. This can be accomplished by collisions of energetic particles with 
nuclei, but energies greater than 100 MeV are required to conserve both 
energy and momentum. In 1947, pions were observed in cosmic-ray 
experiments, which were designed to supply a small flux of high-energy 
protons that may collide with nuclei. Soon afterward, accelerators of 


sufficient energy were creating pions in the laboratory under controlled 
conditions. Three pions were discovered, two with charge and one neutral, 
and given the symbols 7+, 7~, and 7°, respectively. The masses of 7* and 
mare identical at 139.6 MeV/ c”, whereas 7° has a mass of 

135.0 MeV/c?. These masses are close to the predicted value of 

100 MeV / c? and, since they are intermediate between electron and nucleon 
masses, the particles are given the name meson (now an entire class of 
particles, as we shall see in Particles, Patterns, and Conservation Laws). 


The pions, or 7-mesons as they are also called, have masses close to those 
predicted and feel the strong nuclear force. Another previously unknown 
particle, now called the muon, was discovered during cosmic-ray 
experiments in 1936 (one of its discoverers, Seth Neddermeyer, also 
originated the idea of implosion for plutonium bombs). Since the mass of a 
muon is around 106 MeV / c?, at first it was thought to be the particle 
predicted by Yukawa. But it was soon realized that muons do not feel the 
strong nuclear force and could not be Yukawa’s particle. Their role was 
unknown, causing the respected physicist I. I. Rabi to comment, “Who 
ordered that?” This remains a valid question today. We have discovered 
hundreds of subatomic particles; the roles of some are only partially 
understood. But there are various patterns and relations to forces that have 
led to profound insights into nature’s secrets. 


Summary 
¢ Yukawa’s idea of virtual particle exchange as the carrier of forces is 
crucial, with virtual particles being formed in temporary violation of 


the conservation of mass-energy as allowed by the Heisenberg 
uncertainty principle. 


Problems & Exercises 


Exercise: 


Problem: 


A virtual particle having an approximate mass of 10'4 GeV /c? may 
be associated with the unification of the strong and electroweak forces. 
For what length of time could this virtual particle exist (in temporary 
violation of the conservation of mass-energy as allowed by the 
Heisenberg uncertainty principle)? 


Solution: 


3x 107% 5 
Exercise: 


Problem: 


Calculate the mass in GeV/c? of a virtual carrier particle that has a 
range limited to 10 °° m by the Heisenberg uncertainty principle. 
Such a particle might be involved in the unification of the strong and 
electroweak forces. 


Exercise: 


Problem: 

Another component of the strong nuclear force is transmitted by the 
exchange of virtual K-mesons. Taking K-mesons to have an average 
mass of 495 MeV / c’, what is the approximate range of this 
component of the strong force? 


Solution: 


1.99 x 10°-'° m (0.2 fm) 


Glossary 


pion 


particle exchanged between nucleons, transmitting the force between 
them 


virtual particles 
particles which cannot be directly observed but their effects can be 
directly observed 


meson 
particle whose mass is intermediate between the electron and nucleon 
masses 


The Four Basic Forces 


e State the four basic forces. 

e Explain the Feynman diagram for the exchange of a virtual photon between two positive charges. 
e Define QED. 

¢ Describe the Feynman diagram for the exchange of a between a proton and a neutron. 


As first discussed in Problem-Solving Strategies and mentioned at various points in the text since then, 
there are only four distinct basic forces in all of nature. This is a remarkably small number considering 
the myriad phenomena they explain. Particle physics is intimately tied to these four forces. Certain 
fundamental particles, called carrier particles, carry these forces, and all particles can be classified 
according to which of the four forces they feel. The table given below summarizes important 
characteristics of the four basic forces. 


+/- 
[footnote] 
+ 
attractive; 
Approximate repulsive; 
relative tp 
Force strength Range both. Carrier particle 
Gravity 10738 oe) + only Graviton (conjectured) 
Electromagnetic 1072 oe) +/- Photon (observed) 
wt,w-,Z° 
Weak force 19713 <10°'8m +/— (observed|footnote]) 
Predicted by theory 
and first observed in 
1983. 
Gluons 
(conjectured| footnote |) 
Strong force 1 <10°%m +/- Biet DODOoe = 


indirect evidence of 
existence. Underlie 
meson exchange. 


Properties of the Four Basic Forces 
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The first image shows the exchange of 
a virtual photon transmitting the 
electromagnetic force between 
charges, just as virtual pion exchange 
carries the strong nuclear force 
between nucleons. The second image 
shows that the photon cannot be 
directly observed in its passage, 
because this would disrupt it and alter 
the force. In this case it does not get to 
the other charge. 


a | 


The Feynman diagram for 
the exchange of a virtual 
photon between two 
positive charges 
illustrates how the 
electromagnetic force is 
transmitted on a quantum 
mechanical scale. Time is 
graphed vertically while 
the distance is graphed 
horizontally. The two 
positive charges are seen 
to be repelled by the 
photon exchange. 


Although these four forces are distinct and differ greatly from one another under all but the most 
extreme circumstances, we can see similarities among them. (In GUTs: the Unification of Forces, we 
will discuss how the four forces may be different manifestations of a single unified force.) Perhaps the 
most important characteristic among the forces is that they are all transmitted by the exchange of a 
carrier particle, exactly like what Yukawa had in mind for the strong nuclear force. Each carrier particle 
is a virtual particle—it cannot be directly observed while transmitting the force. [link] shows the 
exchange of a virtual photon between two positive charges. The photon cannot be directly observed in 
its passage, because this would disrupt it and alter the force. 


[link] shows a way of graphing the exchange of a virtual photon between two positive charges. This 
graph of time versus position is called a Feynman diagram, after the brilliant American physicist 
Richard Feynman (1918-1988) who developed it. 


[link] is a Feynman diagram for the exchange of a virtual pion between a proton and a neutron 
representing the same interaction as in [link]. Feynman diagrams are not only a useful tool for 
visualizing interactions at the quantum mechanical level, they are also used to calculate details of 
interactions, such as their strengths and probability of occurring. Feynman was one of the theorists who 
developed the field of quantum electrodynamics (QED), which is the quantum mechanics of 
electromagnetism. QED has been spectacularly successful in describing electromagnetic interactions on 
the submicroscopic scale. Feynman was an inspiring teacher, had a colorful personality, and made a 
profound impact on generations of physicists. He shared the 1965 Nobel Prize with Julian Schwinger 
and S. I. Tomonaga for work in QED with its deep implications for particle physics. 


Why is it that particles called gluons are listed as the carrier particles for the strong nuclear force when, 
in The Yukawa Particle and the Heisenberg Uncertainty Principle Revisited, we saw that pions 
apparently carry that force? The answer is that pions are exchanged but they have a substructure and, as 
we explore it, we find that the strong force is actually related to the indirectly observed but more 
fundamental gluons. In fact, all the carrier particles are thought to be fundamental in the sense that they 
have no substructure. Another similarity among carrier particles is that they are all bosons (first 
mentioned in Patterns in Spectra Reveal More Quantization), having integral intrinsic spins. 
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The image shows a 
Feynman diagram for the 
exchange of a 7* 
between a proton anda 
neutron, carrying the 
strong nuclear force 
between them. This 
diagram represents the 


situation shown more 
pictorially in [link]. 


There is a relationship between the mass of the carrier particle and the range of the force. The photon is 
massless and has energy. So, the existence of (virtual) photons is possible only by virtue of the 
Heisenberg uncertainty principle and can travel an unlimited distance. Thus, the range of the 
electromagnetic force is infinite. This is also true for gravity. It is infinite in range because its carrier 
particle, the graviton, has zero rest mass. (Gravity is the most difficult of the four forces to understand 
on a quantum scale because it affects the space and time in which the others act. But gravity is so weak 
that its effects are extremely difficult to observe quantum mechanically. We shall explore it further in 
General Relativity and Quantum Gravity). The W+, W~, and Z° particles that carry the weak nuclear 
force have mass, accounting for the very short range of this force. In fact, the W*, W~, and Z° are 
about 1000 times more massive than pions, consistent with the fact that the range of the weak nuclear 
force is about 1/1000 that of the strong nuclear force. Gluons are actually massless, but since they act 
inside massive carrier particles like pions, the strong nuclear force is also short ranged. 


The relative strengths of the forces given in the [link] are those for the most common situations. When 
particles are brought very close together, the relative strengths change, and they may become identical 
at extremely close range. As we shall see in GUTs: the Unification of Forces, carrier particles may be 
altered by the energy required to bring particles very close together—in such a manner that they 
become identical. 


Summary 
e The four basic forces and their carrier particles are summarized in the [link]. 
e Feynman diagrams are graphs of time versus position and are highly useful pictorial 


representations of particle processes. 
¢ The theory of electromagnetism on the particle scale is called quantum electrodynamics (QED). 


Problems & Exercises 


Exercise: 


Problem: 


(a) Find the ratio of the strengths of the weak and electromagnetic forces under ordinary 
circumstances. 


(b) What does that ratio become under circumstances in which the forces are unified? 
Solution: 
(a) 10-* to 1, weak to EM 


(b) 1to1 


Exercise: 


Problem: 


The ratio of the strong to the weak force and the ratio of the strong force to the electromagnetic 
force become 1 under circumstances where they are unified. What are the ratios of the strong force 
to those two forces under normal circumstances? 


Glossary 


Feynman diagram 
a graph of time versus position that describes the exchange of virtual particles between subatomic 
particles 


gluons 
exchange particles, analogous to the exchange of photons that gives rise to the electromagnetic 
force between two charged particles 


quantum electrodynamics 
the theory of electromagnetism on the particle scale 


Accelerators Create Matter from Energy 


e State the principle of a cyclotron. 

e Explain the principle of a synchrotron. 

e Describe the voltage needed by an accelerator between accelerating 
tubes. 

State Fermilab’s accelerator principle. 


Before looking at all the particles we now know about, let us examine some 
of the machines that created them. The fundamental process in creating 
previously unknown particles is to accelerate known particles, such as 
protons or electrons, and direct a beam of them toward a target. Collisions 
with target nuclei provide a wealth of information, such as information 
obtained by Rutherford using energetic helium nuclei from natural a 
radiation. But if the energy of the incoming particles is large enough, new 
matter is sometimes created in the collision. The more energy input or AE, 
the more matter m can be created, since m = AE// c?. Limitations are 
placed on what can occur by known conservation laws, such as 
conservation of mass-energy, momentum, and charge. Even more 
interesting are the unknown limitations provided by nature. Some expected 
reactions do occur, while others do not, and still other unexpected reactions 
may appear. New laws are revealed, and the vast majority of what we know 
about particle physics has come from accelerator laboratories. It is the 
particle physicist’s favorite indoor sport, which is partly inspired by theory. 


Early Accelerators 


An early accelerator is a relatively simple, large-scale version of the 
electron gun. The Van de Graaff (named after the Dutch physicist), which 
you have likely seen in physics demonstrations, is a small version of the 
ones used for nuclear research since their invention for that purpose in 
1932. For more, see [link]. These machines are electrostatic, creating 
potentials as great as 50 MV, and are used to accelerate a variety of nuclei 
for a range of experiments. Energies produced by Van de Graaffs are 
insufficient to produce new particles, but they have been instrumental in 
exploring several aspects of the nucleus. Another, equally famous, early 
accelerator is the cyclotron, invented in 1930 by the American physicist, E. 


O. Lawrence (1901-1958). For a visual representation with more detail, see 
[link]. Cyclotrons use fixed-frequency alternating electric fields to 
accelerate particles. The particles spiral outward in a magnetic field, 
making increasingly larger radius orbits during acceleration. This clever 
arrangement allows the successive addition of electric potential energy and 
So greater particle energies are possible than in a Van de Graaff. Lawrence 
was involved in many early discoveries and in the promotion of physics 
programs in American universities. He was awarded the 1939 Nobel Prize 
in Physics for the cyclotron and nuclear activations, and he has an element 
and two major laboratories named for him. 


A synchrotron is a version of a cyclotron in which the frequency of the 
alternating voltage and the magnetic field strength are increased as the 
beam particles are accelerated. Particles are made to travel the same 
distance in a shorter time with each cycle in fixed-radius orbits. A ring of 
magnets and accelerating tubes, as shown in [link], are the major 
components of synchrotrons. Accelerating voltages are synchronized (i.e., 
occur at the same time) with the particles to accelerate them, hence the 
name. Magnetic field strength is increased to keep the orbital radius 
constant as energy increases. High-energy particles require strong magnetic 
fields to steer them, so superconducting magnets are commonly employed. 
Still limited by achievable magnetic field strengths, synchrotrons need to be 
very large at very high energies, since the radius of a high-energy particle’s 
orbit is very large. Radiation caused by a magnetic field accelerating a 
charged particle perpendicular to its velocity is called synchrotron 
radiation in honor of its importance in these machines. Synchrotron 
radiation has a characteristic spectrum and polarization, and can be 
recognized in cosmic rays, implying large-scale magnetic fields acting on 
energetic and charged particles in deep space. Synchrotron radiation 
produced by accelerators is sometimes used as a source of intense energetic 
electromagnetic radiation for research purposes. 


An artist’s rendition of a 
Van de Graaff generator. 


External beam 


Cyclotrons use a magnetic 
field to cause particles to 
move in circular orbits. As 
the particles pass between 
the plates of the Ds, the 
voltage across the gap is 
oscillated to accelerate 

them twice in each orbit. 


Modern Behemoths and Colliding Beams 


Physicists have built ever-larger machines, first to reduce the wavelength of 
the probe and obtain greater detail, then to put greater energy into collisions 
to create new particles. Each major energy increase brought new 
information, sometimes producing spectacular progress, motivating the next 
step. One major innovation was driven by the desire to create more massive 
particles. Since momentum needs to be conserved in a collision, the 
particles created by a beam hitting a stationary target should recoil. This 
means that part of the energy input goes into recoil kinetic energy, 
significantly limiting the fraction of the beam energy that can be converted 
into new particles. One solution to this problem is to have head-on 
collisions between particles moving in opposite directions. Colliding 
beams are made to meet head-on at points where massive detectors are 
located. Since the total incoming momentum is zero, it is possible to create 
particles with momenta and kinetic energies near zero. Particles with 
masses equivalent to twice the beam energy can thus be created. Another 
innovation is to create the antimatter counterpart of the beam particle, 
which thus has the opposite charge and circulates in the opposite direction 
in the same beam pipe. For a schematic representation, see [link]. 


(a) 


(a) A synchrotron has a ring of magnets and 
accelerating tubes. The frequency of the 
accelerating voltages is increased to cause the 
beam particles to travel the same distance in 
shorter time. The magnetic field should also be 


increased to keep each beam burst traveling in a 
fixed-radius path. Limits on magnetic field 
strength require these machines to be very large 
in order to accelerate particles to very high 
energies. (b) A positive particle is shown in the 
gap between accelerating tubes. (c) While the 
particle passes through the tube, the potentials 
are reversed so that there is another acceleration 
at the next gap. The frequency of the reversals 
needs to be varied as the particle is accelerated 
to achieve successive accelerations in each gap. 


Main ring 
Proton source 


Tevatron rin 
Antiproton source . 


This schematic shows the two rings of 
Fermilab’s accelerator and the scheme for 
colliding protons and antiprotons (not to scale). 


Detectors capable of finding the new particles in the spray of material that 
emerges from colliding beams are as impressive as the accelerators. While 
the Fermilab Tevatron had proton and antiproton beam energies of about 1 
TeV, so that it can create particles up to 2 TeV/c?, the Large Hadron 
Collider (LHC) at the European Center for Nuclear Research (CERN) has 
achieved beam energies of 3.5 TeV, so that it has a 7-TeV collision energy; 
CERN hopes to double the beam energy in 2014. The now-canceled 
Superconducting Super Collider was being constructed in Texas with a 


design energy of 20 TeV to give a 40-TeV collision energy. It was to be an 
oval 30 km in diameter. Its cost as well as the politics of international 
research funding led to its demise. 


In addition to the large synchrotrons that produce colliding beams of 
protons and antiprotons, there are other large electron-positron accelerators. 
The oldest of these was a straight-line or linear accelerator, called the 
Stanford Linear Accelerator (SLAC), which accelerated particles up to 50 
GeV as seen in [link]. Positrons created by the accelerator were brought to 
the same energy and collided with electrons in specially designed detectors. 
Linear accelerators use accelerating tubes similar to those in synchrotrons, 
but aligned in a straight line. This helps eliminate synchrotron radiation 
losses, which are particularly severe for electrons made to follow curved 
paths. CERN had an electron-positron collider appropriately called the 
Large Electron-Positron Collider (LEP), which accelerated particles to 100 
GeV and created a collision energy of 200 GeV. It was 8.5 km in diameter, 
while the SLAC machine was 3.2 km long. 
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The Stanford Linear Accelerator was 3.2 km 
long and had the capability of colliding 
electron and positron beams. SLAC was also 


used to probe nucleons by scattering 
extremely short wavelength electrons from 
them. This produced the first convincing 
evidence of a quark structure inside 
nucleons in an experiment analogous to 
those performed by Rutherford long ago. 


Example: 

Calculating the Voltage Needed by the Accelerator Between 
Accelerating Tubes 

A linear accelerator designed to produce a beam of 800-MeV protons has 
2000 accelerating tubes. What average voltage must be applied between 
tubes (such as in the gaps in [link]) to achieve the desired energy? 
Strategy 

The energy given to the proton in each gap between tubes is PEeje. = qV 
where q is the proton’s charge and V is the potential difference (voltage) 
across the gap. Since gq = qe = 1.6 x 10 19 C and 

LeV = (1 V)(1.6 x 10> C), the proton gains 1 eV in energy for each 
volt across the gap that it passes through. The AC voltage applied to the 
tubes is timed so that it adds to the energy in each gap. The effective 
voltage is the sum of the gap voltages and equals 800 MV to give each 
proton an energy of 800 MeV. 

Solution 

There are 2000 gaps and the sum of the voltages across them is 800 MV; 
thus, 

Equation: 


800 MV 
Veen = ~ 9000 _ = 400 kV. 


Discussion 
A voltage of this magnitude is not difficult to achieve in a vacuum. Much 
larger gap voltages would be required for higher energy, such as those at 


the 50-GeV SLAC facility. Synchrotrons are aided by the circular path of 
the accelerated particles, which can orbit many times, effectively 
multiplying the number of accelerations by the number of orbits. This 
makes it possible to reach energies greater than 1 TeV. 


Summary 


e A variety of particle accelerators have been used to explore the nature 
of subatomic particles and to test predictions of particle theories. 

e Modern accelerators used in particle physics are either large 
synchrotrons or linear accelerators. 

e The use of colliding beams makes much greater energy available for 
the creation of particles, and collisions between matter and antimatter 
allow a greater range of final products. 


Conceptual Questions 


Exercise: 
Problem: 
The total energy in the beam of an accelerator is far greater than the 


energy of the individual beam particles. Why isn’t this total energy 
available to create a single extremely massive particle? 


Exercise: 
Problem: 
Synchrotron radiation takes energy from an accelerator beam and is 


related to acceleration. Why would you expect the problem to be more 
severe for electron accelerators than proton accelerators? 


Exercise: 


Problem: 


What two major limitations prevent us from building high-energy 
accelerators that are physically small? 


Exercise: 


Problem: 
What are the advantages of colliding-beam accelerators? What are the 
disadvantages? 

Problems & Exercises 


Exercise: 


Problem: 

At full energy, protons in the 2.00-km-diameter Fermilab synchrotron 
travel at nearly the speed of light, since their energy is about 1000 
times their rest mass energy. 

(a) How long does it take for a proton to complete one trip around? 
(b) How many times per second will it pass through the target area? 
Solution: 


(a) 2.09 %:10°6 


(b) 4.77 x 104 Hz 


Exercise: 


Problem: 


Suppose a W ~~ created in a bubble chamber lives for 5.00 x 10°2° s. 
What distance does it move in this time if it is traveling at 0.900 c? 
Since this distance is too short to make a track, the presence of the 

W must be inferred from its decay products. Note that the time is 
longer than the given W ~ lifetime, which can be due to the statistical 
nature of decay or time dilation. 


Exercise: 
Problem: 
What length track does a 7* traveling at 0.100 c leave in a bubble 
chamber if it is created there and lives for 2.60 x 10°~° s? (Those 


moving faster or living longer may escape the detector before 
decaying.) 


Solution: 


78.0 cm 
Exercise: 
Problem: 
The 3.20-km-long SLAC produces a beam of 50.0-GeV electrons. If 


there are 15,000 accelerating tubes, what average voltage must be 
across the gaps between them to achieve this energy? 


Exercise: 
Problem: 
Because of energy loss due to synchrotron radiation in the LHC at 
CERN, only 5.00 MeV is added to the energy of each proton during 
each revolution around the main ring. How many revolutions are 


needed to produce 7.00-TeV (7000 GeV) protons, if they are injected 
with an initial energy of 8.00 GeV? 


Solution: 


1.40 x 10° 

Exercise: 
Problem: 
A proton and an antiproton collide head-on, with each having a kinetic 
energy of 7.00 TeV (such as in the LHC at CERN). How much 
collision energy is available, taking into account the annihilation of the 


two masses? (Note that this is not significantly greater than the 
extremely relativistic kinetic energy.) 


Exercise: 
Problem: 
When an electron and positron collide at the SLAC facility, they each 
have 50.0 GeV kinetic energies. What is the total collision energy 
available, taking into account the annihilation energy? Note that the 


annihilation energy is insignificant, because the electrons are highly 
relativistic. 


Solution: 


100 GeV 


Glossary 


colliding beams 
head-on collisions between particles moving in opposite directions 


cyclotron 
accelerator that uses fixed-frequency alternating electric fields and 
fixed magnets to accelerate particles in a circular spiral path 


linear accelerator 
accelerator that accelerates particles in a straight line 


synchrotron 


a version of a cyclotron in which the frequency of the alternating 
voltage and the magnetic field strength are increased as the beam 
particles are accelerated 


synchrotron radiation 
radiation caused by a magnetic field accelerating a charged particle 
perpendicular to its velocity 


Van de Graaff 
early accelerator: simple, large-scale version of the electron gun 


Particles, Patterns, and Conservation Laws 


e Define matter and antimatter. 
¢ Outline the differences between hadrons and leptons. 
e State the differences between mesons and baryons. 


In the early 1930s only a small number of subatomic particles were known to exist—the proton, neutron, electron, 
photon and, indirectly, the neutrino. Nature seemed relatively simple in some ways, but mysterious in others. Why, 
for example, should the particle that carries positive charge be almost 2000 times as massive as the one carrying 
negative charge? Why does a neutral particle like the neutron have a magnetic moment? Does this imply an 
internal structure with a distribution of moving charges? Why is it that the electron seems to have no size other 
than its wavelength, while the proton and neutron are about 1 fermi in size? So, while the number of known 
particles was small and they explained a great deal of atomic and nuclear phenomena, there were many 
unexplained phenomena and hints of further substructures. 


Things soon became more complicated, both in theory and in the prediction and discovery of new particles. In 
1928, the British physicist P.A.M. Dirac (see [link]) developed a highly successful relativistic quantum theory that 
laid the foundations of quantum electrodynamics (QED). His theory, for example, explained electron spin and 
magnetic moment in a natural way. But Dirac’s theory also predicted negative energy states for free electrons. By 
1931, Dirac, along with Oppenheimer, realized this was a prediction of positively charged electrons (or positrons). 
In 1932, American physicist Carl Anderson discovered the positron in cosmic ray studies. The positron, or e~ , is 
the same particle as emitted in 8 decay and was the first antimatter that was discovered. In 1935, Yukawa 
predicted pions as the carriers of the strong nuclear force, and they were eventually discovered. Muons were 
discovered in cosmic ray experiments in 1937, and they seemed to be heavy, unstable versions of electrons and 
positrons. After World War II, accelerators energetic enough to create these particles were built. Not only were 
predicted and known particles created, but many unexpected particles were observed. Initially called elementary 
particles, their numbers proliferated to dozens and then hundreds, and the term “particle zoo” became the 
physicist’s lament at the lack of simplicity. But patterns were observed in the particle zoo that led to simplifying 
ideas such as quarks, as Ewe shall soon see. 


P.A.M. Dirac’s 
theory of 
relativistic quantum 
mechanics not only 
explained a great 
deal of what was 


known, it also 
predicted 
antimatter. (credit: 
Cambridge 
University, 
Cavendish 
Laboratory) 


Matter and Antimatter 


The positron was only the first example of antimatter. Every particle in nature has an antimatter counterpart, 
although some particles, like the photon, are their own antiparticles. Antimatter has charge opposite to that of 
matter (for example, the positron is positive while the electron is negative) but is nearly identical otherwise, having 
the same mass, intrinsic spin, half-life, and so on. When a particle and its antimatter counterpart interact, they 
annihilate one another, usually totally converting their masses to pure energy in the form of photons as seen in 
[link]. Neutral particles, such as neutrons, have neutral antimatter counterparts, which also annihilate when they 
interact. Certain neutral particles are their own antiparticle and live correspondingly short lives. For example, the 
neutral pion 7° is its own antiparticle and has a half-life about 10~® shorter than 1* and 7, which are each 
other’s antiparticles. Without exception, nature is symmetric—all particles have antimatter counterparts. For 
example, antiprotons and antineutrons were first created in accelerator experiments in 1956 and the antiproton is 
negative. Antihydrogen atoms, consisting of an antiproton and antielectron, were observed in 1995 at CERN, too. 
It is possible to contain large-scale antimatter particles such as antiprotons by using electromagnetic traps that 
confine the particles within a magnetic field so that they don't annihilate with other particles. However, particles of 
the same charge repel each other, so the more particles that are contained in a trap, the more energy is needed to 
power the magnetic field that contains them. It is not currently possible to store a significant quantity of 
antiprotons. At any rate, we now see that negative charge is associated with both low-mass (electrons) and high- 
mass particles (antiprotons) and the apparent asymmetry is not there. But this knowledge does raise another 
question—why is there such a predominance of matter and so little antimatter? Possible explanations emerge later 
in this and the next chapter. 


Hadrons and Leptons 


Particles can also be revealingly grouped according to what forces they feel between them. All particles (even 
those that are massless) are affected by gravity, since gravity affects the space and time in which particles exist. All 
charged particles are affected by the electromagnetic force, as are neutral particles that have an internal distribution 
of charge (such as the neutron with its magnetic moment). Special names are given to particles that feel the strong 
and weak nuclear forces. Hadrons are particles that feel the strong nuclear force, whereas leptons are particles 
that do not. The proton, neutron, and the pions are examples of hadrons. The electron, positron, muons, and 
neutrinos are examples of leptons, the name meaning low mass. Leptons feel the weak nuclear force. In fact, all 
particles feel the weak nuclear force. This means that hadrons are distinguished by being able to feel both the 
strong and weak nuclear forces. 


[link] lists the characteristics of some of the most important subatomic particles, including the directly observed 
carrier particles for the electromagnetic and weak nuclear forces, all leptons, and some hadrons. Several hints 
related to an underlying substructure emerge from an examination of these particle characteristics. Note that the 
carrier particles are called gauge bosons. First mentioned in Patterns in Spectra Reveal More Quantization, a 
boson is a particle with zero or an integer value of intrinsic spin (such as s = 0, 1, 2, ...), whereas a fermion is a 
particle with a half-integer value of intrinsic spin (s = 1/2, 3/2, ...). Fermions obey the Pauli exclusion principle 
whereas bosons do not. All the known and conjectured carrier particles are bosons. 
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When a particle 


encounters its 
antiparticle, they 
annihilate, often 
producing pure energy 
in the form of photons. 
In this case, an 
electron and a positron 
convert all their mass 
into two identical 
energy rays, which 
move away in opposite 
directions to keep total 
momentum zero as it 
was before. Similar 
annihilations occur for 
other combinations of 
a particle with its 
antiparticle, 
sometimes producing 
more particles while 
obeying all 
conservation laws. 


Particle 
Category name 
Gauge Photon 
WwW 
Bosons 
Z 
Leptons 
Electron 


Symbol 


wr 


Zo 


Self 


Self 


Antiparticle 


Rest mass 


(MeV/c’) B 


80.39 x 10° 0 


91.19 x 10° 0 


0.511 0 


+1 


Neutrino 


(e) 


Muon 


Neutrino 


(u) 


Tau 


Neutrino 


(7) 


Hadrons (selected) 


Pion 


Mesons 


Kaon 


Eta 


(many other mesons known) 


Un 


Ur 


K?® 


Ve 


Self 


Self 


0(7.0eV) 


[footnote] 
Neutrino 
masses may 
be zero. 
Experimental 
upper limits 
are given in 
parentheses. 


105.7 


0(< 0.27) 


1777 


0(< 31) 


139.6 


135.0 


493.7 


497.6 


547.9 


+1 


ze 


+1 


+1 


+1 


| 
+ 


Proton Pp p 938.3 1 0 0 0 
= + 
Neutron n na 939.6 1 0 0 0 
0 0 + 
Lambda A A 1115.7 1 0 0 0 
st 7 1189.4 + 1 oo 0 0 
zr 1 
Baryons 
7 0 0 + 
Sigma > s 1192.6 ; | 0 0 0 
- : 1197.4 + 10 0 0 
x z ; 1 
50 = 1314.9 ; | 0 0 0 
Xi 
+ 
= Bt 1321.7 1 0 0 0 
Omega an at 1672.5 7 | 0 0 0 


(many other baryons known) 


Selected Particle Characteristics| footnote | 
The lower of the + or + symbols are the values for antiparticles. 


All known leptons are listed in the table given above. There are only six leptons (and their antiparticles), and they 
seem to be fundamental in that they have no apparent underlying structure. Leptons have no discernible size other 
than their wavelength, so that we know they are pointlike down to about 10~!® m. The leptons fall into three 
families, implying three conservation laws for three quantum numbers. One of these was known from 6 decay, 
where the existence of the electron’s neutrino implied that a new quantum number, called the electron family 


number L, is conserved. Thus, in § decay, an antielectron’s neutrino ve Must be created with L. = —1 when an 
electron with L,=+1 is created, so that the total remains 0 as it was before decay. 


Once the muon was discovered in cosmic rays, its decay mode was found to be 
Equation: 


Bw +e +¥et+vy,, 


which implied another “family” and associated conservation principle. The particle v,, is a muon’s neutrino, and it 
is created to conserve muon family numberL,,. So muons are leptons with a family of their own, and 
conservation of total L,, also seems to be obeyed in many experiments. 


More recently, a third lepton family was discovered when 7 particles were created and observed to decay ina 
manner similar to muons. One principal decay mode is 
Equation: 


T Sp + yt Ur. 


Conservation of total L., seems to be another law obeyed in many experiments. In fact, particle experiments have 
found that lepton family number is not universally conserved, due to neutrino “oscillations,” or transformations of 
neutrinos from one family type to another. 


Mesons and Baryons 


Now, note that the hadrons in the table given above are divided into two subgroups, called mesons (originally for 
medium mass) and baryons (the name originally meaning large mass). The division between mesons and baryons 
is actually based on their observed decay modes and is not strictly associated with their masses. Mesons are 
hadrons that can decay to leptons and leave no hadrons, which implies that mesons are not conserved in number. 
Baryons are hadrons that always decay to another baryon. A new physical quantity called baryon number B 
seems to always be conserved in nature and is listed for the various particles in the table given above. Mesons and 
leptons have B = 0 so that they can decay to other particles with B = 0. But baryons have B=-+1 if they are 
matter, and B = —1 if they are antimatter. The conservation of total baryon number is a more general rule than 
first noted in nuclear physics, where it was observed that the total number of nucleons was always conserved in 
nuclear reactions and decays. That rule in nuclear physics is just one consequence of the conservation of the total 
baryon number. 


Forces, Reactions, and Reaction Rates 


The forces that act between particles regulate how they interact with other particles. For example, pions feel the 
strong force and do not penetrate as far in matter as do muons, which do not feel the strong force. (This was the 
way those who discovered the muon knew it could not be the particle that carries the strong force—its penetration 
or range was too great for it to be feeling the strong force.) Similarly, reactions that create other particles, like 
cosmic rays interacting with nuclei in the atmosphere, have greater probability if they are caused by the strong 
force than if they are caused by the weak force. Such knowledge has been useful to physicists while analyzing the 
particles produced by various accelerators. 


The forces experienced by particles also govern how particles interact with themselves if they are unstable and 
decay. For example, the stronger the force, the faster they decay and the shorter is their lifetime. An example of a 
nuclear decay via the strong force is °Be + a + a with a lifetime of about 10~1° s. The neutron is a good 


example of decay via the weak force. The process n —> p + e~ + v¢ has a longer lifetime of 882 s. The weak 
force causes this decay, as it does all 6 decay. An important clue that the weak force is responsible for 8 decay is 


the creation of leptons, such as e~ and ve. None would be created if the strong force was responsible, just as no 
leptons are created in the decay of ®Be. The systematics of particle lifetimes is a little simpler than nuclear 
lifetimes when hundreds of particles are examined (not just the ones in the table given above). Particles that decay 
via the weak force have lifetimes mostly in the range of 10~'° to 10-1? s, whereas those that decay via the strong 
force have lifetimes mostly in the range of 10~*° to 10-8 s. Turning this around, if we measure the lifetime of a 
particle, we can tell if it decays via the weak or strong force. 


Yet another quantum number emerges from decay lifetimes and patterns. Note that the particles A, ©, &, and Q 
decay with lifetimes on the order of 10~'° s (the exception is 5°, whose short lifetime is explained by its 
particular quark substructure.), implying that their decay is caused by the weak force alone, although they are 
hadrons and feel the strong force. The decay modes of these particles also show patterns—in particular, certain 
decays that should be possible within all the known conservation laws do not occur. Whenever something is 
possible in physics, it will happen. If something does not happen, it is forbidden by a rule. All this seemed strange 
to those studying these particles when they were first discovered, so they named a new quantum number 
strangeness, given the symbol S in the table given above. The values of strangeness assigned to various particles 
are based on the decay systematics. It is found that strangeness is conserved by the strong force, which governs 
the production of most of these particles in accelerator experiments. However, strangeness is not conserved by 
the weak force. This conclusion is reached from the fact that particles that have long lifetimes decay via the weak 
force and do not conserve strangeness. All of this also has implications for the carrier particles, since they transmit 
forces and are thus involved in these decays. 


Example: 

Calculating Quantum Numbers in Two Decays 

(a) The most common decay mode of the S~ particle is =~ —> A° + 2~. Using the quantum numbers in the table 
given above, show that strangeness changes by 1, baryon number and charge are conserved, and lepton family 
numbers are unaffected. 

(b) Is the decay K* — yx* + v, allowed, given the quantum numbers in the table given above? 

Strategy 

In part (a), the conservation laws can be examined by adding the quantum numbers of the decay products and 
comparing them with the parent particle. In part (b), the same procedure can reveal if a conservation law is broken 
or not. 

Solution for (a) 

Before the decay, the = has strangeness S = —2. After the decay, the total strangeness is —1 for the AY plus 0 
for the 7. Thus, total strangeness has gone from —2 to —1 or a change of +1. Baryon number for the Sis 

B= +1 before the decay, and after the decay the A° has B = +1 and the r~ has B = 0so that the total baryon 
number remains +1. Charge is —1 before the decay, and the total charge after is also 0 — 1 = —1. Lepton numbers 
for all the particles are zero, and so lepton numbers are conserved. 

Discussion for (a) 

The = decay is caused by the weak interaction, since strangeness changes, and it is consistent with the relatively 
long 1.64 x 10~*°-s lifetime of the 2. 

Solution for (b) 

The decay K* — p* + v,, is allowed if charge, baryon number, mass-energy, and lepton numbers are conserved. 
Strangeness can change due to the weak interaction. Charge is conserved as s —> d. Baryon number is conserved, 
since all particles have B = 0. Mass-energy is conserved in the sense that the K * has a greater mass than the 
products, so that the decay can be spontaneous. Lepton family numbers are conserved at 0 for the electron and tau 
family for all particles. The muon family number is L,, = 0 before and L,, = —1 + 1 = 0 after. Strangeness 
changes from +1 before to 0 + 0 after, for an allowed change of 1. The decay is allowed by all these measures. 
Discussion for (b) 

This decay is not only allowed by our reckoning, it is, in fact, the primary decay mode of the A * meson and is 
caused by the weak force, consistent with the long 1.24 x 10~®-s lifetime. 


There are hundreds of particles, all hadrons, not listed in [link], most of which have shorter lifetimes. The 
systematics of those particle lifetimes, their production probabilities, and decay products are completely consistent 
with the conservation laws noted for lepton families, baryon number, and strangeness, but they also imply other 
quantum numbers and conservation laws. There are a finite, and in fact relatively small, number of these conserved 
quantities, however, implying a finite set of substructures. Additionally, some of these short-lived particles 
resemble the excited states of other particles, implying an internal structure. All of this jigsaw puzzle can be tied 
together and explained relatively simply by the existence of fundamental substructures. Leptons seem to be 


fundamental structures. Hadrons seem to have a substructure called quarks. Quarks: Is That All There Is? explores 
the basics of the underlying quark building blocks. 


Murray Gell-Mann 
(b. 1929) proposed 
quarks as a 
substructure of 
hadrons in 1963 
and was already 
known for his work 
on the concept of 
strangeness. 
Although quarks 
have never been 
directly observed, 
several predictions 
of the quark model 
were quickly 
confirmed, and 
their properties 
explain all known 
hadron 
characteristics. 
Gell-Mann was 
awarded the Nobel 
Prize in 1969. 
(credit: Lubos 
Motl) 


Summary 


e All particles of matter have an antimatter counterpart that has the opposite charge and certain other quantum 
numbers as seen in [link]. These matter-antimatter pairs are otherwise very similar but will annihilate when 
brought together. Known particles can be divided into three major groups—leptons, hadrons, and carrier 
particles (gauge bosons). 


¢ Leptons do not feel the strong nuclear force and are further divided into three groups—electron family 
designated by electron family number L.; muon family designated by muon family number L,,; and tau 
family designated by tau family number L,. The family numbers are not universally conserved due to 
neutrino oscillations. 

e Hadrons are particles that feel the strong nuclear force and are divided into baryons, with the baryon family 
number B being conserved, and mesons. 


Conceptual Questions 


Exercise: 
Problem: 
Large quantities of antimatter isolated from normal matter should behave exactly like normal matter. An 
antiatom, for example, composed of positrons, antiprotons, and antineutrons should have the same atomic 


spectrum as its matter counterpart. Would you be able to tell it is antimatter by its emission of antiphotons? 
Explain briefly. 


Exercise: 


Problem: Massless particles are not only neutral, they are chargeless (unlike the neutron). Why is this so? 
Exercise: 

Problem: 

Massless particles must travel at the speed of light, while others cannot reach this speed. Why are all massless 


particles stable? If evidence is found that neutrinos spontaneously decay into other particles, would this imply 
they have mass? 


Exercise: 
Problem: 
When a Star erupts in a supernova explosion, huge numbers of electron neutrinos are formed in nuclear 
reactions. Such neutrinos from the 1987A supernova in the relatively nearby Magellanic Cloud were observed 
within hours of the initial brightening, indicating they traveled to earth at approximately the speed of light. 
Explain how this data can be used to set an upper limit on the mass of the neutrino, noting that if the mass is 


small the neutrinos could travel very close to the speed of light and have a reasonable energy (on the order of 
MeV). 


Exercise: 


Problem: 


Theorists have had spectacular success in predicting previously unknown particles. Considering past 
theoretical triumphs, why should we bother to perform experiments? 


Exercise: 


Problem: What lifetime do you expect for an antineutron isolated from normal matter? 


Exercise: 


Problem:Why does the 7° meson have such a short lifetime compared to most other mesons? 


Exercise: 


Problem: (a) Is a hadron always a baryon? 


(b) Is a baryon always a hadron? 


(c) Can an unstable baryon decay into a meson, leaving no other baryon? 
Exercise: 


Problem: 
Explain how conservation of baryon number is responsible for conservation of total atomic mass (total 
number of nucleons) in nuclear decay and reactions. 

Problems & Exercises 


Exercise: 


Problem: 


The 7° is its own antiparticle and decays in the following manner: 7° — + y. What is the energy of each y 
ray if the 7° is at rest when it decays? 


Solution: 


67.5 MeV 
Exercise: 


Problem: 


The primary decay mode for the negative pion is m= — yo~ + Vi. What is the energy release in MeV in this 
decay? 


Exercise: 


Problem: 


The mass of a theoretical particle that may be associated with the unification of the electroweak and strong 
forces is 10! GeV/c?. 


(a) How many proton masses is this? 


(b) How many electron masses is this? (This indicates how extremely relativistic the accelerator would have 
to be in order to make the particle, and how large the relativistic quantity -~ would have to be.) 


Solution: 

(a)1 x 10” 

(b) 2 x 101” 
Exercise: 

Problem: The decay mode of the negative muon is u~ + e~ + Ve + Vp. 

(a) Find the energy released in MeV. 

(b) Verify that charge and lepton family numbers are conserved. 
Exercise: 

Problem: The decay mode of the positive tau is T* — pot +, + i 


(a) What energy is released? 


(b) Verify that charge and lepton family numbers are conserved. 


(c) The T* is the antiparticle of the 7~. Verify that all the decay products of the 7* are the antiparticles of 
those in the decay of the 7~ given in the text. 


Solution: 
(a) 1671 MeV 
(b) Q = 1, Q=14040=1.L, =-1; bir =-1; Lu =0; Lys = -1+1=0 


T > p ++ Ur 


=p” antiparticle of u*; vu, of vp; v7 of vu; 


(c) 


Exercise: 


Problem: The principal decay mode of the sigma zero is es Ae Ys 
(a) What energy is released? 


(b) Considering the quark structure of the two baryons, does it appear that the 1° is an excited state of the A° 
? 


(c) Verify that strangeness, charge, and baryon number are conserved in the decay. 


(d) Considering the preceding and the short lifetime, can the weak force be responsible? State why or why 
not. 


Exercise: 
Problem: (a) What is the uncertainty in the energy released in the decay of a 1° due to its short lifetime? 


(b) What fraction of the decay energy is this, noting that the decay mode is 7° —> + + ¥ (so that all the 7° 
mass is destroyed)? 


Solution: 
(a) 3.9 eV 
(b) 2.9 x 10°* 

Exercise: 
Problem: (a) What is the uncertainty in the energy released in the decay of at due to its short lifetime? 
(b) Is the uncertainty in this energy greater than or less than the uncertainty in the mass of the tau neutrino? 
Discuss the source of the uncertainty. 

Glossary 


boson 
particle with zero or an integer value of intrinsic spin 


baryons 
hadrons that always decay to another baryon 


baryon number 


a conserved physical quantity that is zero for mesons and leptons and +1 for baryons and antibaryons, 
respectively 


conservation of total baryon number 
a general rule based on the observation that the total number of nucleons was always conserved in nuclear 
reactions and decays 


conservation of total electron family number 
a general rule stating that the total electron family number stays the same through an interaction 


conservation of total muon family number 
a general rule stating that the total muon family number stays the same through an interaction 


electron family number 
the number +1 that is assigned to all members of the electron family, or the number 0 that is assigned to all 
particles not in the electron family 


fermion 
particle with a half-integer value of intrinsic spin 


gauge boson 
particle that carries one of the four forces 


hadrons 
particles that feel the strong nuclear force 


leptons 
particles that do not feel the strong nuclear force 


meson 
hadrons that can decay to leptons and leave no hadrons 


muon family number 
the number +1 that is assigned to all members of the muon family, or the number 0 that is assigned to all 
particles not in the muon family 


strangeness 
a physical quantity assigned to various particles based on decay systematics 


tau family number 
the number +1 that is assigned to all members of the tau family, or the number 0 that is assigned to all 
particles not in the tau family 


Quarks: Is That All There Is? 


e Define fundamental particle. 

e Describe quark and antiquark. 

List the flavors of quark. 

¢ Outline the quark composition of hadrons. 

e Determine quantum numbers from quark composition. 


Quarks have been mentioned at various points in this text as fundamental building blocks and members of the 
exclusive club of truly elementary particles. Note that an elementary or fundamental particle has no substructure 
(it is not made of other particles) and has no finite size other than its wavelength. This does not mean that 
fundamental particles are stable—some decay, while others do not. Keep in mind that all leptons seem to be 
fundamental, whereasno hadrons are fundamental. There is strong evidence that quarks are the fundamental 
building blocks of hadrons as seen in [link]. Quarks are the second group of fundamental particles (leptons are the 
first). The third and perhaps final group of fundamental particles is the carrier particles for the four basic forces. 
Leptons, quarks, and carrier particles may be all there is. In this module we will discuss the quark substructure of 
hadrons and its relationship to forces as well as indicate some remaining questions and problems. 
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All baryons, such as the proton and neutron shown here, 
are composed of three quarks. All mesons, such as the 
pions shown here, are composed of a quark-antiquark 

pair. Arrows represent the spins of the quarks, which, as 
we shall see, are also colored. The colors are such that 
they need to add to white for any possible combination 

of quarks. 


Conception of Quarks 


Quarks were first proposed independently by American physicists Murray Gell-Mann and George Zweig in 1963. 
Their quaint name was taken by Gell-Mann from a James Joyce novel—Gell-Mann was also largely responsible 
for the concept and name of strangeness. (Whimsical names are common in particle physics, reflecting the 
personalities of modern physicists.) Originally, three quark types—or flavors—were proposed to account for the 
then-known mesons and baryons. These quark flavors are named up (u), down (d), and strange (s). All quarks 
have half-integral spin and are thus fermions. All mesons have integral spin while all baryons have half-integral 
spin. Therefore, mesons should be made up of an even number of quarks while baryons need to be made up of an 
odd number of quarks. [link] shows the quark substructure of the proton, neutron, and two pions. The most radical 
proposal by Gell-Mann and Zweig is the fractional charges of quarks, which are + (4) de and ($)4e, whereas all 
directly observed particles have charges that are integral multiples of g-. Note that the fractional value of the quark 
does not violate the fact that the e is the smallest unit of charge that is observed, because a free quark cannot exist. 
[link] lists characteristics of the six quark flavors that are now thought to exist. Discoveries made since 1963 have 
required extra quark flavors, which are divided into three families quite analogous to leptons. 


How Does it Work? 


To understand how these quark substructures work, let us specifically examine the proton, neutron, and the two 
pions pictured in [link] before moving on to more general considerations. First, the proton p is composed of the 


three quarks uud, so that its total charge is +4 ( H )qe ( 5 )qe ( ; )de = de, as expected. With the spins aligned 


as in the figure, the proton’s intrinsic spin is 4 ( ; ) ( ; ) ( 5 ) = ( ; ); also as expected. Note that the spins of 
the up quarks are aligned, so that they would be in the same state except that they have different colors (another 
quantum number to be elaborated upon a little later). Quarks obey the Pauli exclusion principle. Similar comments 
apply to the neutron n, which is composed of the three quarks udd. Note also that the neutron is made of charges 
that add to zero but move internally, producing its well-known magnetic moment. When the neutron G~ decays, it 
does so by changing the flavor of one of its quarks. Writing neutron @~ decay in terms of quarks, 

Equation: 


n—+p+ + 4v-_ becomes udd > uud + 8 + v¢. 


We see that this is equivalent to a down quark changing flavor to become an up quark: 


Equation: 
d>ut+Bp +4. 
B 
[footnote] 
Bis baryon 
number, S 
is 
strangeness, 
cis charm, 
bis 
bottomness, S . b 
Name Symbol Antiparticle Spin Charge t is topness. 
_ 2 1 
Up uU u 1/2 +—de +— 0 0 0 
3 3 
Down d 7 1/2 es oe 0) 0 0) 
Ww) ae iach 
d 3 qe 3 
_ 1 1 
Strange s 5 1/2 F-—de +— #1 0 0 
3 3 
= 2 1 
Charmed c Cc 1/2 + 3 de + 3 0 +1 0 


Bottom b h 1/2 — +— 0 0 
b 3 de 3 
_ 2 1 

Top t t 1/2 + 3 de + 3 0 0 


Quarks and Antiquarks| footnote | 
The lower of the + symbols are the values for antiquarks. 


Particle Quark Composition 
Mesons 
x 7 
w ud 
7 ud 
UU 
a = 
dd 
mixture footnote | 


These two mesons are different mixtures, but each is its own antiparticle, as indicated by its 
quark composition. 


mixture| footnote | 
These two mesons are different mixtures, but each is its own antiparticle, as indicated by its 
quark composition. 


K® ds 


Particle Quark Composition 


K = 
ds 
Kt us 
Kk~ us 
I/p ce 
y bb 


Baryons[footnote],[footnote] 


Antibaryons have the antiquarks of their counterparts. The antiproton p is uud, for example. 
Baryons composed of the same quarks are different states of the same particle. For example, the A* is an 
excited state of the proton. 


Pp uud 
n udd 
A? udd 
At uud 
AT ddd 
Att uuu 


Ao uds 


Particle Quark Composition 


= uds 
xt uus 
a7 dds 
= uss 
=e dss 
Q- sss 


Quark Composition of Selected Hadrons| footnote] 
These two mesons are different mixtures, but each is its own antiparticle, as indicated by its quark composition. 


This is an example of the general fact that the weak nuclear force can change the flavor of a quark. By general, 
we mean that any quark can be converted to any other (change flavor) by the weak nuclear force. Not only can we 
get d — u, we can also get wu — d. Furthermore, the strange quark can be changed by the weak force, too, making 
s — wand s — d possible. This explains the violation of the conservation of strangeness by the weak force noted 
in the preceding section. Another general fact is that the strong nuclear force cannot change the flavor of a 
quark. 


Again, from [link], we see that the 7* meson (one of the three pions) is composed of an up quark plus an antidown 


quark, or ud. Its total charge is thus + (4) qe == (+) ae = de, as expected. Its baryon number is 0, since it has a 


quark and an antiquark with baryon numbers + (+) — (3) = 0. The z* half-life is relatively long since, although 


it is composed of matter and antimatter, the quarks are different flavors and the weak force should cause the decay 
by changing the flavor of one into that of the other. The spins of the u and d quarks are antiparallel, enabling the 
pion to have spin zero, as observed experimentally. Finally, the ~ meson shown in [link] is the antiparticle of the 


a* meson, and it is composed of the corresponding quark antiparticles. That is, the 7* meson is ud, while the 1 


meson is ud. These two pions annihilate each other quickly, because their constituent quarks are each other’s 
antiparticles. 


Two general rules for combining quarks to form hadrons are: 


1. Baryons are composed of three quarks, and antibaryons are composed of three antiquarks. 
2. Mesons are combinations of a quark and an antiquark. 


One of the clever things about this scheme is that only integral charges result, even though the quarks have 
fractional charge. 


All Combinations are Possible 


All quark combinations are possible. [link] lists some of these combinations. When Gell-Mann and Zweig 
proposed the original three quark flavors, particles corresponding to all combinations of those three had not been 
observed. The pattern was there, but it was incomplete—much as had been the case in the periodic table of the 
elements and the chart of nuclides. The 2~ particle, in particular, had not been discovered but was predicted by 
quark theory. Its combination of three strange quarks, sss, gives it a strangeness of —3 (see [link]) and other 
predictable characteristics, such as spin, charge, approximate mass, and lifetime. If the quark picture is complete, 
the Q~ should exist. It was first observed in 1964 at Brookhaven National Laboratory and had the predicted 
characteristics as seen in [link]. The discovery of the Q~ was convincing indirect evidence for the existence of the 
three original quark flavors and boosted theoretical and experimental efforts to further explore particle physics in 
terms of quarks. 


Note: 

Patterns and Puzzles: Atoms, Nuclei, and Quarks 

Patterns in the properties of atoms allowed the periodic table to be developed. From it, previously unknown 
elements were predicted and observed. Similarly, patterns were observed in the properties of nuclei, leading to the 
chart of nuclides and successful predictions of previously unknown nuclides. Now with particle physics, patterns 
imply a quark substructure that, if taken literally, predicts previously unknown particles. These have now been 
observed in another triumph of underlying unity. 


The image relates to the 
discovery of the Q~. It isa 
secondary reaction in which 

an accelerator-produced K ~ 

collides with a proton via the 
strong force and conserves 
strangeness to produce the 

Q~ with characteristics 

predicted by the quark 

model. As with other 
predictions of previously 
unobserved particles, this 
gave a tremendous boost to 
quark theory. (credit: 
Brookhaven National 
Laboratory) 


Example: 

Quantum Numbers From Quark Composition 

Verify the quantum numbers given for the 2° particle in [link] by adding the quantum numbers for its quark 
composition as given in [link]. 

Strategy 

The composition of the 2° is given as uss in [link]. The quantum numbers for the constituent quarks are given in 
[link]. We will not consider spin, because that is not given for the =°. But we can check on charge and the other 
quantum numbers given for the quarks. 


Solution 

The total charge of uss is + ( ; )qe ( 4 )de ( 4 )qe = 0, which is correct for the 2’. The baryon number is 
+(¥) + (3) + (3) = 1, also correct since the =° is a matter baryon and has B = 1, as listed in [link]. Its 
strangeness is S = 0 — 1 — 1 = —2, also as expected from [link]. Its charm, bottomness, and topness are 0, as are 
its lepton family numbers (it is not a lepton). 

Discussion 


This procedure is similar to what the inventors of the quark hypothesis did when checking to see if their solution 
to the puzzle of particle patterns was correct. They also checked to see if all combinations were known, thereby 
predicting the previously unobserved ()~ as the completion of a pattern. 


Now, Let Us Talk About Direct Evidence 


At first, physicists expected that, with sufficient energy, we should be able to free quarks and observe them 
directly. This has not proved possible. There is still no direct observation of a fractional charge or any isolated 
quark. When large energies are put into collisions, other particles are created—but no quarks emerge. There is 
nearly direct evidence for quarks that is quite compelling. By 1967, experiments at SLAC scattering 20-GeV 
electrons from protons had produced results like Rutherford had obtained for the nucleus nearly 60 years earlier. 
The SLAC scattering experiments showed unambiguously that there were three pointlike (meaning they had sizes 
considerably smaller than the probe’s wavelength) charges inside the proton as seen in [link]. This evidence made 
all but the most skeptical admit that there was validity to the quark substructure of hadrons. 


Proton 


Scattering of high-energy 
electrons from protons at 
facilities like SLAC 
produces evidence of 
three point-like charges 
consistent with proposed 
quark properties. This 
experiment is analogous 
to Rutherford’s discovery 


of the small size of the 
nucleus by scattering a 
particles. High-energy 
electrons are used so that 
the probe wavelength is 
small enough to see 
details smaller than the 
proton. 


More recent and higher-energy experiments have produced jets of particles in collisions, highly suggestive of three 
quarks in a nucleon. Since the quarks are very tightly bound, energy put into separating them pulls them only so 
far apart before it starts being converted into other particles. More energy produces more particles, not a separation 
of quarks. Conservation of momentum requires that the particles come out in jets along the three paths in which 
the quarks were being pulled. Note that there are only three jets, and that other characteristics of the particles are 
consistent with the three-quark substructure. 


Simulation of a proton-proton 
collision at 14-TeV center-of- 


mass energy in the ALICE 
detector at CERN LHC. The 
lines follow particle trajectories 
and the cyan dots represent the 
energy depositions in the 
sensitive detector elements. 
(credit: Matevz Tadel) 


Quarks Have Their Ups and Downs 


The quark model actually lost some of its early popularity because the original model with three quarks had to be 
modified. The up and down quarks seemed to compose normal matter as seen in [link], while the single strange 
quark explained strangeness. Why didn’t it have a counterpart? A fourth quark flavor called charm (c) was 
proposed as the counterpart of the strange quark to make things symmetric—there would be two normal quarks (u 
and d) and two exotic quarks (s and c). Furthermore, at that time only four leptons were known, two normal and 
two exotic. It was attractive that there would be four quarks and four leptons. The problem was that no known 
particles contained a charmed quark. Suddenly, in November of 1974, two groups (one headed by C. C. Ting at 
Brookhaven National Laboratory and the other by Burton Richter at SLAC) independently and nearly 


simultaneously discovered a new meson with characteristics that made it clear that its substructure is cc. It was 
called J by one group and psi (7) by the other and now is known as the J/w meson. Since then, numerous 


particles have been discovered containing the charmed quark, consistent in every way with the quark model. The 
discovery of the J/) meson had such a rejuvenating effect on quark theory that it is now called the November 
Revolution. Ting and Richter shared the 1976 Nobel Prize. 


History quickly repeated itself. In 1975, the tau (7) was discovered, and a third family of leptons emerged as seen 
in [link]). Theorists quickly proposed two more quark flavors called top (t) or truth and bottom (b) or beauty to 
keep the number of quarks the same as the number of leptons. And in 1976, the upsilon (1) meson was discovered 


and shown to be composed of a bottom and an antibottom quark or bb, quite analogous to the J/ being cc as 
seen in [link]. Being a single flavor, these mesons are sometimes called bare charm and bare bottom and reveal the 
characteristics of their quarks most clearly. Other mesons containing bottom quarks have since been observed. In 
1995, two groups at Fermilab confirmed the top quark’s existence, completing the picture of six quarks listed in 
[link]. Each successive quark discovery—first c, then b, and finally t —has required higher energy because each 
has higher mass. Quark masses in [link] are only approximately known, because they are not directly observed. 
They must be inferred from the masses of the particles they combine to form. 


What’s Color got to do with it?—A Whiter Shade of Pale 


As mentioned and shown in [link], quarks carry another quantum number, which we call color. Of course, it is not 
the color we sense with visible light, but its properties are analogous to those of three primary and three secondary 
colors. Specifically, a quark can have one of three color values we call red (R), green (G), and blue (B) in 


analogy to those primary visible colors. Antiquarks have three values we call antired or cyan| F }, antigreen or 


magenta (<) , and antiblue or yellow (2) in analogy to those secondary visible colors. The reason for these 


names is that when certain visual colors are combined, the eye sees white. The analogy of the colors combining to 
white is used to explain why baryons are made of three quarks, why mesons are a quark and an antiquark, and why 
we cannot isolate a single quark. The force between the quarks is such that their combined colors produce white. 
This is illustrated in [link]. A baryon must have one of each primary color or RGB, which produces white. A 
meson must have a primary color and its anticolor, also producing white. 


Baryon 


The three quarks composing a baryon must 
be RGB, which add to white. The quark and 
antiquark composing a meson must be a 


color and anticolor, here RR also adding to 
white. The force between systems that have 
color is so great that they can neither be 
separated nor exist as colored. 


Why must hadrons be white? The color scheme is intentionally devised to explain why baryons have three quarks 
and mesons have a quark and an antiquark. Quark color is thought to be similar to charge, but with more values. 
An ion, by analogy, exerts much stronger forces than a neutral molecule. When the color of a combination of 
quarks is white, it is like a neutral atom. The forces a white particle exerts are like the polarization forces in 
molecules, but in hadrons these leftovers are the strong nuclear force. When a combination of quarks has color 
other than white, it exerts extremely large forces—even larger than the strong force—and perhaps cannot be stable 
or permanently separated. This is part of the theory of quark confinement, which explains how quarks can exist 
and yet never be isolated or directly observed. Finally, an extra quantum number with three values (like those we 


assign to color) is necessary for quarks to obey the Pauli exclusion principle. Particles such as the , which is 
composed of three strange quarks, sss, and the A**, which is three up quarks, uuu, can exist because the quarks 
have different colors and do not have the same quantum numbers. Color is consistent with all observations and is 
now widely accepted. Quark theory including color is called quantum chromodynamics (QCD), also named by 
Gell-Mann. 


The Three Families 


Fundamental particles are thought to be one of three types—leptons, quarks, or carrier particles. Each of those 
three types is further divided into three analogous families as illustrated in [link]. We have examined leptons and 
quarks in some detail. Each has six members (and their six antiparticles) divided into three analogous families. The 
first family is normal matter, of which most things are composed. The second is exotic, and the third more exotic 
and more massive than the second. The only stable particles are in the first family, which also has unstable 
members. 


Always searching for symmetry and similarity, physicists have also divided the carrier particles into three families, 
omitting the graviton. Gravity is special among the four forces in that it affects the space and time in which the 
other forces exist and is proving most difficult to include in a Theory of Everything or TOE (to stub the pretension 
of such a theory). Gravity is thus often set apart. It is not certain that there is meaning in the groupings shown in 
[link], but the analogies are tempting. In the past, we have been able to make significant advances by looking for 
analogies and patterns, and this is an example of one under current scrutiny. There are connections between the 
families of leptons, in that the 7 decays into the yw and the yp into the e. Similarly for quarks, the higher families 
eventually decay into the lowest, leaving only u and d quarks. We have long sought connections between the forces 
in nature. Since these are carried by particles, we will explore connections between gluons, W* and Z°, and 
photons as part of the search for unification of forces discussed in GUTs: The Unification of Forces.. 
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The three types of particles are 
leptons, quarks, and carrier particles. 
Each of those types is divided into 
three analogous families, with the 
graviton left out. 


Summary 


e Hadrons are thought to be composed of quarks, with baryons having three quarks and mesons having a quark 
and an antiquark. 

e The characteristics of the six quarks and their antiquark counterparts are given in [link], and the quark 
compositions of certain hadrons are given in [link]. 


e Indirect evidence for quarks is very strong, explaining all known hadrons and their quantum numbers, such as 
strangeness, charm, topness, and bottomness. 


¢ Quarks come in six flavors and three colors and occur only in combinations that produce white. 


e Fundamental particles have no further substructure, not even a size beyond their de Broglie wavelength. 
e There are three types of fundamental particles—leptons, quarks, and carrier particles. Each type is divided 
into three analogous families as indicated in [link]. 


Conceptual Questions 


Exercise: 
Problem: 
The quark flavor change d — w takes place in 8” decay. Does this mean that the reverse quark flavor change 


u — d takes place in G+ decay? Justify your response by writing the decay in terms of the quark constituents, 
noting that it looks as if a proton is converted into a neutron in B* decay. 


Exercise: 


Problem: Explain how the weak force can change strangeness by changing quark flavor. 
Exercise: 
Problem: 
Beta decay is caused by the weak force, as are all reactions in which strangeness changes. Does this imply 
that the weak force can change quark flavor? Explain. 
Exercise: 


Problem: 


Why is it easier to see the properties of the c, b, and t quarks in mesons having composition W ~ or tt rather 
than in baryons having a mixture of quarks, such as udb? 


Exercise: 
Problem: 
How can quarks, which are fermions, combine to form bosons? Why must an even number combine to form a 
boson? Give one example by stating the quark substructure of a boson. 
Exercise: 
Problem: 
What evidence is cited to support the contention that the gluon force between quarks is greater than the strong 
nuclear force between hadrons? How is this related to color? Is it also related to quark confinement? 
Exercise: 
Problem: 
Discuss how we know that 7-mesons (7+ ,7r,7°) are not fundamental particles and are not the basic carriers 
of the strong force. 


Exercise: 


Problem: An antibaryon has three antiquarks with colors RGB. What is its color? 
Exercise: 
Problem: 


Suppose leptons are created in a reaction. Does this imply the weak force is acting? (for example, consider 3 
decay.) 


Exercise: 
Problem: 
How can the lifetime of a particle indicate that its decay is caused by the strong nuclear force? How can a 


change in strangeness imply which force is responsible for a reaction? What does a change in quark flavor 
imply about the force that is responsible? 


Exercise: 


Problem:(a) Do all particles having strangeness also have at least one strange quark in them? 


(b) Do all hadrons with a strange quark also have nonzero strangeness? 
Exercise: 
Problem: 
The sigma-zero particle decays mostly via the reaction ee A y. Explain how this decay and the 
respective quark compositions imply that the D° is an excited state of the A°. 
Exercise: 
Problem: 
What do the quark compositions and other quantum numbers imply about the relationships between the A* 
and the proton? The A° and the neutron? 
Exercise: 
Problem: 
Discuss the similarities and differences between the photon and the Z ° in terms of particle properties, 
including forces felt. 


Exercise: 


Problem: Identify evidence for electroweak unification. 
Exercise: 


Problem: 
The quarks in a particle are confined, meaning individual quarks cannot be directly observed. Are gluons 
confined as well? Explain 

Problems & Exercises 


Exercise: 


Problem: (a) Verify from its quark composition that the A* particle could be an excited state of the proton. 


(b) There is a spread of about 100 MeV in the decay energy of the A‘, interpreted as uncertainty due to its 
short lifetime. What is its approximate lifetime? 


(c) Does its decay proceed via the strong or weak force? 
Solution: 


(a) The uud composition is the same as for a proton. 


(b) 3.3 x 107% s 


(c) Strong (short lifetime) 
Exercise: 


Problem: 


Accelerators such as the Triangle Universities Meson Facility (TRIUMF) in British Columbia produce 
secondary beams of pions by having an intense primary proton beam strike a target. Such “meson factories” 
have been used for many years to study the interaction of pions with nuclei and, hence, the strong nuclear 
force. One reaction that occurs is t* + p> A** —+ 2* + p, where the A*™ is a very short-lived particle. 
The graph in [link] shows the probability of this reaction as a function of energy. The width of the bump is the 
uncertainty in energy due to the short lifetime of the A*~. 


(a) Find this lifetime. 


(b) Verify from the quark composition of the particles that this reaction annihilates and then re-creates a d 


quark anda d antiquark by writing the reaction and decay in terms of quarks. 


(c) Draw a Feynman diagram of the production and decay of the oa showing the individual quarks 
involved. 
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This graph shows the probability of an 
interaction between a 7~ and a proton as a 
function of energy. The bump is interpreted 
as a very short lived particle called a A*~. 
The approximately 100-MeV width of the 


bump is due to the short lifetime of the A** 


Exercise: 


Problem: 


The reaction 7* + p —» A** (described in the preceding problem) takes place via the strong force. (a) What 
is the baryon number of the A** particle? 


(b) Draw a Feynman diagram of the reaction showing the individual quarks involved. 
Solution: 


a) A** (uuu); B 


II 
ole 
+ 
we 
+ 
wl 
II 
eS 


b) 


Exercise: 


Problem: One of the decay modes of the omega minus is Q~ —> Botan. 

(a) What is the change in strangeness? 

(b) Verify that baryon number and charge are conserved, while lepton numbers are unaffected. 

(c) Write the equation in terms of the constituent quarks, indicating that the weak force is responsible. 
Exercise: 

Problem: Repeat the previous problem for the decay mode Q7 — A°o+K-. 

Solution: 

(a) +1 

(b) B=1=1+40, Z == 0+ (-1), all lepton numbers are 0 before and after 

(c) (sss) + (uds) + (us) 


Exercise: 


Problem: One decay mode for the eta-zero meson is 7° + y +y. 

(a) Find the energy released. 

(b) What is the uncertainty in the energy due to the short lifetime? 

(c) Write the decay in terms of the constituent quarks. 

(d) Verify that baryon number, lepton numbers, and charge are conserved. 
Exercise: 

Problem: One decay mode for the eta-zero meson is 7° —> 1° + 79, 

(a) Write the decay in terms of the quark constituents. 

(b) How much energy is released? 


(c) What is the ultimate release of energy, given the decay mode for the pi zero is 79 > y + y? 


Solution: 


(a) (uu + dd) + (uu+ dd) + (uu +dd) 


(b) 277.9 MeV 


(c) 547.9 MeV 
Exercise: 


Problem: 


Is the decay n — e* + 7 possible considering the appropriate conservation laws? State why or why not. 
Exercise: 


Problem: 


Is the decay uw + e + + Vv, possible considering the appropriate conservation laws? State why or why 
not. 


Solution: 


No. Charge = —1 is conserved. L,, = 0 # L,, = 2 is not conserved. L,, = 1 is conserved. 
Exercise: 


Problem: 
(a) Is the decay Mo +n+7° possible considering the appropriate conservation laws? State why or why not. 


(b) Write the decay in terms of the quark constituents of the particles. 
Exercise: 


Problem: 


(a) Is the decay & ~—+ +77 possible considering the appropriate conservation laws? State why or why 
not. (b) Write the decay in terms of the quark constituents of the particles. 


Solution: 


(a)Yes. Z = —1 = 0+ (—1), B=1=1+ 0, all lepton family numbers are 0 before and after, spontaneous 
since mass greater before reaction. 


(b) dds + udd + ud 
Exercise: 


Problem: 


The only combination of quark colors that produces a white baryon is RGB. Identify all the color 
combinations that can produce a white meson. 


Exercise: 


Problem: 


(a) Three quarks form a baryon. How many combinations of the six known quarks are there if all 
combinations are possible? 


(b) This number is less than the number of known baryons. Explain why. 


Solution: 


(a) 216 
(b) There are more baryons observed because we have the 6 antiquarks and various mixtures of quarks (as for 
the m-meson) as well. 
Exercise: 
Problem: 
(a) Show that the conjectured decay of the proton, p + 7° + e*, violates conservation of baryon number and 
conservation of lepton number. 
(b) What is the analogous decay process for the antiproton? 
Exercise: 
Problem: 


Verify the quantum numbers given for the Q* in [link] by adding the quantum numbers for its quark 
constituents as inferred from [link]. 


Solution: 

Q+(s8s) 

=, 1 1 _ 
B= 3 3 3 > 1, 


L., p,T =0+04+0=0, 

Q=f¢ot7Hh 

S=14+141=3. 
Exercise: 


Problem: 


Verify the quantum numbers given for the proton and neutron in [link] by adding the quantum numbers for 
their quark constituents as given in [link]. 


Exercise: 


Problem: 
(a) How much energy would be released if the proton did decay via the conjectured reaction p > 7° + e*? 


(b) Given that the 7° decays to two y s and that the e* will find an electron to annihilate, what total energy is 
ultimately produced in proton decay? 


(c) Why is this energy greater than the proton’s total mass (converted to energy)? 
Solution: 

(a)803 MeV 

(b) 938.8 MeV 


(c) The annihilation energy of an extra electron is included in the total energy. 
Exercise: 
Problem: 


(a) Find the charge, baryon number, strangeness, charm, and bottomness of the J/W particle from its quark 
composition. 


(b) Do the same for the Y particle. 
Exercise: 


Problem: 


There are particles called D-mesons. One of them is the D* meson, which has a single positive charge and a 
baryon number of zero, also the value of its strangeness, topness, and bottomness. It has a charm of +1. What 
is its quark configuration? 


Solution: 


cd 
Exercise: 
Problem: 
There are particles called bottom mesons or B-mesons. One of them is the B~ meson, which has a single 


negative charge; its baryon number is zero, as are its strangeness, charm, and topness. It has a bottomness of 
—1. What is its quark configuration? 


Exercise: 


Problem: (a) What particle has the quark composition wud? 
(b) What should its decay mode be? 
Solution: 
a)The antiproton 
b)p > 7° + e7 
Exercise: 


Problem: 


(a) Show that all combinations of three quarks produce integral charges. Thus baryons must have integral 
charge. 


(b) Show that all combinations of a quark and an antiquark produce only integral charges. Thus mesons must 
have integral charge. 
Glossary 


bottom 
a quark flavor 


charm 
a quark flavor, which is the counterpart of the strange quark 


color 
a quark flavor 


down 
the second-lightest of all quarks 


flavors 


quark type 


fundamental particle 
particle with no substructure 


quantum chromodynamics 
quark theory including color 


quark 
an elementary particle and a fundamental constituent of matter 


strange 
the third lightest of all quarks 


theory of quark confinement 
explains how quarks can exist and yet never be isolated or directly observed 


top 
a quark flavor 


up 
the lightest of all quarks 


GUTs: The Unification of Forces 


e State the grand unified theory. 

e Explain the electroweak theory. 

¢ Define gluons. 

¢ Describe the principle of quantum chromodynamics. 
Define the standard model. 


Present quests to show that the four basic forces are different manifestations 
of a single unified force follow a long tradition. In the 19th century, the 
distinct electric and magnetic forces were shown to be intimately connected 
and are now collectively called the electromagnetic force. More recently, 
the weak nuclear force has been shown to be connected to the 
electromagnetic force in a manner suggesting that a theory may be 
constructed in which all four forces are unified. Certainly, there are 
similarities in how forces are transmitted by the exchange of carrier 
particles, and the carrier particles themselves (the gauge bosons in [Link]) 
are also similar in important ways. The analogy to the unification of electric 
and magnetic forces is quite good—the four forces are distinct under 
normal circumstances, but there are hints of connections even on the atomic 
scale, and there may be conditions under which the forces are intimately 
related and even indistinguishable. The search for a correct theory linking 
the forces, called the Grand Unified Theory (GUT), is explored in this 
section in the realm of particle physics. Frontiers of Physics expands the 
story in making a connection with cosmology, on the opposite end of the 
distance scale. 


[link] is a Feynman diagram showing how the weak nuclear force is 
transmitted by the carrier particle Z 0 similar to the diagrams in [link] and 
[link] for the electromagnetic and strong nuclear forces. In the 1960s, a 
gauge theory, called electroweak theory, was developed by Steven 
Weinberg, Sheldon Glashow, and Abdus Salam and proposed that the 
electromagnetic and weak forces are identical at sufficiently high energies. 
One of its predictions, in addition to describing both electromagnetic and 
weak force phenomena, was the existence of the W*,W_, and Z° carrier 
particles. Not only were three particles having spin 1 predicted, the mass of 
the W* and W~ was predicted to be 81 GeV / c’, and that of the Z° was 


predicted to be 90 GeV /c?. (Their masses had to be about 1000 times that 
of the pion, or about 100 GeV/ c’, since the range of the weak force is 
about 1000 times less than the strong force carried by virtual pions.) In 
1983, these carrier particles were observed at CERN with the predicted 
characteristics, including masses having the predicted values as seen in 
[link]. This was another triumph of particle theory and experimental effort, 
resulting in the 1984 Nobel Prize to the experiment’s group leaders Carlo 
Rubbia and Simon van der Meer. Theorists Weinberg, Glashow, and Salam 
had already been honored with the 1979 Nobel Prize for other aspects of 


electroweak theory. 
t 


The exchange of a virtual 
Z° carries the weak 
nuclear force between an 
electron and a neutrino in 
this Feynman diagram. 
The Z° is one of the 
carrier particles for the 
weak nuclear force that 
has now been created in 
the laboratory with 
characteristics predicted 
by electroweak theory. 


Although the weak nuclear force is very short ranged (< 101° m, as 
indicated in [link]), its effects on atomic levels can be measured given the 
extreme precision of modern techniques. Since electrons spend some time 
in the nucleus, their energies are affected, and spectra can even indicate new 
aspects of the weak force, such as the possibility of other carrier particles. 
So systems many orders of magnitude larger than the range of the weak 
force supply evidence of electroweak unification in addition to evidence 
found at the particle scale. 


Gluons (g) are the proposed carrier particles for the strong nuclear force, 
although they are not directly observed. Like quarks, gluons may be 
confined to systems having a total color of white. Less is known about 
gluons than the fact that they are the carriers of the weak and certainly of 
the electromagnetic force. QCD theory calls for eight gluons, all massless 
and all spin 1. Six of the gluons carry a color and an anticolor, while two do 
not carry color, as illustrated in [link](a). There is indirect evidence of the 
existence of gluons in nucleons. When high-energy electrons are scattered 
from nucleons and evidence of quarks is seen, the momenta of the quarks 
are smaller than they would be if there were no gluons. That means that the 
gluons carrying force between quarks also carry some momentum, inferred 
by the already indirect quark momentum measurements. At any rate, the 
gluons carry color charge and can change the colors of quarks when 
exchanged, as seen in [link](b). In the figure, a red down quark interacts 
with a green strange quark by sending it a gluon. That gluon carries red 


away from the down quark and leaves it green, because it is an RG (red- 
antigreen) gluon. (Taking antigreen away leaves you green.) Its 
antigreenness kills the green in the strange quark, and its redness turns the 
quark red. 
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In figure (a), the eight types of gluons that carry 
the strong nuclear force are divided into a group of 
six that carry color and a group of two that do not. 

Figure (b) shows that the exchange of gluons 
between quarks carries the strong force and may 
change the color of a quark. 


The strong force is complicated, since observable particles that feel the 
strong force (hadrons) contain multiple quarks. [link] shows the quark and 
gluon details of pion exchange between a proton and a neutron as illustrated 
earlier in [link] and [link]. The quarks within the proton and neutron move 
along together exchanging gluons, until the proton and neutron get close 
together. As the wu quark leaves the proton, a gluon creates a pair of virtual 


particles, a d quark and a d antiquark. The d quark stays behind and the 
proton turns into a neutron, while the u and d move together as a 7* ((link] 


confirms the ud composition for the 7*.) The d annihilates a d quark in the 
neutron, the u joins the neutron, and the neutron becomes a proton. A pion 
is exchanged and a force is transmitted. 


Neutron Proton 


x Proton Neutron 


This Feynman diagram is 
the same interaction as 
shown in [link], but it shows 


the quark and gluon details 
of the strong force 
interaction. 


It is beyond the scope of this text to go into more detail on the types of 
quark and gluon interactions that underlie the observable particles, but the 
theory (quantum chromodynamics or QCD) is very self-consistent. So 
successful have QCD and the electroweak theory been that, taken together, 
they are called the Standard Model. Advances in knowledge are expected 
to modify, but not overthrow, the Standard Model of particle physics and 
forces. 


Note: 

Making Connections: Unification of Forces 

Grand Unified Theory (GUT) is successful in describing the four forces as 
distinct under normal circumstances, but connected in fundamental ways. 
Experiments have verified that the weak and electromagnetic force become 
identical at very small distances and provide the GUT description of the 
carrier particles for the forces. GUT predicts that the other forces become 
identical under conditions so extreme that they cannot be tested in the 
laboratory, although there may be lingering evidence of them in the 
evolution of the universe. GUT is also successful in describing a system of 
carrier particles for all four forces, but there is much to be done, 
particularly in the realm of gravity. 


How can forces be unified? They are definitely distinct under most 
circumstances, for example, being carried by different particles and having 
greatly different strengths. But experiments show that at extremely small 
distances, the strengths of the forces begin to become more similar. In fact, 
electroweak theory’s prediction of the W*, W~, and Z° carrier particles 
was based on the strengths of the two forces being identical at extremely 
small distances as seen in [link]. As discussed in case of the creation of 


virtual particles for extremely short times, the small distances or short 
ranges correspond to the large masses of the carrier particles and the 
correspondingly large energies needed to create them. Thus, the energy 
scale on the horizontal axis of [link] corresponds to smaller and smaller 
distances, with 100 GeV corresponding to approximately, 10~ ‘8m for 
example. At that distance, the strengths of the EM and weak forces are the 
same. To test physics at that distance, energies of about 100 GeV must be 
put into the system, and that is sufficient to create and release the W*, W_, 
and Z° carrier particles. At those and higher energies, the masses of the 
carrier particles becomes less and less relevant, and the Z° in particular 
resembles the massless, chargeless, spin 1 photon. In fact, there is enough 
energy when things are pushed to even smaller distances to transform the, 
and Z° into massless carrier particles more similar to photons and gluons. 
These have not been observed experimentally, but there is a prediction of an 
associated particle called the Higgs boson. The mass of this particle is not 
predicted with nearly the certainty with which the mass of the W*, W, 
and Z° particles were predicted, but it was hoped that the Higgs boson 
could be observed at the now-canceled Superconducting Super Collider 
(SSC). Ongoing experiments at the Large Hadron Collider at CERN have 
presented some evidence for a Higgs boson with a mass of 125 GeV, and 
there is a possibility of a direct discovery during 2012. The existence of this 
more massive particle would give validity to the theory that the carrier 


particles are identical under certain circumstances. 
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The relative strengths of the four basic 
forces vary with distance and, hence, 
energy is needed to probe small 
distances. At ordinary energies (a few 
eV or less), the forces differ greatly as 


indicated in [link]. However, at 
energies available at accelerators, the 
weak and EM forces become 
identical, or unified. Unfortunately, 
the energies at which the strong and 
electroweak forces become the same 
are unreachable even in principle at 
any conceivable accelerator. The 
universe may provide a laboratory, 
and nature may show effects at 
ordinary energies that give us clues 
about the validity of this graph. 


The small distances and high energies at which the electroweak force 
becomes identical with the strong nuclear force are not reachable with any 
conceivable human-built accelerator. At energies of about 10’ GeV 
(16,000 J per particle), distances of about 10°-°° m can be probed. Such 
energies are needed to test theory directly, but these are about 10?° higher 
than the proposed giant SSC would have had, and the distances are about 
10-1? smaller than any structure we have direct knowledge of. This would 
be the realm of various GUTs, of which there are many since there is no 
constraining evidence at these energies and distances. Past experience has 
shown that any time you probe so many orders of magnitude further (here, 
about 1012), you find the unexpected. Even more extreme are the energies 
and distances at which gravity is thought to unify with the other forces in a 
TOE. Most speculative and least constrained by experiment are TOEs, one 
of which is called Superstring theory. Superstrings are entities that are 
10-*° m in scale and act like one-dimensional oscillating strings and are 
also proposed to underlie all particles, forces, and space itself. 


At the energy of GUTs, the carrier particles of the weak force would 
become massless and identical to gluons. If that happens, then both lepton 
and baryon conservation would be violated. We do not see such violations, 
because we do not encounter such energies. However, there is a tiny 
probability that, at ordinary energies, the virtual particles that violate the 


conservation of baryon number may exist for extremely small amounts of 
time (corresponding to very small ranges). All GUTs thus predict that the 
proton should be unstable, but would decay with an extremely long lifetime 
of about 10°! y. The predicted decay mode is 

Equation: 


p > 7° + e*, (proposed proton decay) 


which violates both conservation of baryon number and electron family 
number. Although 10°! y is an extremely long time (about 107! times the 
age of the universe), there are a lot of protons, and detectors have been 
constructed to look for the proposed decay mode as seen in [link]. It is 
somewhat comforting that proton decay has not been detected, and its 
experimental lifetime is now greater than 5 x 10°? y. This does not prove 
GUTs wrong, but it does place greater constraints on the theories, benefiting 
theorists in many ways. 


From looking increasingly inward at smaller details for direct evidence of 
electroweak theory and GUTs, we turn around and look to the universe for 
evidence of the unification of forces. In the 1920s, the expansion of the 
universe was discovered. Thinking backward in time, the universe must 
once have been very small, dense, and extremely hot. At a tiny fraction of a 
second after the fabled Big Bang, forces would have been unified and may 
have left their fingerprint on the existing universe. This, one of the most 
exciting forefronts of physics, is the subject of Frontiers of Physics. 


In the Tevatron accelerator at Fermilab, 
protons and antiprotons collide at high 
energies, and some of those collisions could 
result in the production of a Higgs boson in 
association with a W boson. When the W 
boson decays to a high-energy lepton and a 
neutrino, the detector triggers on the lepton, 
whether it is an electron or a muon. (credit: 
D. J. Miller) 


Summary 


e Attempts to show unification of the four forces are called Grand 
Unified Theories (GUTs) and have been partially successful, with 
connections proven between EM and weak forces in electroweak 
theory. 

e The strong force is carried by eight proposed particles called gluons, 
which are intimately connected to a quantum number called color— 
their governing theory is thus called quantum chromodynamics 


(QCD). Taken together, QCD and the electroweak theory are widely 
accepted as the Standard Model of particle physics. 

e Unification of the strong force is expected at such high energies that it 
cannot be directly tested, but it may have observable consequences in 
the as-yet unobserved decay of the proton and topics to be discussed in 
the next chapter. Although unification of forces is generally 
anticipated, much remains to be done to prove its validity. 


Conceptual Questions 


Exercise: 
Problem: 
If a GUT is proven, and the four forces are unified, it will still be 


correct to say that the orbit of the moon is determined by the 
gravitational force. Explain why. 


Exercise: 
Problem: 
If the Higgs boson is discovered and found to have mass, will it be 


considered the ultimate carrier of the weak force? Explain your 
response. 


Exercise: 


Problem: 


Gluons and the photon are massless. Does this imply that the W*, 
W~-, and Z° are the ultimate carriers of the weak force? 


Problems & Exercises 


Exercise: 


Problem: Integrated Concepts 


The intensity of cosmic ray radiation decreases rapidly with increasing 
energy, but there are occasionally extremely energetic cosmic rays that 
create a shower of radiation from all the particles they create by 
striking a nucleus in the atmosphere as seen in the figure given below. 
Suppose a cosmic ray particle having an energy of 10’? GeV converts 
its energy into particles with masses averaging 200 MeV/c’. (a) How 
many particles are created? (b) If the particles rain down on a 
1.00-km? area, how many particles are there per square meter? 


Extremely 
energetic 
cosmic ray 


Atmosphere 


An extremely energetic cosmic ray 
creates a shower of particles on earth. 
The energy of these rare cosmic rays 
can approach a joule (about 10'° GeV 
) and, after multiple collisions, huge 
numbers of particles are created from 
this energy. Cosmic ray showers have 

been observed to extend over many 

square kilometers. 


Solution: 
(a) 5 x 10° 


(b) 5 x 10* particles / m’” 


Exercise: 


Problem: Integrated Concepts 


Assuming conservation of momentum, what is the energy of each ~y 
ray produced in the decay of a neutral at rest pion, in the reaction 
W—+y+y? 


Exercise: 


Problem: Integrated Concepts 


What is the wavelength of a 50-GeV electron, which is produced at 
SLAC? This provides an idea of the limit to the detail it can probe. 


Solution: 


2.5x10°)’m 


Exercise: 


Problem: Integrated Concepts 


gta ie : = 1 ¥ 

(a) Calculate the relativistic quantity ~ = Vive for 1.00-TeV 
protons produced at Fermilab. (b) If such a proton created a 7* having 
the same speed, how long would its life be in the laboratory? (c) How 


far could it travel in this time? 


Exercise: 
Problem: Integrated Concepts 


The primary decay mode for the negative pionis7 — uw + Vii (a) 
What is the energy release in MeV in this decay? (b) Using 
conservation of momentum, how much energy does each of the decay 
products receive, given the 7 is at rest when it decays? You may 
assume the muon antineutrino is massless and has momentum 

p = E/c, just like a photon. 


Solution: 
(a) 33.9 MeV 


(b) Muon antineutrino 29.8 MeV, muon 4.1 MeV (kinetic energy) 


Exercise: 


Problem: Integrated Concepts 


Plans for an accelerator that produces a secondary beam of K-mesons 
to scatter from nuclei, for the purpose of studying the strong force, call 
for them to have a kinetic energy of 500 MeV. (a) What would the 
relativistic quantity y = u be for these particles? (b) How long 


/ 1-0? /c? 
would their average lifetime be in the laboratory? (c) How far could 
they travel in this time? 


Exercise: 


Problem: Integrated Concepts 


Suppose you are designing a proton decay experiment and you can 
detect 50 percent of the proton decays in a tank of water. (a) How 
many kilograms of water would you need to see one decay per month, 
assuming a lifetime of 10°! y? (b) How many cubic meters of water is 
this? (c) If the actual lifetime is 10°° y, how long would you have to 
wait on an average to see a single proton decay? 


Solution: 
(a) 7.2 x 10° kg 
(b) 7.2 x 10? m? 


(c) 100 months 


Exercise: 


Problem: Integrated Concepts 


In supernovas, neutrinos are produced in huge amounts. They were 
detected from the 1987A supernova in the Magellanic Cloud, which is 
about 120,000 light years away from the Earth (relatively close to our 
Milky Way galaxy). If neutrinos have a mass, they cannot travel at the 
speed of light, but if their mass is small, they can get close. (a) 
Suppose a neutrino with a 7-eV/c? mass Has a kinetic energy of 700 
keV. Find the relativistic quantity y = Jaa We for it. (b) If the 


neutrino leaves the 1987A supernova at the same time as a photon and 
both travel to Earth, how much sooner does the photon arrive? This is 
not a large time difference, given that it is impossible to know which 
neutrino left with which photon and the poor efficiency of the neutrino 
detectors. Thus, the fact that neutrinos were observed within hours of 
the brightening of the supernova only places an upper limit on the 
neutrino’s mass. (Hint: You may need to use a series expansion to find 
v for the neutrino, since its + is so large.) 


Exercise: 


Problem: Construct Your Own Problem 


Consider an ultrahigh-energy cosmic ray entering the Earth’s 
atmosphere (some have energies approaching a joule). Construct a 
problem in which you calculate the energy of the particle based on the 
number of particles in an observed cosmic ray shower. Among the 
things to consider are the average mass of the shower particles, the 
average number per square meter, and the extent (number of square 
meters covered) of the shower. Express the energy in eV and joules. 


Exercise: 
Problem: Construct Your Own Problem 


Consider a detector needed to observe the proposed, but extremely 
rare, decay of an electron. Construct a problem in which you calculate 


the amount of matter needed in the detector to be able to observe the 
decay, assuming that it has a signature that is clearly identifiable. 
Among the things to consider are the estimated half life (long for rare 
events), and the number of decays per unit time that you wish to 
observe, as well as the number of electrons in the detector substance. 


Glossary 


electroweak theory 
theory showing connections between EM and weak forces 


grand unified theory 
theory that shows unification of the strong and electroweak forces 


gluons 
eight proposed particles which carry the strong force 


Higgs boson 
a massive particle that, if observed, would give validity to the theory 
that carrier particles are identical under certain circumstances 


quantum chromodynamics 
the governing theory of connecting quantum number color to gluons 


standard model 
combination of quantum chromodynamics and electroweak theory 


superstring theory 
a theory of everything based on vibrating strings some 10-*° m in 
length 


Introduction to Quantum Physics 
class="introduction" 


A black fly 
imaged by 
an electron 
microscope 
is as 
monstrous 
as any 
science- 
fiction 
creature. 
(credit: 
WSs 
Departmen 
t of 
Agriculture 
via 
Wikimedia 
Commons) 


Quantum mechanics is the branch of physics needed to deal with 
submicroscopic objects. Because these objects are smaller than we can 
observe directly with our senses and generally must be observed with the 
aid of instruments, parts of quantum mechanics seem as foreign and bizarre 
as parts of relativity. But, like relativity, quantum mechanics has been 
shown to be valid—truth is often stranger than fiction. 


Certain aspects of quantum mechanics are familiar to us. We accept as fact 
that matter is composed of atoms, the smallest unit of an element, and that 
these atoms combine to form molecules, the smallest unit of a compound. 
(See [link].) While we cannot see the individual water molecules in a 
stream, for example, we are aware that this is because molecules are so 
small and so numerous in that stream. When introducing atoms, we 
commonly say that electrons orbit atoms in discrete shells around a tiny 
nucleus, itself composed of smaller particles called protons and neutrons. 
We are also aware that electric charge comes in tiny units carried almost 
entirely by electrons and protons. As with water molecules in a stream, we 


do not notice individual charges in the current through a lightbulb, because 
the charges are so small and so numerous in the macroscopic situations we 
sense directly. 


O 
O#O 
O°D 
Atoms and their substructure 
are familiar examples of objects 
that require quantum mechanics 
to be fully explained. Certain of 
their characteristics, such as the 
discrete electron shells, are 
classical physics explanations. 
In quantum mechanics we 


conceptualize discrete “electron 
clouds” around the nucleus. 


Note: 

Making Connections: Realms of Physics 

Classical physics is a good approximation of modern physics under 
conditions first discussed in the The Nature of Science and Physics. 
Quantum mechanics is valid in general, and it must be used rather than 
classical physics to describe small objects, such as atoms. 


Atoms, molecules, and fundamental electron and proton charges are all 
examples of physical entities that are quantized—that is, they appear only 
in certain discrete values and do not have every conceivable value. 


Quantized is the opposite of continuous. We cannot have a fraction of an 
atom, or part of an electron’s charge, or 14-1/3 cents, for example. Rather, 
everything is built of integral multiples of these substructures. Quantum 
physics is the branch of physics that deals with small objects and the 
quantization of various entities, including energy and angular momentum. 
Just as with classical physics, quantum physics has several subfields, such 
as mechanics and the study of electromagnetic forces. The correspondence 
principle states that in the classical limit (large, slow-moving objects), 
quantum mechanics becomes the same as classical physics. In this chapter, 
we begin the development of quantum mechanics and its description of the 
strange submicroscopic world. In later chapters, we will examine many 
areas, such as atomic and nuclear physics, in which quantum mechanics is 
crucial. 


Glossary 


quantized 
the fact that certain physical entities exist only with particular discrete 
values and not every conceivable value 


correspondence principle 
in the classical limit (large, slow-moving objects), quantum mechanics 
becomes the same as classical physics 


quantum mechanics 
the branch of physics that deals with small objects and with the 
quantization of various entities, especially energy 


Quantization of Energy 


e Explain Max Planck’s contribution to the development of quantum 
mechanics. 


e Explain why atomic spectra indicate quantization. 


Planck’s Contribution 


Energy is quantized in some systems, meaning that the system can have 
only certain energies and not a continuum of energies, unlike the classical 
case. This would be like having only certain speeds at which a car can 
travel because its kinetic energy can have only certain values. We also find 
that some forms of energy transfer take place with discrete lumps of energy. 
While most of us are familiar with the quantization of matter into lumps 
called atoms, molecules, and the like, we are less aware that energy, too, 
can be quantized. Some of the earliest clues about the necessity of quantum 
mechanics over classical physics came from the quantization of energy. 


6000 K (white hot) 


EM radiation intensity 


UV R 
Visible 
range 
Graphs of blackbody 


radiation (from an ideal 
radiator) at three different 
radiator temperatures. 
The intensity or rate of 


radiation emission 
increases dramatically 
with temperature, and the 
peak of the spectrum 
shifts toward the visible 
and ultraviolet parts of 
the spectrum. The shape 
of the spectrum cannot be 
described with classical 
physics. 


Where is the quantization of energy observed? Let us begin by considering 
the emission and absorption of electromagnetic (EM) radiation. The EM 
spectrum radiated by a hot solid is linked directly to the solid’s temperature. 
(See [link].) An ideal radiator is one that has an emissivity of 1 at all 
wavelengths and, thus, is jet black. Ideal radiators are therefore called 
blackbodies, and their EM radiation is called blackbody radiation. It was 
discussed that the total intensity of the radiation varies as T+, the fourth 
power of the absolute temperature of the body, and that the peak of the 
spectrum shifts to shorter wavelengths at higher temperatures. All of this 
seems quite continuous, but it was the curve of the spectrum of intensity 
versus wavelength that gave a clue that the energies of the atoms in the 
solid are quantized. In fact, providing a theoretical explanation for the 
experimentally measured shape of the spectrum was a mystery at the turn of 
the century. When this “ultraviolet catastrophe” was eventually solved, the 
answers led to new technologies such as computers and the sophisticated 
imaging techniques described in earlier chapters. Once again, physics as an 
enabling science changed the way we live. 


The German physicist Max Planck (1858-1947) used the idea that atoms 
and molecules in a body act like oscillators to absorb and emit radiation. 
The energies of the oscillating atoms and molecules had to be quantized to 
correctly describe the shape of the blackbody spectrum. Planck deduced 
that the energy of an oscillator having a frequency f is given by 
Equation: 


1 
f= — |hf. 
(n+ >) 


Here n is any nonnegative integer (0, 1, 2, 3, ...). The symbol A stands for 
Planck’s constant, given by 
Equation: 


h — 6.626 x 104 J-s. 


The equation & = (n st + )hf means that an oscillator having a frequency 
f (emitting and absorbing EM radiation of frequency f) can have its energy 
increase or decrease only in discrete steps of size 

Equation: 


AE = hf. 


It might be helpful to mention some macroscopic analogies of this 
quantization of energy phenomena. This is like a pendulum that has a 
characteristic oscillation frequency but can swing with only certain 
amplitudes. Quantization of energy also resembles a standing wave on a 
string that allows only particular harmonics described by integers. It is also 
similar to going up and down a hill using discrete stair steps rather than 
being able to move up and down a continuous slope. Your potential energy 
takes on discrete values as you move from step to step. 


Using the quantization of oscillators, Planck was able to correctly describe 
the experimentally known shape of the blackbody spectrum. This was the 
first indication that energy is sometimes quantized on a small scale and 
earned him the Nobel Prize in Physics in 1918. Although Planck’s theory 
comes from observations of a macroscopic object, its analysis is based on 
atoms and molecules. It was such a revolutionary departure from classical 
physics that Planck himself was reluctant to accept his own idea that energy 
states are not continuous. The general acceptance of Planck’s energy 
quantization was greatly enhanced by Einstein’s explanation of the 
photoelectric effect (discussed in the next section), which took energy 


quantization a step further. Planck was fully involved in the development of 
both early quantum mechanics and relativity. He quickly embraced 
Einstein’s special relativity, published in 1905, and in 1906 Planck was the 
first to suggest the correct formula for relativistic momentum, p = ymu. 


The German physicist Max 
Planck had a major influence on 
the early development of 
quantum mechanics, being the 
first to recognize that energy is 
sometimes quantized. Planck 
also made important 
contributions to special 
relativity and classical physics. 
(credit: Library of Congress, 
Prints and Photographs Division 
via Wikimedia Commons) 


Note that Planck’s constant h is a very small number. So for an infrared 
frequency of 10'+ Hz being emitted by a blackbody, for example, the 
difference between energy levels is only 

AE = hf=(6.63 x 10-°4 J-s)(10'4 Hz)= 6.63 x 10° J, or about 0.4 


eV. This 0.4 eV of energy is significant compared with typical atomic 


energies, which are on the order of an electron volt, or thermal energies, 
which are typically fractions of an electron volt. But on a macroscopic or 
classical scale, energies are typically on the order of joules. Even if 
macroscopic energies are quantized, the quantum steps are too small to be 
noticed. This is an example of the correspondence principle. For a large 
object, quantum mechanics produces results indistinguishable from those of 
classical physics. 


Atomic Spectra 


Now let us turn our attention to the emission and absorption of EM 
radiation by gases. The Sun is the most common example of a body 
containing gases emitting an EM spectrum that includes visible light. We 
also see examples in neon signs and candle flames. Studies of emissions of 
hot gases began more than two centuries ago, and it was soon recognized 
that these emission spectra contained huge amounts of information. The 
type of gas and its temperature, for example, could be determined. We now 
know that these EM emissions come from electrons transitioning between 
energy levels in individual atoms and molecules; thus, they are called 
atomic spectra. Atomic spectra remain an important analytical tool today. 
[link] shows an example of an emission spectrum obtained by passing an 
electric discharge through a material. One of the most important 
characteristics of these spectra is that they are discrete. By this we mean 
that only certain wavelengths, and hence frequencies, are emitted. This is 
called a line spectrum. If frequency and energy are associated as AF = hf, 
the energies of the electrons in the emitting atoms and molecules are 
quantized. This is discussed in more detail later in this chapter. 


Emission spectrum of oxygen. When an electrical discharge is 
passed through a substance, its atoms and molecules absorb 
energy, which is reemitted as EM radiation. The discrete nature 
of these emissions implies that the energy states of the atoms 


and molecules are quantized. Such atomic spectra were used as 
analytical tools for many decades before it was understood why 
they are quantized. (credit: Teravolt, Wikimedia Commons) 


It was a major puzzle that atomic spectra are quantized. Some of the best 
minds of 19th-century science failed to explain why this might be. Not until 
the second decade of the 20th century did an answer based on quantum 
mechanics begin to emerge. Again a macroscopic or classical body of gas 
was involved in the studies, but the effect, as we shall see, is due to 
individual atoms and molecules. 


Note: 

PhET Explorations: Models of the Hydrogen Atom 

How did scientists figure out the structure of atoms without looking at 
them? Try out different models by shooting light at the atom. Check how 
the prediction of the model matches the experimental results. 


https://archive.cnx.org/specials/d77cc1d0-33e4-11e6-b016- 


Section Summary 


e The first indication that energy is sometimes quantized came from 
blackbody radiation, which is the emission of EM radiation by an 
object with an emissivity of 1. 

e Planck recognized that the energy levels of the emitting atoms and 
molecules were quantized, with only the allowed values of 
f= (n - +) hf , where n is any non-negative integer (0, 1, 2, 3, ...). 

e his Planck’s constant, whose value is h = 6.626 x 10-4 J-s. 

e Thus, the oscillatory absorption and emission energies of atoms and 
molecules in a blackbody could increase or decrease only in steps of 


size AE = hf where f is the frequency of the oscillatory nature of the 
absorption and emission of EM radiation. 

e Another indication of energy levels being quantized in atoms and 
molecules comes from the lines in atomic spectra, which are the EM 
emissions of individual atoms and molecules. 


Conceptual Questions 


Exercise: 
Problem: 
Give an example of a physical entity that is quantized. State 
specifically what the entity is and what the limits are on its values. 
Exercise: 
Problem: 
Give an example of a physical entity that is not quantized, in that it is 
continuous and may have a continuous range of values. 
Exercise: 
Problem: 
What aspect of the blackbody spectrum forced Planck to propose 
quantization of energy levels in its atoms and molecules? 
Exercise: 
Problem: 
If Planck’s constant were large, say 10°“ times greater than it is, we 


would observe macroscopic entities to be quantized. Describe the 
motions of a child’s swing under such circumstances. 


Exercise: 


Problem: Why don’t we notice quantization in everyday events? 


Problems & Exercises 


Exercise: 
Problem: 
A LiBr molecule oscillates with a frequency of 1.7 x 10/° Hz. (a) 
What is the difference in energy in eV between allowed oscillator 


States? (b) What is the approximate value of n for a state having an 
energy of 1.0 eV? 


Solution: 
(a) 0.070 eV 


(b) 14 
Exercise: 
Problem: 
The difference in energy between allowed oscillator states in HBr 


molecules is 0.330 eV. What is the oscillation frequency of this 
molecule? 


Exercise: 
Problem: 
A physicist is watching a 15-kg orangutan at a zoo swing lazily ina 
tire at the end of a rope. He (the physicist) notices that each oscillation 
takes 3.00 s and hypothesizes that the energy is quantized. (a) What is 
the difference in energy in joules between allowed oscillator states? (b) 


What is the value of n for a state where the energy is 5.00 J? (c) Can 
the quantization be observed? 


Solution: 
(a): 2.21 10" J 


(b) 2.26 x 10°4 


(c) No 


Glossary 


blackbody 
an ideal radiator, which can radiate equally well at all wavelengths 


blackbody radiation 
the electromagnetic radiation from a blackbody 


Planck’s constant 
h = 6.626 x 104 J-s 


atomic spectra 
the electromagnetic emission from atoms and molecules 


The Photoelectric Effect 


e Describe a typical photoelectric-effect experiment. 

¢ Determine the maximum kinetic energy of photoelectrons ejected by 
photons of one energy or wavelength, when given the maximum 
kinetic energy of photoelectrons for a different photon energy or 
wavelength. 


When light strikes materials, it can eject electrons from them. This is called 
the photoelectric effect, meaning that light (photo) produces electricity. 
One common use of the photoelectric effect is in light meters, such as those 
that adjust the automatic iris on various types of cameras. In a similar way, 
another use is in solar cells, as you probably have in your calculator or have 
seen on a roof top or a roadside sign. These make use of the photoelectric 
effect to convert light into electricity for running different devices. 


The 
photoelectric 
effect can be 
observed by 

allowing 
light to fall 
on the metal 
plate in this 
evacuated 
tube. 
Electrons 
ejected by 
the light are 
collected on 
the collector 
wire and 


measured as 
a current. A 
retarding 
voltage 
between the 
collector 
wire and 
plate can 
then be 
adjusted so 
as to 
determine the 
energy of the 
ejected 
electrons. For 
example, if it 
is sufficiently 
negative, no 
electrons will 
reach the 
wire. (credit: 
P.P. Urone) 


This effect has been known for more than a century and can be studied 
using a device such as that shown in [link]. This figure shows an evacuated 
tube with a metal plate and a collector wire that are connected by a variable 
voltage source, with the collector more negative than the plate. When light 
(or other EM radiation) strikes the plate in the evacuated tube, it may eject 
electrons. If the electrons have energy in electron volts (eV) greater than the 
potential difference between the plate and the wire in volts, some electrons 
will be collected on the wire. Since the electron energy in eV is qV, where 
q is the electron charge and V is the potential difference, the electron 
energy can be measured by adjusting the retarding voltage between the wire 
and the plate. The voltage that stops the electrons from reaching the wire 
equals the energy in eV. For example, if —3.00 V barely stops the electrons, 


their energy is 3.00 eV. The number of electrons ejected can be determined 
by measuring the current between the wire and plate. The more light, the 
more electrons; a little circuitry allows this device to be used as a light 
meter. 


What is really important about the photoelectric effect is what Albert 
Einstein deduced from it. Einstein realized that there were several 
characteristics of the photoelectric effect that could be explained only if EM 
radiation is itself quantized: the apparently continuous stream of energy in 
an EM wave is actually composed of energy quanta called photons. In his 
explanation of the photoelectric effect, Einstein defined a quantized unit or 
quantum of EM energy, which we now call a photon, with an energy 
proportional to the frequency of EM radiation. In equation form, the photon 
energy is 

Equation: 


E=nhf, 


where F is the energy of a photon of frequency f and h is Planck’s 
constant. This revolutionary idea looks similar to Planck’s quantization of 
energy states in blackbody oscillators, but it is quite different. It is the 
quantization of EM radiation itself. EM waves are composed of photons 
and are not continuous smooth waves as described in previous chapters on 
optics. Their energy is absorbed and emitted in lumps, not continuously. 
This is exactly consistent with Planck’s quantization of energy levels in 
blackbody oscillators, since these oscillators increase and decrease their 
energy in steps of hf by absorbing and emitting photons having EF = hf. 
We do not observe this with our eyes, because there are so many photons in 
common light sources that individual photons go unnoticed. (See [link].) 
The next section of the text (Photon Energies and the Electromagnetic 
Spectrum) is devoted to a discussion of photons and some of their 
characteristics and implications. For now, we will use the photon concept to 
explain the photoelectric effect, much as Einstein did. 


Flashlight § -=-hf Gade 


= ig a 
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An EM wave of frequency f is composed of 
photons, or individual quanta of EM 
radiation. The energy of each photon is 
E = hf, where h is Planck’s constant and f 
is the frequency of the EM radiation. Higher 
intensity means more photons per unit area. 
The flashlight emits large numbers of 
photons of many different frequencies, 
hence others have energy E/= hf/, and so 
on. 


The photoelectric effect has the properties discussed below. All these 
properties are consistent with the idea that individual photons of EM 
radiation are absorbed by individual electrons in a material, with the 
electron gaining the photon’s energy. Some of these properties are 
inconsistent with the idea that EM radiation is a simple wave. For 
simplicity, let us consider what happens with monochromatic EM radiation 
in which all photons have the same energy hf. 


1. If we vary the frequency of the EM radiation falling on a material, we 
find the following: For a given material, there is a threshold frequency 
fo for the EM radiation below which no electrons are ejected, 
regardless of intensity. Individual photons interact with individual 
electrons. Thus if the photon energy is too small to break an electron 
away, no electrons will be ejected. If EM radiation was a simple wave, 
sufficient energy could be obtained by increasing the intensity. 

2. Once EM radiation falls on a material, electrons are ejected without 
delay. As soon as an individual photon of a sufficiently high frequency 
is absorbed by an individual electron, the electron is ejected. If the EM 


radiation were a simple wave, several minutes would be required for 
sufficient energy to be deposited to the metal surface to eject an 
electron. 

. The number of electrons ejected per unit time is proportional to the 
intensity of the EM radiation and to no other characteristic. High- 
intensity EM radiation consists of large numbers of photons per unit 
area, with all photons having the same characteristic energy hf. 

. If we vary the intensity of the EM radiation and measure the energy of 
ejected electrons, we find the following: The maximum kinetic energy 
of ejected electrons is independent of the intensity of the EM radiation. 
Since there are so many electrons in a material, it is extremely unlikely 
that two photons will interact with the same electron at the same time, 
thereby increasing the energy given it. Instead (as noted in 3 above), 
increased intensity results in more electrons of the same energy being 
ejected. If EM radiation were a simple wave, a higher intensity could 
give more energy, and higher-energy electrons would be ejected. 

. The kinetic energy of an ejected electron equals the photon energy 
minus the binding energy of the electron in the specific material. An 
individual photon can give all of its energy to an electron. The 
photon’s energy is partly used to break the electron away from the 
material. The remainder goes into the ejected electron’s kinetic energy. 
In equation form, this is given by 

Equation: 


KE, = hf — BE, 


where KE, is the maximum kinetic energy of the ejected electron, hf 
is the photon’s energy, and BE is the binding energy of the electron to 
the particular material. (BE is sometimes called the work function of 
the material.) This equation, due to Einstein in 1905, explains the 
properties of the photoelectric effect quantitatively. An individual 
photon of EM radiation (it does not come any other way) interacts with 
an individual electron, supplying enough energy, BE, to break it away, 
with the remainder going to kinetic energy. The binding energy is 

BE = hf , where fo is the threshold frequency for the particular 
material. [link] shows a graph of maximum KE, versus the frequency 
of incident EM radiation falling on a particular material. 


Photoelectric effect. A 


graph of the kinetic 
energy of an ejected 
electron, KE., versus the 
frequency of EM 
radiation impinging on a 
certain material. There is 
a threshold frequency 
below which no electrons 
are ejected, because the 
individual photon 
interacting with an 
individual electron has 
insufficient energy to 
break it away. Above the 
threshold energy, KE, 
increases linearly with f, 
consistent with 
KE, = hf — BE. The 
slope of this line is h — 
the data can be used to 
determine Planck’s 
constant experimentally. 
Einstein gave the first 
successful explanation of 
such data by proposing 


the idea of photons— 
quanta of EM radiation. 


Einstein’s idea that EM radiation is quantized was crucial to the beginnings 
of quantum mechanics. It is a far more general concept than its explanation 
of the photoelectric effect might imply. All EM radiation can also be 
modeled in the form of photons, and the characteristics of EM radiation are 
entirely consistent with this fact. (As we will see in the next section, many 
aspects of EM radiation, such as the hazards of ultraviolet (UV) radiation, 
can be explained only by photon properties.) More famous for modern 
relativity, Einstein planted an important seed for quantum mechanics in 
1905, the same year he published his first paper on special relativity. His 
explanation of the photoelectric effect was the basis for the Nobel Prize 
awarded to him in 1921. Although his other contributions to theoretical 
physics were also noted in that award, special and general relativity were 
not fully recognized in spite of having been partially verified by experiment 
by 1921. Although hero-worshipped, this great man never received Nobel 
recognition for his most famous work—telativity. 


Example: 

Calculating Photon Energy and the Photoelectric Effect: A Violet 
Light 

(a) What is the energy in joules and electron volts of a photon of 420-nm 
violet light? (b) What is the maximum kinetic energy of electrons ejected 
from calcium by 420-nm violet light, given that the binding energy (or 
work function) of electrons for calcium metal is 2.71 eV? 

Strategy 

To solve part (a), note that the energy of a photon is given by & = hf. For 
part (b), once the energy of the photon is calculated, it is a straightforward 
application of KE, = hf—BE to find the ejected electron’s maximum 
kinetic energy, since BE is given. 

Solution for (a) 

Photon energy is given by 


Equation: 
E =hf 


Since we are given the wavelength rather than the frequency, we solve the 
familiar relationship c = fA for the frequency, yielding 
Equation: 


Combining these two equations gives the useful relationship 
Equation: 


jj 
a 


Now substituting known values yields 
Equation: 


_ (6.63 x 10°** J - s) (3.00 x 10° m/s) 


= Ge 
420 x 10° m 


Converting to eV, the energy of the photon is 
Equation: 


1 
H = (4.74x 10 J) ee ace 


1.6 x 10°! J 


Solution for (b) 

Finding the kinetic energy of the ejected electron is now a simple 
application of the equation KE, = hf—BE. Substituting the photon energy 
and binding energy yields 

Equation: 


KE, — hf- BE = 2.96 eV — 2.71 eV = 0.246 eV. 


Discussion 


The energy of this 420-nm photon of violet light is a tiny fraction of a 
joule, and so it is no wonder that a single photon would be difficult for us 
to sense directly—humans are more attuned to energies on the order of 
joules. But looking at the energy in electron volts, we can see that this 
photon has enough energy to affect atoms and molecules. A DNA molecule 
can be broken with about 1 eV of energy, for example, and typical atomic 
and molecular energies are on the order of eV, so that the UV photon in this 
example could have biological effects. The ejected electron (called a 
photoelectron) has a rather low energy, and it would not travel far, except 
in a vacuum. The electron would be stopped by a retarding potential of but 
0.26 eV. In fact, if the photon wavelength were longer and its energy less 
than 2.71 eV, then the formula would give a negative kinetic energy, an 
impossibility. This simply means that the 420-nm photons with their 2.96- 
eV energy are not much above the frequency threshold. You can show for 
yourself that the threshold wavelength is 459 nm (blue light). This means 
that if calcium metal is used in a light meter, the meter will be insensitive 
to wavelengths longer than those of blue light. Such a light meter would be 
completely insensitive to red light, for example. 


Note: 

PhET Explorations: Photoelectric Effect 

See how light knocks electrons off a metal target, and recreate the 
experiment that spawned the field of quantum mechanics. 


https://archive.cnx.org/specials/cf1152da-eae8-11e5-b874- 
£779884a9994/photoelectric-effect/#sim-photoelectric-effect 


Section Summary 


e The photoelectric effect is the process in which EM radiation ejects 
electrons from a material. 

e Einstein proposed photons to be quanta of EM radiation having energy 
E = hf, where f is the frequency of the radiation. 


e All EM radiation is composed of photons. As Einstein explained, all 
characteristics of the photoelectric effect are due to the interaction of 
individual photons with individual electrons. 

e The maximum kinetic energy KE, of ejected electrons 
(photoelectrons) is given by KE, = hf-— BE, where hf is the photon 
energy and BE is the binding energy (or work function) of the electron 
to the particular material. 


Conceptual Questions 


Exercise: 
Problem: 
Is visible light the only type of EM radiation that can cause the 
photoelectric effect? 
Exercise: 
Problem: 
Which aspects of the photoelectric effect cannot be explained without 


photons? Which can be explained without photons? Are the latter 
inconsistent with the existence of photons? 


Exercise: 
Problem: 
Is the photoelectric effect a direct consequence of the wave character 


of EM radiation or of the particle character of EM radiation? Explain 
briefly. 


Exercise: 
Problem: 
Insulators (nonmetals) have a higher BE than metals, and it is more 
difficult for photons to eject electrons from insulators. Discuss how 


this relates to the free charges in metals that make them good 
conductors. 


Exercise: 
Problem: 
If you pick up and shake a piece of metal that has electrons in it free to 
move as a current, no electrons fall out. Yet if you heat the metal, 
electrons can be boiled off. Explain both of these facts as they relate to 


the amount and distribution of energy involved with shaking the object 
as compared with heating it. 


Problems & Exercises 


Exercise: 
Problem: 
What is the longest-wavelength EM radiation that can eject a 


photoelectron from silver, given that the binding energy is 4.73 eV? Is 
this in the visible range? 


Solution: 


263 nm 
Exercise: 
Problem: 
Find the longest-wavelength photon that can eject an electron from 


potassium, given that the binding energy is 2.24 eV. Is this visible EM 
radiation? 


Exercise: 


Problem: 


What is the binding energy in eV of electrons in magnesium, if the 
longest-wavelength photon that can eject electrons is 337 nm? 


Solution: 


3.69 eV 
Exercise: 
Problem: 
Calculate the binding energy in eV of electrons in aluminum, if the 
longest-wavelength photon that can eject them is 304 nm. 
Exercise: 
Problem: 
What is the maximum kinetic energy in eV of electrons ejected from 


sodium metal by 450-nm EM radiation, given that the binding energy 
is 2.28 eV? 


Solution: 


0.483 eV 
Exercise: 
Problem: 
UV radiation having a wavelength of 120 nm falls on gold metal, to 


which electrons are bound by 4.82 eV. What is the maximum kinetic 
energy of the ejected photoelectrons? 


Exercise: 


Problem: 


Violet light of wavelength 400 nm ejects electrons with a maximum 
kinetic energy of 0.860 eV from sodium metal. What is the binding 
energy of electrons to sodium metal? 


Solution: 


2.25 eV 


Exercise: 


Problem: 


UV radiation having a 300-nm wavelength falls on uranium metal, 
ejecting 0.500-eV electrons. What is the binding energy of electrons to 
uranium metal? 


Exercise: 
Problem: 
What is the wavelength of EM radiation that ejects 2.00-eV electrons 


from calcium metal, given that the binding energy is 2.71 eV? What 
type of EM radiation is this? 


Solution: 
(a) 264 nm 


(b) Ultraviolet 
Exercise: 
Problem: 
Find the wavelength of photons that eject 0.100-eV electrons from 


potassium, given that the binding energy is 2.24 eV. Are these photons 
visible? 


Exercise: 


Problem: 


What is the maximum velocity of electrons ejected from a material by 
80-nm photons, if they are bound to the material by 4.73 eV? 


Solution: 


1.95 x 10° m/s 


Exercise: 


Problem: 


Photoelectrons from a material with a binding energy of 2.71 eV are 
ejected by 420-nm photons. Once ejected, how long does it take these 
electrons to travel 2.50 cm to a detection device? 


Exercise: 
Problem: 
A laser with a power output of 2.00 mW at a wavelength of 400 nm is 
projected onto calcium metal. (a) How many electrons per second are 


ejected? (b) What power is carried away by the electrons, given that 
the binding energy is 2.71 eV? 


Solution: 
(a) 4.02 x 10° /s 


(b) 0.256 mW 
Exercise: 


Problem: 


(a) Calculate the number of photoelectrons per second ejected from a 
1.00-mm ? area of sodium metal by 500-nm EM radiation having an 
intensity of 1.30 kW/ m? (the intensity of sunlight above the Earth’s 
atmosphere). (b) Given that the binding energy is 2.28 eV, what power 
is carried away by the electrons? (c) The electrons carry away less 
power than brought in by the photons. Where does the other power go? 
How can it be recovered? 


Exercise: 
Problem: Unreasonable Results 
Red light having a wavelength of 700 nm is projected onto magnesium 


metal to which electrons are bound by 3.68 eV. (a) Use 
KE, = hf-BE to calculate the kinetic energy of the ejected electrons. 


(b) What is unreasonable about this result? (c) Which assumptions are 
unreasonable or inconsistent? 


Solution: 
(a) -1.90 eV 
(b) Negative kinetic energy 


(c) That the electrons would be knocked free. 


Exercise: 


Problem: Unreasonable Results 


(a) What is the binding energy of electrons to a material from which 
4.00-eV electrons are ejected by 400-nm EM radiation? (b) What is 
unreasonable about this result? (c) Which assumptions are 
unreasonable or inconsistent? 


Glossary 


photoelectric effect 
the phenomenon whereby some materials eject electrons when light is 
shined on them 


photon 
a quantum, or particle, of electromagnetic radiation 


photon energy 
the amount of energy a photon has; & = hf 


binding energy 
also called the work function; the amount of energy necessary to eject 
an electron from a material 


Photon Energies and the Electromagnetic Spectrum 


e Explain the relationship between the energy of a photon in joules or electron volts and its 
wavelength or frequency. 

¢ Calculate the number of photons per second emitted by a monochromatic source of specific 
wavelength and power. 


Ionizing Radiation 


A photon is a quantum of EM radiation. Its energy is given by F = hf and is related to the frequency f 
and wavelength J of the radiation by 
Equation: 


h 
E=hf= ~ (energy of a photon), 


where F is the energy of a single photon and c is the speed of light. When working with small systems, 
energy in eV is often useful. Note that Planck’s constant in these units is 
Equation: 


h=414x10' eV-s. 


Since many wavelengths are stated in nanometers (nm), it is also useful to know that 
Equation: 


he = 1240 eV - nm. 


These will make many calculations a little easier. 


All EM radiation is composed of photons. [link] shows various divisions of the EM spectrum plotted 
against wavelength, frequency, and photon energy. Previously in this book, photon characteristics were 
alluded to in the discussion of some of the characteristics of UV, x rays, and ¥ rays, the first of which 
start with frequencies just above violet in the visible spectrum. It was noted that these types of EM 
radiation have characteristics much different than visible light. We can now see that such properties 
arise because photon energy is larger at high frequencies. 
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©2001 Brooks/Cole - Thomson Learning Frequency (Hz) 
The EM spectrum, showing major categories as a function of photon energy in eV, as 
well as wavelength and frequency. Certain characteristics of EM radiation are 
directly attributable to photon energy alone. 


Rotational energies of molecules 10-5 eV 
Vibrational energies of molecules 0.1 eV 
Energy between outer electron shells in atoms 1leV 
Binding energy of a weakly bound molecule 1eV 
Energy of red light 2eV 
Binding energy of a tightly bound molecule 10 eV 
Energy to ionize atom or molecule 10 to 1000 eV 


Representative Energies for Submicroscopic Effects (Order of Magnitude Only) 


Photons act as individual quanta and interact with individual electrons, atoms, molecules, and so on. 
The energy a photon carries is, thus, crucial to the effects it has. [link] lists representative 
submicroscopic energies in eV. When we compare photon energies from the EM spectrum in [link] 
with energies in the table, we can see how effects vary with the type of EM radiation. 


Gamma rays, a form of nuclear and cosmic EM radiation, can have the highest frequencies and, 
hence, the highest photon energies in the EM spectrum. For example, a y-ray photon with f= 107! Hz 
has an energy E = hf = 6.63 x 10°13 J = 4.14 MeV. This is sufficient energy to ionize thousands of 
atoms and molecules, since only 10 to 1000 eV are needed per ionization. In fact, 7 rays are one type 
of ionizing radiation, as are x rays and UV, because they produce ionization in materials that absorb 


them. Because so much ionization can be produced, a single y-ray photon can cause significant 
damage to biological tissue, killing cells or damaging their ability to properly reproduce. When cell 
reproduction is disrupted, the result can be cancer, one of the known effects of exposure to ionizing 
radiation. Since cancer cells are rapidly reproducing, they are exceptionally sensitive to the disruption 
produced by ionizing radiation. This means that ionizing radiation has positive uses in cancer treatment 
as well as risks in producing cancer. 


One of the first x-ray 
images, taken by 
Roentgen himself. The 
hand belongs to Bertha 
Roentgen, his wife. 
(credit: Wilhelm Conrad 
R6ntgen, via Wikimedia 
Commons) 


High photon energy also enables y rays to penetrate materials, since a collision with a single atom or 
molecule is unlikely to absorb all the 7 ray’s energy. This can make ¥ rays useful as a probe, and they 
are sometimes used in medical imaging. x rays, as you can see in [link], overlap with the low- 
frequency end of the y ray range. Since x rays have energies of keV and up, individual x-ray photons 
also can produce large amounts of ionization. At lower photon energies, x rays are not as penetrating as 
7 rays and are slightly less hazardous. X rays are ideal for medical imaging, their most common use, 
and a fact that was recognized immediately upon their discovery in 1895 by the German physicist W. 
C. Roentgen (1845-1923). (See [link].) Within one year of their discovery, x rays (for a time called 
Roentgen rays) were used for medical diagnostics. Roentgen received the 1901 Nobel Prize for the 
discovery of x rays. 


Note: 


Connections: Conservation of Energy 

Once again, we find that conservation of energy allows us to consider the initial and final forms that 
energy takes, without having to make detailed calculations of the intermediate steps. [link] is solved 
by considering only the initial and final forms of energy. 


Metal 
target High- 
X rays V = voltage 


source 


Electrons 


Heated 
filament 


Filament 
voltage 


X rays are produced when 
energetic electrons strike 
the copper anode of this 
cathode ray tube (CRT). 
Electrons (shown here as 

separate particles) interact 

individually with the 

material they strike, 

sometimes producing 
photons of EM radiation. 


While ¥ rays originate in nuclear decay, x rays are produced by the process shown in [link]. Electrons 
ejected by thermal agitation from a hot filament in a vacuum tube are accelerated through a high 
voltage, gaining kinetic energy from the electrical potential energy. When they strike the anode, the 
electrons convert their kinetic energy to a variety of forms, including thermal energy. But since an 
accelerated charge radiates EM waves, and since the electrons act individually, photons are also 
produced. Some of these x-ray photons obtain the kinetic energy of the electron. The accelerated 
electrons originate at the cathode, so such a tube is called a cathode ray tube (CRT), and various 
versions of them are found in older TV and computer screens as well as in x-ray machines. 


Example: 

X-ray Photon Energy and X-ray Tube Voltage 

Find the maximum energy in eV of an x-ray photon produced by electrons accelerated through a 
potential difference of 50.0 kV in a CRT like the one in [link]. 

Strategy 


Electrons can give all of their kinetic energy to a single photon when they strike the anode of a CRT. 
(This is something like the photoelectric effect in reverse.) The kinetic energy of the electron comes 
from electrical potential energy. Thus we can simply equate the maximum photon energy to the 
electrical potential energy—that is, hf = qV. (We do not have to calculate each step from beginning 
to end if we know that all of the starting energy qV is converted to the final form hf.) 

Solution 

The maximum photon energy is hf = qV, where q is the charge of the electron and V is the 
accelerating voltage. Thus, 

Equation: 


hf = (1.60 x 10°'* C)(50.0 x 10° V). 


From the definition of the electron volt, we know 1 eV = 1.60 x 10°!9 J, where 1 J =1C-V. 
Gathering factors and converting energy to eV yields 
Equation: 

leV 
1.60 x 10° C-V 


hf = (50.0 x 10°)(1.60 x 10° C- v)( ) = (50.0 x 10%)(1 eV) = 50.0 keV. 


Discussion 

This example produces a result that can be applied to many similar situations. If you accelerate a 
single elementary charge, like that of an electron, through a potential given in volts, then its energy in 
eV has the same numerical value. Thus a 50.0-kV potential generates 50.0 keV electrons, which in 
turn can produce photons with a maximum energy of 50 keV. Similarly, a 100-kV potential in an x-ray 
tube can generate up to 100-keV x-ray photons. Many x-ray tubes have adjustable voltages so that 
various energy x rays with differing energies, and therefore differing abilities to penetrate, can be 
generated. 


~ 


X-ray intensity 


trax f 
qv = fax 


X-ray spectrum obtained when 
energetic electrons strike a 
material. The smooth part of the 
spectrum is bremsstrahlung, 
while the peaks are 
characteristic of the anode 
material. Both are atomic 
processes that produce energetic 


photons known as x-ray 
photons. 


[link] shows the spectrum of x rays obtained from an x-ray tube. There are two distinct features to the 
spectrum. First, the smooth distribution results from electrons being decelerated in the anode material. 
A curve like this is obtained by detecting many photons, and it is apparent that the maximum energy is 
unlikely. This decelerating process produces radiation that is called bremsstrahlung (German for 
braking radiation). The second feature is the existence of sharp peaks in the spectrum; these are called 
characteristic x rays, since they are characteristic of the anode material. Characteristic x rays come 
from atomic excitations unique to a given type of anode material. They are akin to lines in atomic 
spectra, implying the energy levels of atoms are quantized. Phenomena such as discrete atomic spectra 
and characteristic x rays are explored further in Atomic Physics. 


Ultraviolet radiation (approximately 4 eV to 300 eV) overlaps with the low end of the energy range 
of x rays, but UV is typically lower in energy. UV comes from the de-excitation of atoms that may be 
part of a hot solid or gas. These atoms can be given energy that they later release as UV by numerous 
processes, including electric discharge, nuclear explosion, thermal agitation, and exposure to x rays. A 
UV photon has sufficient energy to ionize atoms and molecules, which makes its effects different from 
those of visible light. UV thus has some of the same biological effects as - rays and x rays. For 
example, it can cause skin cancer and is used as a sterilizer. The major difference is that several UV 
photons are required to disrupt cell reproduction or kill a bacterium, whereas single y-ray and X-ray 
photons can do the same damage. But since UV does have the energy to alter molecules, it can do what 
visible light cannot. One of the beneficial aspects of UV is that it triggers the production of vitamin D 
in the skin, whereas visible light has insufficient energy per photon to alter the molecules that trigger 
this production. Infantile jaundice is treated by exposing the baby to UV (with eye protection), called 
phototherapy, the beneficial effects of which are thought to be related to its ability to help prevent the 
buildup of potentially toxic bilirubin in the blood. 


Example: 

Photon Energy and Effects for UV 

Short-wavelength UV is sometimes called vacuum UV, because it is strongly absorbed by air and must 
be studied in a vacuum. Calculate the photon energy in eV for 100-nm vacuum UV, and estimate the 
number of molecules it could ionize or break apart. 

Strategy 

Using the equation & = hf and appropriate constants, we can find the photon energy and compare it 
with energy information in [link]. 


Solution 
The energy of a photon is given by 
Equation: 
h 
f=. 
a 


Using hc = 1240 eV - nm, we find that 
Equation: 


Discussion 

According to [link], this photon energy might be able to ionize an atom or molecule, and it is about 
what is needed to break up a tightly bound molecule, since they are bound by approximately 10 eV. 
This photon energy could destroy about a dozen weakly bound molecules. Because of its high photon 
energy, UV disrupts atoms and molecules it interacts with. One good consequence is that all but the 
longest-wavelength UV is strongly absorbed and is easily blocked by sunglasses. In fact, most of the 
Sun’s UV is absorbed by a thin layer of ozone in the upper atmosphere, protecting sensitive organisms 
on Earth. Damage to our ozone layer by the addition of such chemicals as CFC’s has reduced this 
protection for us. 


Visible Light 


The range of photon energies for visible light from red to violet is 1.63 to 3.26 eV, respectively (left 
for this chapter’s Problems and Exercises to verify). These energies are on the order of those between 
outer electron shells in atoms and molecules. This means that these photons can be absorbed by atoms 
and molecules. A single photon can actually stimulate the retina, for example, by altering a receptor 
molecule that then triggers a nerve impulse. Photons can be absorbed or emitted only by atoms and 
molecules that have precisely the correct quantized energy step to do so. For example, if a red photon 
of frequency f encounters a molecule that has an energy step, AF, equal to hf, then the photon can be 
absorbed. Violet flowers absorb red and reflect violet; this implies there is no energy step between 
levels in the receptor molecule equal to the violet photon’s energy, but there is an energy step for the 
red. 


There are some noticeable differences in the characteristics of light between the two ends of the visible 
spectrum that are due to photon energies. Red light has insufficient photon energy to expose most 
black-and-white film, and it is thus used to illuminate darkrooms where such film is developed. Since 
violet light has a higher photon energy, dyes that absorb violet tend to fade more quickly than those 
that do not. (See [link].) Take a look at some faded color posters in a storefront some time, and you 
will notice that the blues and violets are the last to fade. This is because other dyes, such as red and 
green dyes, absorb blue and violet photons, the higher energies of which break up their weakly bound 
molecules. (Complex molecules such as those in dyes and DNA tend to be weakly bound.) Blue and 
violet dyes reflect those colors and, therefore, do not absorb these more energetic photons, thus 
suffering less molecular damage. 


Why do the reds, yellows, 
and greens fade before 
the blues and violets 
when exposed to the Sun, 
as with this poster? The 
answer is related to 
photon energy. (credit: 
Deb Collins, Flickr) 


Transparent materials, such as some glasses, do not absorb any visible light, because there is no energy 
step in the atoms or molecules that could absorb the light. Since individual photons interact with 
individual atoms, it is nearly impossible to have two photons absorbed simultaneously to reach a large 
energy step. Because of its lower photon energy, visible light can sometimes pass through many 
kilometers of a substance, while higher frequencies like UV, x ray, and y rays are absorbed, because 
they have sufficient photon energy to ionize the material. 


Example: 

How Many Photons per Second Does a Typical Light Bulb Produce? 

Assuming that 10.0% of a 100-W light bulb’s energy output is in the visible range (typical for 
incandescent bulbs) with an average wavelength of 580 nm, calculate the number of visible photons 
emitted per second. 

Strategy 

Power is energy per unit time, and so if we can find the energy per photon, we can determine the 
number of photons per second. This will best be done in joules, since power is given in watts, which 
are joules per second. 

Solution 

The power in visible light production is 10.0% of 100 W, or 10.0 J/s. The energy of the average visible 
photon is found by substituting the given average wavelength into the formula 

Equation: 


1) 
aN 
This produces 
Equation: 
.63 x 10** J -s)(3.00 x 10° 
EE (6.63 x 10 °** J - s)(3.00 x 10° m/s) — 3.43 x 1079 J. 
580 x 10° m 
The number of visible photons per second is thus 
Equation: 
10.0 J/s 
photon/s = Se = 2.92 x 107° photon/s. 
3.43 x 10~” J/photon 
Discussion 


This incredible number of photons per second is verification that individual photons are insignificant 
in ordinary human experience. It is also a verification of the correspondence principle—on the 
macroscopic scale, quantization becomes essentially continuous or classical. Finally, there are so 
many photons emitted by a 100-W lightbulb that it can be seen by the unaided eye many kilometers 
away. 


Lower-Energy Photons 


Infrared radiation (IR) has even lower photon energies than visible light and cannot significantly 
alter atoms and molecules. IR can be absorbed and emitted by atoms and molecules, particularly 
between closely spaced states. IR is extremely strongly absorbed by water, for example, because water 
molecules have many states separated by energies on the order of 10° eV to 10°? eV, well within the 
IR and microwave energy ranges. This is why in the IR range, skin is almost jet black, with an 
emissivity near 1—there are many states in water molecules in the skin that can absorb a large range of 
IR photon energies. Not all molecules have this property. Air, for example, is nearly transparent to 
many IR frequencies. 


Microwaves are the highest frequencies that can be produced by electronic circuits, although they are 
also produced naturally. Thus microwaves are similar to IR but do not extend to as high frequencies. 
There are states in water and other molecules that have the same frequency and energy as microwaves, 
typically about 10° eV. This is one reason why food absorbs microwaves more strongly than many 
other materials, making microwave ovens an efficient way of putting energy directly into food. 


Photon energies for both IR and microwaves are so low that huge numbers of photons are involved in 
any significant energy transfer by IR or microwaves (such as warming yourself with a heat lamp or 
cooking pizza in the microwave). Visible light, IR, microwaves, and all lower frequencies cannot 
produce ionization with single photons and do not ordinarily have the hazards of higher frequencies. 
When visible, IR, or microwave radiation is hazardous, such as the inducement of cataracts by 
microwaves, the hazard is due to huge numbers of photons acting together (not to an accumulation of 
photons, such as sterilization by weak UV). The negative effects of visible, IR, or microwave radiation 
can be thermal effects, which could be produced by any heat source. But one difference is that at very 
high intensity, strong electric and magnetic fields can be produced by photons acting together. Such 
electromagnetic fields (EMF) can actually ionize materials. 


Note: 

Misconception Alert: High-Voltage Power Lines 

Although some people think that living near high-voltage power lines is hazardous to one’s health, 
ongoing studies of the transient field effects produced by these lines show their strengths to be 
insufficient to cause damage. Demographic studies also fail to show significant correlation of ill 
effects with high-voltage power lines. The American Physical Society issued a report over 10 years 
ago on power-line fields, which concluded that the scientific literature and reviews of panels show no 
consistent, significant link between cancer and power-line fields. They also felt that the “diversion of 
resources to eliminate a threat which has no persuasive scientific basis is disturbing.” 


It is virtually impossible to detect individual photons having frequencies below microwave 
frequencies, because of their low photon energy. But the photons are there. A continuous EM wave can 
be modeled as photons. At low frequencies, EM waves are generally treated as time- and position- 
varying electric and magnetic fields with no discernible quantization. This is another example of the 
correspondence principle in situations involving huge numbers of photons. 


Note: 

PhET Explorations: Color Vision 

Make a whole rainbow by mixing red, green, and blue light. Change the wavelength of a 
monochromatic beam or filter white light. View the light as a solid beam, or see the individual 
photons. 


https://phet.colorado.edu/sims/html/color-vision/latest/color-vision_en.html 


Section Summary 


e Photon energy is responsible for many characteristics of EM radiation, being particularly 
noticeable at high frequencies. 
e Photons have both wave and particle characteristics. 


Conceptual Questions 


Exercise: 


Problem: Why are UV, x rays, and y rays called ionizing radiation? 
Exercise: 
Problem: 
How can treating food with ionizing radiation help keep it from spoiling? UV is not very 
penetrating. What else could be used? 


Exercise: 


Problem: 


Some television tubes are CRTs. They use an approximately 30-kV accelerating potential to send 
electrons to the screen, where the electrons stimulate phosphors to emit the light that forms the 
pictures we watch. Would you expect x rays also to be created? 


Exercise: 
Problem: 
Tanning salons use “safe” UV with a longer wavelength than some of the UV in sunlight. This 


“safe” UV has enough photon energy to trigger the tanning mechanism. Is it likely to be able to 
cause cell damage and induce cancer with prolonged exposure? 


Exercise: 
Problem: 
Your pupils dilate when visible light intensity is reduced. Does wearing sunglasses that lack UV 
blockers increase or decrease the UV hazard to your eyes? Explain. 
Exercise: 
Problem: 
One could feel heat transfer in the form of infrared radiation from a large nuclear bomb detonated 


in the atmosphere 75 km from you. However, none of the profusely emitted x rays or y rays 
reaches you. Explain. 


Exercise: 


Problem: Can a single microwave photon cause cell damage? Explain. 
Exercise: 


Problem: 


In an x-ray tube, the maximum photon energy is given by hf = qV. Would it be technically more 
correct to say hf = qV + BE, where BE is the binding energy of electrons in the target anode? 
Why isn’t the energy stated the latter way? 


Problems & Exercises 


Exercise: 


Problem: 


What is the energy in joules and eV of a photon in a radio wave from an AM station that has a 
1530-kHz broadcast frequency? 


Solution: 


6.34 x 10-° eV, 1.01 x 1077" J 


Exercise: 


Problem: 


(a) Find the energy in joules and eV of photons in radio waves from an FM station that has a 90.0- 
MHz broadcast frequency. (b) What does this imply about the number of photons per second that 
the radio station must broadcast? 


Exercise: 


Problem: Calculate the frequency in hertz of a 1.00-MeV y-ray photon. 
Solution: 


2.42 x 10”? Hz 
Exercise: 
Problem: 
(a) What is the wavelength of a 1.00-eV photon? (b) Find its frequency in hertz. (c) Identify the 
type of EM radiation. 
Exercise: 


Problem: 
Do the unit conversions necessary to show that hc = 1240 eV - nm, as stated in the text. 


Solution: 
Equation: 


he = (6.62607 x 10-*4 J- s) (2.99792 x 108 m/s) ( 10" am.) ( 1,00000 eV ) 


1m 1.60218x10°! J 
1239.84 eV - nm 
1240 eV -nm 


2 


Exercise: 
Problem: 
Confirm the statement in the text that the range of photon energies for visible light is 1.63 to 3.26 
eV, given that the range of visible wavelengths is 380 to 760 nm. 

Exercise: 
Problem: 
(a) Calculate the energy in eV of an IR photon of frequency 2.00 x 10° Hz. (b) How many of 
these photons would need to be absorbed simultaneously by a tightly bound molecule to break it 


apart? (c) What is the energy in eV of a y ray of frequency 3.00 x 107? Hz? (d) How many 
tightly bound molecules could a single such y ray break apart? 


Solution: 


(a) 0.0829 eV 


(b) 121 
(c) 1.24 MeV 


(d) 1.24 x 10° 


Exercise: 


Problem: Prove that, to three-digit accuracy, h = 4.14 x 10-1 eV - s, as stated in the text. 
Exercise: 


Problem: 


(a) What is the maximum energy in eV of photons produced in a CRT using a 25.0-kV 
accelerating potential, such as a color TV? (b) What is their frequency? 


Solution: 
(a) 25.0 x 10? eV 


(b) 6.04 x 1018 Hz 
Exercise: 


Problem: 


What is the accelerating voltage of an x-ray tube that produces x rays with a shortest wavelength 
of 0.0103 nm? 


Exercise: 


Problem: 


(a) What is the ratio of power outputs by two microwave ovens having frequencies of 950 and 
2560 MHz, if they emit the same number of photons per second? (b) What is the ratio of photons 
per second if they have the same power output? 


Solution: 
(a) 2.69 


(b) 0.371 
Exercise: 


Problem: 


How many photons per second are emitted by the antenna of a microwave oven, if its power 
output is 1.00 kW at a frequency of 2560 MHz? 


Exercise: 


Problem: 


Some satellites use nuclear power. (a) If such a satellite emits a 1.00-W flux of y rays having an 
average energy of 0.500 MeV, how many are emitted per second? (b) These ¥ rays affect other 
satellites. How far away must another satellite be to only receive one y ray per second per square 
meter? 


Solution: 
(a) 1.25 x 10/8 photons/s 


(b) 997 km 
Exercise: 
Problem: 
(a) If the power output of a 650-kHz radio station is 50.0 kW, how many photons per second are 
produced? (b) If the radio waves are broadcast uniformly in all directions, find the number of 


photons per second per square meter at a distance of 100 km. Assume no reflection from the 
ground or absorption by the air. 


Exercise: 
Problem: 


How many x-ray photons per second are created by an x-ray tube that produces a flux of x rays 
having a power of 1.00 W? Assume the average energy per photon is 75.0 keV. 


Solution: 


8.33 x 10°? photons/s 
Exercise: 
Problem: 
(a) How far away must you be from a 650-kHz radio station with power 50.0 kW for there to be 
only one photon per second per square meter? Assume no reflections or absorption, as if you were 


in deep outer space. (b) Discuss the implications for detecting intelligent life in other solar 
systems by detecting their radio broadcasts. 


Exercise: 
Problem: 
Assuming that 10.0% of a 100-W light bulb’s energy output is in the visible range (typical for 
incandescent bulbs) with an average wavelength of 580 nm, and that the photons spread out 


uniformly and are not absorbed by the atmosphere, how far away would you be if 500 photons per 
second enter the 3.00-mm diameter pupil of your eye? (This number easily stimulates the retina.) 


Solution: 


181 km 


Exercise: 


Problem:Construct Your Own Problem 


Consider a laser pen. Construct a problem in which you calculate the number of photons per 
second emitted by the pen. Among the things to be considered are the laser pen’s wavelength and 
power output. Your instructor may also wish for you to determine the minimum diffraction 
spreading in the beam and the number of photons per square centimeter the pen can project at 
some large distance. In this latter case, you will also need to consider the output size of the laser 
beam, the distance to the object being illuminated, and any absorption or scattering along the way. 


Glossary 


gamma ray 
also y-ray; highest-energy photon in the EM spectrum 


ionizing radiation 
radiation that ionizes materials that absorb it 


x ray 
EM photon between 7-ray and UV in energy 


bremsstrahlung 
German for braking radiation; produced when electrons are decelerated 


characteristic x rays 
x rays whose energy depends on the material they were produced in 


ultraviolet radiation 
UV; ionizing photons slightly more energetic than violet light 


visible light 
the range of photon energies the human eye can detect 


infrared radiation 
photons with energies slightly less than red light 


microwaves 
photons with wavelengths on the order of a micron (um) 


Photon Momentum 


¢ Relate the linear momentum of a photon to its energy or wavelength, 
and apply linear momentum conservation to simple processes 
involving the emission, absorption, or reflection of photons. 

e Account qualitatively for the increase of photon wavelength that is 
observed, and explain the significance of the Compton wavelength. 


Measuring Photon Momentum 


The quantum of EM radiation we call a photon has properties analogous to 
those of particles we can see, such as grains of sand. A photon interacts as a 
unit in collisions or when absorbed, rather than as an extensive wave. 
Massive quanta, like electrons, also act like macroscopic particles— 
something we expect, because they are the smallest units of matter. Particles 
carry momentum as well as energy. Despite photons having no mass, there 
has long been evidence that EM radiation carries momentum. (Maxwell and 
others who studied EM waves predicted that they would carry momentum.) 
It is now a well-established fact that photons do have momentum. In fact, 
photon momentum is suggested by the photoelectric effect, where photons 
knock electrons out of a substance. [link] shows macroscopic evidence of 
photon momentum. 


The tails of the Hale-Bopp 
comet point away from the 
Sun, evidence that light has 
momentum. Dust emanating 
from the body of the comet 
forms this tail. Particles of 
dust are pushed away from 
the Sun by light reflecting 
from them. The blue ionized 
gas tail is also produced by 
photons interacting with 
atoms in the comet material. 
(credit: Geoff Chester, U.S. 
Navy, via Wikimedia 
Commons) 


[link] shows a comet with two prominent tails. What most people do not 
know about the tails is that they always point away from the Sun rather than 
trailing behind the comet (like the tail of Bo Peep’s sheep). Comet tails are 
composed of gases and dust evaporated from the body of the comet and 
ionized gas. The dust particles recoil away from the Sun when photons 
scatter from them. Evidently, photons carry momentum in the direction of 
their motion (away from the Sun), and some of this momentum is 
transferred to dust particles in collisions. Gas atoms and molecules in the 
blue tail are most affected by other particles of radiation, such as protons 
and electrons emanating from the Sun, rather than by the momentum of 
photons. 


Note: 

Connections: Conservation of Momentum 

Not only is momentum conserved in all realms of physics, but all types of 
particles are found to have momentum. We expect particles with mass to 
have momentum, but now we see that massless particles including photons 
also carry momentum. 


Momentum is conserved in quantum mechanics just as it is in relativity and 
classical physics. Some of the earliest direct experimental evidence of this 
came from scattering of x-ray photons by electrons in substances, named 
Compton scattering after the American physicist Arthur H. Compton 
(1892-1962). Around 1923, Compton observed that x rays scattered from 
materials had a decreased energy and correctly analyzed this as being due to 
the scattering of photons from electrons. This phenomenon could be 
handled as a collision between two particles—a photon and an electron at 
rest in the material. Energy and momentum are conserved in the collision. 
(See [link]) He won a Nobel Prize in 1929 for the discovery of this 
scattering, now called the Compton effect, because it helped prove that 
photon momentum is given by 

Equation: 


where hf is Planck’s constant and 4 is the photon wavelength. (Note that 
relativistic momentum given as p = ymu is valid only for particles having 
mass.) 


E = hf E’ = hf’ 
A He 
Before After 
Before w) & 
After 
KE, = E — E’ 


The Compton effect is 
the name given to the 
scattering of a photon 
by an electron. Energy 
and momentum are 
conserved, resulting in 
a reduction of both for 
the scattered photon. 
Studying this effect, 
Compton verified that 
photons have 
momentum. 


We can see that photon momentum is small, since p = h/A and h is very 
small. It is for this reason that we do not ordinarily observe photon 


momentum. Our mirrors do not recoil when light reflects from them (except 
perhaps in cartoons). Compton saw the effects of photon momentum 
because he was observing x rays, which have a small wavelength and a 
relatively large momentum, interacting with the lightest of particles, the 
electron. 


Example: 

Electron and Photon Momentum Compared 

(a) Calculate the momentum of a visible photon that has a wavelength of 
500 nm. (b) Find the velocity of an electron having the same momentum. 
(c) What is the energy of the electron, and how does it compare with the 
energy of the photon? 

Strategy 

Finding the photon momentum is a straightforward application of its 
definition: p = 4. If we find the photon momentum is small, then we can 
assume that an electron with the same momentum will be nonrelativistic, 
making it easy to find its velocity and kinetic energy from the classical 
formulas. 

Solution for (a) 

Photon momentum is given by the equation: 

Equation: 


Bs 


Entering the given photon wavelength yields 
Equation: 


_ 6.63 x 104 J-s 


ane = I ee ey & 


Solution for (b) 

Since this momentum is indeed small, we will use the classical expression 
p = mv to find the velocity of an electron with this momentum. Solving 
for v and using the known value for the mass of an electron gives 


Equation: 


_ 1.33 x 10°" kg- m/s 


mi ee = 1460 m/s = 1460 m/s. 


mee 
WS —— 
m 


Solution for (c) 
The electron has kinetic energy, which is classically given by 
Equation: 


KE. 


| 

| 
3 
S 


Thus, 
Equation: 


1 
KE, = 5 (9-11 x 10° kg)(1455 m/s)? = 9.64 x 10° J. 


Converting this to eV by multiplying by (1 eV) /(1.602 x 10°'° J) yields 
Equation: 


KE, = 6.02 x 10° eV. 


The photon energy F is 
Equation: 


he 1240 eV -nm 
Le aie 


which is about five orders of magnitude greater. 

Discussion 

Photon momentum is indeed small. Even if we have huge numbers of 
them, the total momentum they carry is small. An electron with the same 
momentum has a 1460 m/s velocity, which is clearly nonrelativistic. A 
more massive particle with the same momentum would have an even 
smaller velocity. This is borne out by the fact that it takes far less energy to 
give an electron the same momentum as a photon. But on a quantum- 
mechanical scale, especially for high-energy photons interacting with small 


masses, photon momentum is significant. Even on a large scale, photon 
momentum can have an effect if there are enough of them and if there is 
nothing to prevent the slow recoil of matter. Comet tails are one example, 
but there are also proposals to build space sails that use huge low-mass 
mirrors (made of aluminized Mylar) to reflect sunlight. In the vacuum of 
space, the mirrors would gradually recoil and could actually take 
spacecraft from place to place in the solar system. (See [link].) 


Direction of travel 
—_—_—_—_—_—_—_—_—_—_—_—_—_— 


Solar sail 


(a) 


(a) Space sails have been proposed that use the 
momentum of sunlight reflecting from gigantic low-mass 
sails to propel spacecraft about the solar system. A 
Russian test model of this (the Cosmos 1) was launched 
in 2005, but did not make it into orbit due to a rocket 
failure. (b) A U.S. version of this, labeled LightSail-1, is 
scheduled for trial launches in the first part of this 
decade. It will have a 40-m? sail. (credit: Kim 
Newton/NASA) 


Relativistic Photon Momentum 


There is a relationship between photon momentum p and photon energy & 
that is consistent with the relation given previously for the relativistic total 
energy of a particle as E* = (pc)? + (mc)?. We know m is zero for a 
photon, but p is not, so that E? = (pc)? + (mc)? becomes 


Equation: 


or 
Equation: 


0 | & 


p = — (photons). 


To check the validity of this relation, note that F = hc/A for a photon. 
Substituting this into p = E’/c yields 
Equation: 


p=(he/A)/e= >, 


as determined experimentally and discussed above. Thus, p = E//c is 
equivalent to Compton’s result p = h/X. For a further verification of the 
relationship between photon energy and momentum, see [link]. 


Note: 

Photon Detectors 

Almost all detection systems talked about thus far—eyes, photographic 
plates, photomultiplier tubes in microscopes, and CCD cameras—rely on 
particle-like properties of photons interacting with a sensitive area. A 
change is caused and either the change is cascaded or zillions of points are 
recorded to form an image we detect. These detectors are used in 
biomedical imaging systems, and there is ongoing research into improving 
the efficiency of receiving photons, particularly by cooling detection 
systems and reducing thermal effects. 


Example: 

Photon Energy and Momentum 

Show that p = E’/c for the photon considered in the [link]. 

Strategy 

We will take the energy / found in [link], divide it by the speed of light, 
and see if the same momentum is obtained as before. 

Solution 

Given that the energy of the photon is 2.48 eV and converting this to 
joules, we get 


Equation: 
2.48 eV)(1.60 x 10°19 J/eV 
p= £ = (2.48 eV) (1.60 x 10°" J/eV) = 1.33 x 10°’ kg- m/s. 
© 3.00 x 10° m/s 
Discussion 


This value for momentum is the same as found before (note that unrounded 
values are used in all calculations to avoid even small rounding errors), an 
expected verification of the relationship p = E//c. This also means the 
relationship between energy, momentum, and mass given by 

E* = (pc)? + (mc)? applies to both matter and photons. Once again, note 
that p is not zero, even when ™ is. 


Note: 

Problem-Solving Suggestion 

Note that the forms of the constants h = 4.14 x 10° eV -s and 

he = 1240 eV - nm may be particularly useful for this section’s Problems 
and Exercises. 


Section Summary 


e Photons have momentum, given by p = 4, where A is the photon 
wavelength. 


e Photon energy and momentum are related by p = 2, where 
E = hf = hc/A for a photon. 


Conceptual Questions 


Exercise: 
Problem: 
Which formula may be used for the momentum of all particles, with or 
without mass? 
Exercise: 
Problem: 
Is there any measurable difference between the momentum of a photon 
and the momentum of matter? 
Exercise: 
Problem: 


Why don’t we feel the momentum of sunlight when we are on the 
beach? 


Problems & Exercises 


Exercise: 


Problem: 


(a) Find the momentum of a 4.00-cm-wavelength microwave photon. 
(b) Discuss why you expect the answer to (a) to be very small. 


Solution: 


(a) 1.66 x 10-*? kg - m/s 


(b) The wavelength of microwave photons is large, so the momentum 
they carry is very small. 


Exercise: 
Problem: 
(a) What is the momentum of a 0.0100-nm-wavelength photon that 
could detect details of an atom? (b) What is its energy in MeV? 
Exercise: 
Problem: 


(a) What is the wavelength of a photon that has a momentum of 
5.00 x 10% kg - m/s? (b) Find its energy in eV. 


Solution: 
(a) 13.3 pm 


(b) 9.38 x 107? eV 
Exercise: 
Problem: 
(a) A y-ray photon has a momentum of 8.00 x 10°24 kg- m /s. What 
is its wavelength? (b) Calculate its energy in MeV. 
Exercise: 
Problem: 
(a) Calculate the momentum of a photon having a wavelength of 
2.50 um. (b) Find the velocity of an electron having the same 


momentum. (c) What is the kinetic energy of the electron, and how 
does it compare with that of the photon? 


Solution: 


(a) 2.65 x 10-* kg - m/s 


(b) 291 m/s 


(c) electron 3.86 x 10°76 J, photon 7.96 x 10° 7° J, ratio 2.06 x 10° 
Exercise: 


Problem: 


Repeat the previous problem for a 10.0-nm-wavelength photon. 
Exercise: 

Problem: 

(a) Calculate the wavelength of a photon that has the same momentum 

as a proton moving at 1.00% of the speed of light. (b) What is the 


energy of the photon in MeV? (c) What is the kinetic energy of the 
proton in MeV? 


Solution: 
(ay 1.32 x10" mi 
(b) 9.39 MeV 


(c) 4.70 x 107? MeV 
Exercise: 
Problem: 
(a) Find the momentum of a 100-keV x-ray photon. (b) Find the 


equivalent velocity of a neutron with the same momentum. (c) What is 
the neutron’s kinetic energy in keV? 


Exercise: 
Problem: 
Take the ratio of relativistic rest energy, & = ymc?, to relativistic 


momentum, p = ymu, and show that in the limit that mass approaches 
zero, you find E'/p = c. 


Solution: 


a mc? and P = ymu, so 
Equation: 


As the mass of particle approaches zero, its velocity u will approach c, 
so that the ratio of energy to momentum in this limit is 
Equation: 
. ce 
lim,,»9 — =— =c 


P Cc 


which is consistent with the equation for photon energy. 


Exercise: 


Problem: Construct Your Own Problem 


Consider a space sail such as mentioned in [link]. Construct a problem 
in which you calculate the light pressure on the sail in N/ m? produced 
by reflecting sunlight. Also calculate the force that could be produced 
and how much effect that would have on a spacecraft. Among the 
things to be considered are the intensity of sunlight, its average 
wavelength, the number of photons per square meter this implies, the 
area of the space sail, and the mass of the system being accelerated. 


Exercise: 
Problem: Unreasonable Results 
A car feels a small force due to the light it sends out from its 


headlights, equal to the momentum of the light divided by the time in 
which it is emitted. (a) Calculate the power of each headlight, if they 


exert a total force of 2.00 x 10-2 N backward on the car. (b) What is 
unreasonable about this result? (c) Which assumptions are 
unreasonable or inconsistent? 


Solution: 
(a) 3.00 x 10° W 
(b) Headlights are way too bright. 


(c) Force is too large. 


Glossary 


photon momentum 
the amount of momentum a photon has, calculated by p = 4 = 4 
Compton effect 
the phenomenon whereby x rays scattered from materials have 
decreased energy 


The Particle-Wave Duality 


e Explain what the term particle-wave duality means, and why it is 
applied to EM radiation. 


We have long known that EM radiation is a wave, capable of interference 
and diffraction. We now see that light can be modeled as photons, which are 
massless particles. This may seem contradictory, since we ordinarily deal 
with large objects that never act like both wave and particle. An ocean 
wave, for example, looks nothing like a rock. To understand small-scale 
phenomena, we make analogies with the large-scale phenomena we observe 
directly. When we say something behaves like a wave, we mean it shows 
interference effects analogous to those seen in overlapping water waves. 
(See [link].) Two examples of waves are sound and EM radiation. When we 
say something behaves like a particle, we mean that it interacts as a discrete 
unit with no interference effects. Examples of particles include electrons, 
atoms, and photons of EM radiation. How do we talk about a phenomenon 
that acts like both a particle and a wave? 


Photon 


oe 


Waves Sand 
(a) (b) 


(a) The interference pattern for 
light through a double slit is a 
wave property understood by 

analogy to water waves. (b) The 
properties of photons having 
quantized energy and 
momentum and acting as a 
concentrated unit are 


understood by analogy to 
macroscopic particles. 


There is no doubt that EM radiation interferes and has the properties of 
wavelength and frequency. There is also no doubt that it behaves as 
particles—photons with discrete energy. We call this twofold nature the 
particle-wave duality, meaning that EM radiation has both particle and 
wave properties. This so-called duality is simply a term for properties of the 
photon analogous to phenomena we can observe directly, on a macroscopic 
scale. If this term seems strange, it is because we do not ordinarily observe 
details on the quantum level directly, and our observations yield either 
particle or wavelike properties, but never both simultaneously. 


Since we have a particle-wave duality for photons, and since we have seen 
connections between photons and matter in that both have momentun,, it is 
reasonable to ask whether there is a particle-wave duality for matter as well. 
If the EM radiation we once thought to be a pure wave has particle 
properties, is it possible that matter has wave properties? The answer is yes. 
The consequences are tremendous, as we will begin to see in the next 
section. 


Note: 

PhET Explorations: Quantum Wave Interference 

When do photons, electrons, and atoms behave like particles and when do 
they behave like waves? Watch waves spread out and interfere as they pass 
through a double slit, then get detected on a screen as tiny dots. Use 
quantum detectors to explore how measurements change the waves and the 
patterns they produce on the screen. 


Quantum 
Wave 
Interferenc 
e 


Section Summary 


e EM radiation can behave like either a particle or a wave. 
e This is termed particle-wave duality. 


Glossary 


particle-wave duality 
the property of behaving like either a particle or a wave; the term for 
the phenomenon that all particles have wave characteristics 


The Wave Nature of Matter 


e Describe the Davisson-Germer experiment, and explain how it 
provides evidence for the wave nature of electrons. 


De Broglie Wavelength 


In 1923 a French physics graduate student named Prince Louis-Victor de 
Broglie (1892-1987) made a radical proposal based on the hope that nature 
is symmetric. If EM radiation has both particle and wave properties, then 
nature would be symmetric if matter also had both particle and wave 
properties. If what we once thought of as an unequivocal wave (EM 
radiation) is also a particle, then what we think of as an unequivocal particle 
(matter) may also be a wave. De Broglie’s suggestion, made as part of his 
doctoral thesis, was so radical that it was greeted with some skepticism. A 
copy of his thesis was sent to Einstein, who said it was not only probably 
correct, but that it might be of fundamental importance. With the support of 
Einstein and a few other prominent physicists, de Broglie was awarded his 
doctorate. 


De Broglie took both relativity and quantum mechanics into account to 
develop the proposal that all particles have a wavelength, given by 
Equation: 


A = — (matter and photons), 


3 


where A is Planck’s constant and p is momentum. This is defined to be the 
de Broglie wavelength. (Note that we already have this for photons, from 
the equation p = h/X.) The hallmark of a wave is interference. If matter is 
a wave, then it must exhibit constructive and destructive interference. Why 
isn’t this ordinarily observed? The answer is that in order to see significant 
interference effects, a wave must interact with an object about the same size 
as its wavelength. Since h is very small, A is also small, especially for 
macroscopic objects. A 3-kg bowling ball moving at 10 m/s, for example, 
has 


Equation: 


= h/p = (6.63 x 10°*4 J-s)/[(3 kg)(10 m/s )|= 2 x 10°*° m. 


This means that to see its wave characteristics, the bowling ball would have 
to interact with something about 10~*° m in size—far smaller than anything 
known. When waves interact with objects much larger than their 
wavelength, they show negligible interference effects and move in straight 
lines (such as light rays in geometric optics). To get easily observed 
interference effects from particles of matter, the longest wavelength and 
hence smallest mass possible would be useful. Therefore, this effect was 
first observed with electrons. 


American physicists Clinton J. Davisson and Lester H. Germer in 1925 and, 
independently, British physicist G. P. Thomson (son of J. J. Thomson, 
discoverer of the electron) in 1926 scattered electrons from crystals and 
found diffraction patterns. These patterns are exactly consistent with 
interference of electrons having the de Broglie wavelength and are 
somewhat analogous to light interacting with a diffraction grating. (See 
[link].) 


Note: 

Connections: Waves 

All microscopic particles, whether massless, like photons, or having mass, 
like electrons, have wave properties. The relationship between momentum 
and wavelength is fundamental for all particles. 


De Broglie’s proposal of a wave nature for all particles initiated a 
remarkably productive era in which the foundations for quantum mechanics 
were laid. In 1926, the Austrian physicist Erwin Schrédinger (1887-1961) 
published four papers in which the wave nature of particles was treated 
explicitly with wave equations. At the same time, many others began 
important work. Among them was German physicist Werner Heisenberg 


(1901-1976) who, among many other contributions to quantum mechanics, 
formulated a mathematical treatment of the wave nature of matter that used 
matrices rather than wave equations. We will deal with some specifics in 
later sections, but it is worth noting that de Broglie’s work was a watershed 
for the development of quantum mechanics. De Broglie was awarded the 
Nobel Prize in 1929 for his vision, as were Davisson and G. P. Thomson in 
1937 for their experimental verification of de Broglie’s hypothesis. 


This diffraction pattern was 
obtained for electrons diffracted 
by crystalline silicon. Bright 
regions are those of constructive 
interference, while dark regions 
are those of destructive 
interference. (credit: Ndthe, 
Wikimedia Commons) 


Example: 


Electron Wavelength versus Velocity and Energy 

For an electron having a de Broglie wavelength of 0.167 nm (appropriate 
for interacting with crystal lattice structures that are about this size): (a) 
Calculate the electron’s velocity, assuming it is nonrelativistic. (b) 
Calculate the electron’s kinetic energy in eV. 

Strategy 

For part (a), since the de Broglie wavelength is given, the electron’s 
velocity can be obtained from A = h/p by using the nonrelativistic 
formula for momentum, p = mv. For part (b), once v is obtained (and it 
has been verified that v is nonrelativistic), the classical kinetic energy is 
simply (1/2)mv?. 

Solution for (a) 

Substituting the nonrelativistic formula for momentum (p = mv) into the 
de Broglie wavelength gives 


Equation: 
h h 
rv = 
Dp mv 
Solving for v gives 
Equation: 
h 
v= —. 
mA 


Substituting known values yields 
Equation: 


6.63 x 10% J-s é 
v= ——_—_____—_—__——_ == 4.36 x 10° m/s. 
(9.11 x 10°! kg) (0.167 x 10°° m) / 


Solution for (b) 

While fast compared with a car, this electron’s speed is not highly 
relativistic, and so we can comfortably use the classical formula to find the 
electron’s kinetic energy and convert it to eV as requested. 

Equation: 


KE = smu" 


= +4(9.11 x 10°! kg)(4.36 x 10° m/s)? 
= (86.4x 10 J)(2%55) 


1.602x10°!9 J 
= 540 eV 


Discussion 

This low energy means that these 0.167-nm electrons could be obtained by 
accelerating them through a 54.0-V electrostatic potential, an easy task. 
The results also confirm the assumption that the electrons are 
nonrelativistic, since their velocity is just over 1% of the speed of light and 
the kinetic energy is about 0.01% of the rest energy of an electron (0.511 
MeV). If the electrons had turned out to be relativistic, we would have had 
to use more involved calculations employing relativistic formulas. 


Electron Microscopes 


One consequence or use of the wave nature of matter is found in the 
electron microscope. As we have discussed, there is a limit to the detail 
observed with any probe having a wavelength. Resolution, or observable 
detail, is limited to about one wavelength. Since a potential of only 54 V 
can produce electrons with sub-nanometer wavelengths, it is easy to get 
electrons with much smaller wavelengths than those of visible light 
(hundreds of nanometers). Electron microscopes can, thus, be constructed 
to detect much smaller details than optical microscopes. (See [link].) 


There are basically two types of electron microscopes. The transmission 
electron microscope (TEM) accelerates electrons that are emitted from a hot 
filament (the cathode). The beam is broadened and then passes through the 
sample. A magnetic lens focuses the beam image onto a fluorescent screen, 
a photographic plate, or (most probably) a CCD (light sensitive camera), 
from which it is transferred to a computer. The TEM is similar to the optical 
microscope, but it requires a thin sample examined in a vacuum. However it 
can resolve details as small as 0.1 nm (10~?° m), providing magnifications 


of 100 million times the size of the original object. The TEM has allowed 
us to see individual atoms and structure of cell nuclei. 


The scanning electron microscope (SEM) provides images by using 
secondary electrons produced by the primary beam interacting with the 
surface of the sample (see [link]). The SEM also uses magnetic lenses to 
focus the beam onto the sample. However, it moves the beam around 
electrically to “scan” the sample in the x and y directions. A CCD detector 
is used to process the data for each electron position, producing images like 
the one at the beginning of this chapter. The SEM has the advantage of not 
requiring a thin sample and of providing a 3-D view. However, its 
resolution is about ten times less than a TEM. 


Electron source 
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(a) (b) 


Schematic of a scanning electron microscope (SEM) (a) used to 
observe small details, such as those seen in this image of a tooth of a 
Himipristis, a type of shark (b). (credit: Dallas Krentzel, Flickr) 


Electrons were the first particles with mass to be directly confirmed to have 
the wavelength proposed by de Broglie. Subsequently, protons, helium 
nuclei, neutrons, and many others have been observed to exhibit 


interference when they interact with objects having sizes similar to their de 
Broglie wavelength. The de Broglie wavelength for massless particles was 
well established in the 1920s for photons, and it has since been observed 
that all massless particles have a de Broglie wavelength A = h/p. The 
wave nature of all particles is a universal characteristic of nature. We shall 
see in following sections that implications of the de Broglie wavelength 
include the quantization of energy in atoms and molecules, and an alteration 
of our basic view of nature on the microscopic scale. The next section, for 
example, shows that there are limits to the precision with which we may 
make predictions, regardless of how hard we try. There are even limits to 
the precision with which we may measure an object’s location or energy. 


Note: 

Making Connections: A Submicroscopic Diffraction Grating 

The wave nature of matter allows it to exhibit all the characteristics of 
other, more familiar, waves. Diffraction gratings, for example, produce 
diffraction patterns for light that depend on grating spacing and the 
wavelength of the light. This effect, as with most wave phenomena, is most 
pronounced when the wave interacts with objects having a size similar to 
its wavelength. For gratings, this is the spacing between multiple slits.) 
When electrons interact with a system having a spacing similar to the 
electron wavelength, they show the same types of interference patterns as 
light does for diffraction gratings, as shown at top left in [link]. 

Atoms are spaced at regular intervals in a crystal as parallel planes, as 
shown in the bottom part of [link]. The spacings between these planes act 
like the openings in a diffraction grating. At certain incident angles, the 
paths of electrons scattering from successive planes differ by one 
wavelength and, thus, interfere constructively. At other angles, the path 
length differences are not an integral wavelength, and there is partial to 
total destructive interference. This type of scattering from a large crystal 
with well-defined lattice planes can produce dramatic interference patterns. 
It is called Bragg reflection, for the father-and-son team who first explored 
and analyzed it in some detail. The expanded view also shows the path- 
length differences and indicates how these depend on incident angle 6 in a 


manner similar to the diffraction patterns for x rays reflecting from a 


crystal. 
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The diffraction pattern at top left is 
produced by scattering electrons from 
a crystal and is graphed as a function 

of incident angle relative to the 
regular array of atoms in a crystal, as 
shown at bottom. Electrons scattering 
from the second layer of atoms travel 
farther than those scattered from the 
top layer. If the path length difference 
(PLD) is an integral wavelength, there 
is constructive interference. 


Let us take the spacing between parallel planes of atoms in the crystal to be 
d. As mentioned, if the path length difference (PLD) for the electrons is a 
whole number of wavelengths, there will be constructive interference— 
that is, PLD = n(n = 1, 2, 3,...). Because AB = BC = d sin 0, we 
have constructive interference when nA = 2d sin 8. This relationship is 


called the Bragg equation and applies not only to electrons but also to x 
rays. 

The wavelength of matter is a submicroscopic characteristic that explains a 
macroscopic phenomenon such as Bragg reflection. Similarly, the 
wavelength of light is a submicroscopic characteristic that explains the 
macroscopic phenomenon of diffraction patterns. 


Section Summary 


e Particles of matter also have a wavelength, called the de Broglie 
wavelength, given by A = a where p is momentum. 


e Matter is found to have the same interference characteristics as any 
other wave. 


Conceptual Questions 


Exercise: 


Problem: 


How does the interference of water waves differ from the interference 
of electrons? How are they analogous? 


Exercise: 


Problem: Describe one type of evidence for the wave nature of matter. 
Exercise: 


Problem: 


Describe one type of evidence for the particle nature of EM radiation. 


Problems & Exercises 


Exercise: 


Problem: 
At what velocity will an electron have a wavelength of 1.00 m? 
Solution: 


7.28x 104m 
Exercise: 
Problem: 
What is the wavelength of an electron moving at 3.00% of the speed of 
light? 
Exercise: 
Problem: 
At what velocity does a proton have a 6.00-fm wavelength (about the 


size of a nucleus)? Assume the proton is nonrelativistic. (1 femtometer 
10"? ms) 


Solution: 


6.62 x 10’ m/s 
Exercise: 


Problem: 


What is the velocity of a 0.400-kg billiard ball if its wavelength is 7.50 
cm (large enough for it to interfere with other billiard balls)? 


Exercise: 


Problem: 
Find the wavelength of a proton moving at 1.00% of the speed of light. 


Solution: 


132% 10 m 
Exercise: 
Problem: 
Experiments are performed with ultracold neutrons having velocities 


as small as 1.00 m/s. (a) What is the wavelength of such a neutron? (b) 
What is its kinetic energy in eV? 


Exercise: 
Problem: 
(a) Find the velocity of a neutron that has a 6.00-fm wavelength (about 


the size of a nucleus). Assume the neutron is nonrelativistic. (b) What 
is the neutron’s kinetic energy in MeV? 


Solution: 
(a) 6.62 x 10’ m/s 


(b) 22.9 MeV 
Exercise: 
Problem: 
What is the wavelength of an electron accelerated through a 30.0-kV 
potential, as in a TV tube? 
Exercise: 
Problem: 


What is the kinetic energy of an electron in a TEM having a 0.0100- 
nm wavelength? 


Solution: 
Equation:15.1 keV 


Exercise: 


Problem: 


(a) Calculate the velocity of an electron that has a wavelength of 
1.00 pm. (b) Through what voltage must the electron be accelerated to 
have this velocity? 


Exercise: 
Problem: 
The velocity of a proton emerging from a Van de Graaff accelerator is 
25.0% of the speed of light. (a) What is the proton’s wavelength? (b) 


What is its kinetic energy, assuming it is nonrelativistic? (c) What was 
the equivalent voltage through which it was accelerated? 


Solution: 
(a) 5.29 fm 
(b)4.70:<10°? J 


(c) 29.4 MV 

Exercise: 
Problem: 
The kinetic energy of an electron accelerated in an x-ray tube is 100 
keV. Assuming it is nonrelativistic, what is its wavelength? 

Exercise: 
Problem: Unreasonable Results 
(a) Assuming it is nonrelativistic, calculate the velocity of an electron 
with a 0.100-fm wavelength (small enough to detect details of a 
nucleus). (b) What is unreasonable about this result? (c) Which 


assumptions are unreasonable or inconsistent? 


Solution: 


(a) 7.28 x 10? m/s 
(b) This is thousands of times the speed of light (an impossibility). 
(c) The assumption that the electron is non-relativistic is unreasonable 
at this wavelength. 
Glossary 
de Broglie wavelength 


the wavelength possessed by a particle of matter, calculated by 
A = h/p 


Probability: The Heisenberg Uncertainty Principle 


¢ Use both versions of Heisenberg’s uncertainty principle in 
calculations. 

e Explain the implications of Heisenberg’s uncertainty principle for 
measurements. 


Probability Distribution 


Matter and photons are waves, implying they are spread out over some 
distance. What is the position of a particle, such as an electron? Is it at the 
center of the wave? The answer lies in how you measure the position of an 
electron. Experiments show that you will find the electron at some definite 
location, unlike a wave. But if you set up exactly the same situation and 
measure it again, you will find the electron in a different location, often far 
outside any experimental uncertainty in your measurement. Repeated 
measurements will display a statistical distribution of locations that appears 
wavelike. (See [link].) 
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Intensity 


The building up of the diffraction 
pattern of electrons scattered from a 
crystal surface. Each electron arrives 


at a definite location, which cannot be 
precisely predicted. The overall 
distribution shown at the bottom can 
be predicted as the diffraction of 
waves having the de Broglie 
wavelength of the electrons. 


(a) Electrons (b) Protons 


Double-slit interference for electrons (a) and 
protons (b) is identical for equal 
wavelengths and equal slit separations. Both 
patterns are probability distributions in the 
sense that they are built up by individual 
particles traversing the apparatus, the paths 
of which are not individually predictable. 


After de Broglie proposed the wave nature of matter, many physicists, 
including Schrédinger and Heisenberg, explored the consequences. The 
idea quickly emerged that, because of its wave character, a particle’s 
trajectory and destination cannot be precisely predicted for each particle 
individually. However, each particle goes to a definite place (as illustrated 
in [link]). After compiling enough data, you get a distribution related to the 


particle’s wavelength and diffraction pattern. There is a certain probability 
of finding the particle at a given location, and the overall pattern is called a 
probability distribution. Those who developed quantum mechanics 
devised equations that predicted the probability distribution in various 
circumstances. 


It is somewhat disquieting to think that you cannot predict exactly where an 
individual particle will go, or even follow it to its destination. Let us 
explore what happens if we try to follow a particle. Consider the double-slit 
patterns obtained for electrons and photons in [link]. First, we note that 
these patterns are identical, following d sin 8 = m4, the equation for 
double-slit constructive interference developed in Photon Energies and the 
Electromagnetic Spectrum, where d is the slit separation and J is the 
electron or photon wavelength. 


Both patterns build up statistically as individual particles fall on the 
detector. This can be observed for photons or electrons—for now, let us 
concentrate on electrons. You might imagine that the electrons are 
interfering with one another as any waves do. To test this, you can lower the 
intensity until there is never more than one electron between the slits and 
the screen. The same interference pattern builds up! This implies that a 
particle’s probability distribution spans both slits, and the particles actually 
interfere with themselves. Does this also mean that the electron goes 
through both slits? An electron is a basic unit of matter that is not divisible. 
But it is a fair question, and so we should look to see if the electron 
traverses one slit or the other, or both. One possibility is to have coils 
around the slits that detect charges moving through them. What is observed 
is that an electron always goes through one slit or the other; it does not split 
to go through both. But there is a catch. If you determine that the electron 
went through one of the slits, you no longer get a double slit pattern— 
instead, you get single slit interference. There is no escape by using another 
method of determining which slit the electron went through. Knowing the 
particle went through one slit forces a single-slit pattern. If you do not 
observe which slit the electron goes through, you obtain a double-slit 
pattern. 


Heisenberg Uncertainty 


How does knowing which slit the electron passed through change the 
pattern? The answer is fundamentally important—measurement affects the 
system being observed. Information can be lost, and in some cases it is 
impossible to measure two physical quantities simultaneously to exact 
precision. For example, you can measure the position of a moving electron 
by scattering light or other electrons from it. Those probes have momentum 
themselves, and by scattering from the electron, they change its momentum 
in a manner that loses information. There is a limit to absolute knowledge, 
even in principle. 


Werner Heisenberg 

was one of the best 

of those physicists 
who developed 
early quantum 
mechanics. Not 


only did his work 
enable a description 
of nature on the 
very small scale, it 
also changed our 


view of the 
availability of 
knowledge. 
Although he is 
universally 
recognized for his 
brilliance and the 
importance of his 
work (he received 
the Nobel Prize in 
1932, for example), 
Heisenberg 
remained in 
Germany during 
World War II and 
headed the German 
effort to build a 
nuclear bomb, 
permanently 
alienating himself 
from most of the 
scientific 
community. (credit: 
Author Unknown, 
via Wikimedia 
Commons) 


It was Werner Heisenberg who first stated this limit to knowledge in 1929 
as a result of his work on quantum mechanics and the wave characteristics 
of all particles. (See [link]). Specifically, consider simultaneously 
measuring the position and momentum of an electron (it could be any 
particle). There is an uncertainty in position Az that is approximately 
equal to the wavelength of the particle. That is, 

Equation: 


Nee; 


As discussed above, a wave is not located at one point in space. If the 
electron’s position is measured repeatedly, a spread in locations will be 
observed, implying an uncertainty in position Az. To detect the position of 
the particle, we must interact with it, such as having it collide with a 
detector. In the collision, the particle will lose momentum. This change in 
momentum could be anywhere from close to zero to the total momentum of 
the particle, p = h/X. It is not possible to tell how much momentum will be 
transferred to a detector, and so there is an uncertainty in momentum Ap, 
too. In fact, the uncertainty in momentum may be as large as the momentum 
itself, which in equation form means that 

Equation: 


h 
Ap —. 
aa 


The uncertainty in position can be reduced by using a shorter-wavelength 
electron, since Az ~ A. But shortening the wavelength increases the 
uncertainty in momentum, since Ap ~ h/X. Conversely, the uncertainty in 
momentum can be reduced by using a longer-wavelength electron, but this 
increases the uncertainty in position. Mathematically, you can express this 
trade-off by multiplying the uncertainties. The wavelength cancels, leaving 
Equation: 


AxAp ~& h. 


So if one uncertainty is reduced, the other must increase so that their 
product is = h. 


With the use of advanced mathematics, Heisenberg showed that the best 
that can be done in a simultaneous measurement of position and momentum 
is 

Equation: 


h 
AzAp > —. 
An 


This is known as the Heisenberg uncertainty principle. It is impossible to 
measure position x and momentum p simultaneously with uncertainties Ax 
and Ap that multiply to be less than h/4n. Neither uncertainty can be zero. 
Neither uncertainty can become small without the other becoming large. A 
small wavelength allows accurate position measurement, but it increases the 
momentum of the probe to the point that it further disturbs the momentum 
of a system being measured. For example, if an electron is scattered from an 
atom and has a wavelength small enough to detect the position of electrons 
in the atom, its momentum can knock the electrons from their orbits in a 
manner that loses information about their original motion. It is therefore 
impossible to follow an electron in its orbit around an atom. If you measure 
the electron’s position, you will find it in a definite location, but the atom 
will be disrupted. Repeated measurements on identical atoms will produce 
interesting probability distributions for electrons around the atom, but they 
will not produce motion information. The probability distributions are 
referred to as electron clouds or orbitals. The shapes of these orbitals are 
often shown in general chemistry texts and are discussed in The Wave 
Nature of Matter Causes Quantization. 


Example: 

Heisenberg Uncertainty Principle in Position and Momentum for an 
Atom 

(a) If the position of an electron in an atom is measured to an accuracy of 
0.0100 nm, what is the electron’s uncertainty in velocity? (b) If the electron 
has this velocity, what is its kinetic energy in eV? 

Strategy 

The uncertainty in position is the accuracy of the measurement, or 

Az = 0.0100 nm. Thus the smallest uncertainty in momentum Ap can be 
calculated using AxAp > h/4z. Once the uncertainty in momentum Ap 
is found, the uncertainty in velocity can be found from Ap = mAv. 
Solution for (a) 


Using the equals sign in the uncertainty principle to express the minimum 
uncertainty, we have 


Equation: 
h 
AxAp = —. 

e An 
Solving for Ap and substituting known values gives 
Equation: 

A h 6.63 x 10°4 J-s 598 x10 k / 
a SS SS eS 5 v m Ss. 
Pde 4n(1.00 x 10 m) Ss 

Thus, 
Equation: 


Ap = 5.28 x 104 kg - m/s = mAv. 


Solving for Av and substituting the mass of an electron gives 
Equation: 


A 5.28 x 1074 kg - m/s 
NG ee ee = 5.79 x 10° m/s. 
m 9.11 x 10° kg 


Solution for (b) 
Although large, this velocity is not highly relativistic, and so the electron’s 
kinetic energy is 


Equation: 
Khe = smu" 
= +$(9.11 x 10%! kg)(5.79 x 10° m/s)” 
= (1.53 x 1077 J) (<br; ) = 95.5 ev. 
Discussion 


Since atoms are roughly 0.1 nm in size, knowing the position of an 
electron to 0.0100 nm localizes it reasonably well inside the atom. This 


would be like being able to see details one-tenth the size of the atom. But 
the consequent uncertainty in velocity is large. You certainly could not 
follow it very well if its velocity is so uncertain. To get a further idea of 
how large the uncertainty in velocity is, we assumed the velocity of the 
electron was equal to its uncertainty and found this gave a kinetic energy of 
95.5 eV. This is significantly greater than the typical energy difference 
between levels in atoms (see [link]), so that it is impossible to get a 
meaningful energy for the electron if we know its position even moderately 
well. 


Why don’t we notice Heisenberg’s uncertainty principle in everyday life? 
The answer is that Planck’s constant is very small. Thus the lower limit in 
the uncertainty of measuring the position and momentum of large objects is 
negligible. We can detect sunlight reflected from Jupiter and follow the 
planet in its orbit around the Sun. The reflected sunlight alters the 
momentum of Jupiter and creates an uncertainty in its momentum, but this 
is totally negligible compared with Jupiter’s huge momentum. The 
correspondence principle tells us that the predictions of quantum mechanics 
become indistinguishable from classical physics for large objects, which is 
the case here. 


Heisenberg Uncertainty for Energy and Time 


There is another form of Heisenberg’s uncertainty principle for 
simultaneous measurements of energy and time. In equation form, 
Equation: 


AEAt > a 
An 


where AF is the uncertainty in energy and At is the uncertainty in time. 
This means that within a time interval At, it is not possible to measure 
energy precisely—there will be an uncertainty AF in the measurement. In 
order to measure energy more precisely (to make AE smaller), we must 


increase At. This time interval may be the amount of time we take to make 
the measurement, or it could be the amount of time a particular state exists, 
as in the next [link]. 


Example: 

Heisenberg Uncertainty Principle for Energy and Time for an Atom 
An atom in an excited state temporarily stores energy. If the lifetime of this 
excited state is measured to be 1.0107!" s, what is the minimum 
uncertainty in the energy of the state in eV? 

Strategy 

The minimum uncertainty in energy AF is found by using the equals sign 
in AEFAt > h/4z and corresponds to a reasonable choice for the 
uncertainty in time. The largest the uncertainty in time can be is the full 
lifetime of the excited state, or At = 1.0x10~-!"s. 

Solution 

Solving the uncertainty principle for AF and substituting known values 
gives 

Equation: 


h — 663x 10% J-s 


E= — = 5 x 107 J. 
4nAt = 4n(1.0x1071"s) 


Now converting to eV yields 
Equation: 


leV 


AE = (5.3 x 107 J (a5 
( ) 1.6 x 10°! J 


) =F3 21° e 


Discussion 

The lifetime of 10° s is typical of excited states in atoms—on human 
time scales, they quickly emit their stored energy. An uncertainty in energy 
of only a few millionths of an eV results. This uncertainty is small 
compared with typical excitation energies in atoms, which are on the order 
of 1 eV. So here the uncertainty principle limits the accuracy with which 


we can measure the lifetime and energy of such states, but not very 
significantly. 


The uncertainty principle for energy and time can be of great significance if 
the lifetime of a system is very short. Then At is very small, and AF is 
consequently very large. Some nuclei and exotic particles have extremely 
short lifetimes (as small as 107° s), causing uncertainties in energy as great 
as many GeV (10° eV). Stored energy appears as increased rest mass, and 
so this means that there is significant uncertainty in the rest mass of short- 
lived particles. When measured repeatedly, a spread of masses or decay 
energies are obtained. The spread is AF. You might ask whether this 
uncertainty in energy could be avoided by not measuring the lifetime. The 
answer is no. Nature knows the lifetime, and so its brevity affects the 
energy of the particle. This is so well established experimentally that the 
uncertainty in decay energy is used to calculate the lifetime of short-lived 
states. Some nuclei and particles are so short-lived that it is difficult to 
measure their lifetime. But if their decay energy can be measured, its spread 
is AF, and this is used in the uncertainty principle (ABAt > h/47) to 
calculate the lifetime At. 


There is another consequence of the uncertainty principle for energy and 
time. If energy is uncertain by AF, then conservation of energy can be 
violated by AF for a time At. Neither the physicist nor nature can tell that 
conservation of energy has been violated, if the violation is temporary and 
smaller than the uncertainty in energy. While this sounds innocuous enough, 
we shall see in later chapters that it allows the temporary creation of matter 
from nothing and has implications for how nature transmits forces over very 
small distances. 


Finally, note that in the discussion of particles and waves, we have stated 
that individual measurements produce precise or particle-like results. A 
definite position is determined each time we observe an electron, for 
example. But repeated measurements produce a spread in values consistent 
with wave characteristics. The great theoretical physicist Richard Feynman 
(1918-1988) commented, “What there are, are particles.” When you 


observe enough of them, they distribute themselves as you would expect for 
a wave phenomenon. However, what there are as they travel we cannot tell 
because, when we do try to measure, we affect the traveling. 


Section Summary 


e Matter is found to have the same interference characteristics as any 
other wave. 

e There is now a probability distribution for the location of a particle 
rather than a definite position. 

e Another consequence of the wave character of all particles is the 
Heisenberg uncertainty principle, which limits the precision with 
which certain physical quantities can be known simultaneously. For 


position and momentum, the uncertainty principle is AxAp > fe, 


where Az is the uncertainty in position and Ap is the uncertainty in 
momentum. 
e For energy and time, the uncertainty principle is AF At > Fo where 


AF is the uncertainty in energy and At is the uncertainty in time. 
e These small limits are fundamentally important on the quantum- 
mechanical scale. 


Conceptual Questions 


Exercise: 


Problem: 
What is the Heisenberg uncertainty principle? Does it place limits on 
what can be known? 

Problems & Exercises 


Exercise: 


Problem: 


(a) If the position of an electron in a membrane is measured to an 
accuracy of 1.00 pm, what is the electron’s minimum uncertainty in 
velocity? (b) If the electron has this velocity, what is its kinetic energy 
in eV? (c) What are the implications of this energy, comparing it to 
typical molecular binding energies? 


Solution: 
(a) 57.9 m/s 
(b) 9.55 x 10-8 eV 


(c) From [link], we see that typical molecular binding energies range 
from about 1eV to 10 eV, therefore the result in part (b) is 
approximately 9 orders of magnitude smaller than typical molecular 
binding energies. 


Exercise: 


Problem: 


(a) If the position of a chlorine ion in a membrane is measured to an 
accuracy of 1.00 pm, what is its minimum uncertainty in velocity, 
given its mass is 5.86 x 10° 2° kg? (b) If the ion has this velocity, what 
is its kinetic energy in eV, and how does this compare with typical 
molecular binding energies? 


Exercise: 


Problem: 


Suppose the velocity of an electron in an atom is known to an accuracy 
of 2.0 x 10° m /s (reasonably accurate compared with orbital 
velocities). What is the electron’s minimum uncertainty in position, 
and how does this compare with the approximate 0.1-nm size of the 
atom? 


Solution: 
29 nm, 


290 times greater 
Exercise: 
Problem: 
The velocity of a proton in an accelerator is known to an accuracy of 


0.250% of the speed of light. (This could be small compared with its 
velocity.) What is the smallest possible uncertainty in its position? 


Exercise: 
Problem: 


A relatively long-lived excited state of an atom has a lifetime of 3.00 
ms. What is the minimum uncertainty in its energy? 


Solution: 


1.10 x 10° eV 
Exercise: 


Problem: 


(a) The lifetime of a highly unstable nucleus is 10° 2° s. What is the 
smallest uncertainty in its decay energy? (b) Compare this with the rest 
energy of an electron. 


Exercise: 
Problem: 
The decay energy of a short-lived particle has an uncertainty of 1.0 


MeV due to its short lifetime. What is the smallest lifetime it can 
have? 


Solution: 


3.3 x 10°?" s 
Exercise: 
Problem: 
The decay energy of a short-lived nuclear excited state has an 


uncertainty of 2.0 eV due to its short lifetime. What is the smallest 
lifetime it can have? 


Exercise: 


Problem: 


What is the approximate uncertainty in the mass of a muon, as 
determined from its decay lifetime? 


Solution: 


2.66 x 10 * kg 
Exercise: 


Problem: 


Derive the approximate form of Heisenberg’s uncertainty principle for 
energy and time, AF At ~ h, using the following arguments: Since 
the position of a particle is uncertain by Ax = 4, where 4 is the 
wavelength of the photon used to examine it, there is an uncertainty in 
the time the photon takes to traverse Ax. Furthermore, the photon has 
an energy related to its wavelength, and it can transfer some or all of 
this energy to the object being examined. Thus the uncertainty in the 
energy of the object is also related to A. Find At and AE; then 
multiply them to give the approximate uncertainty principle. 


Glossary 
Heisenberg’s uncertainty principle 


a fundamental limit to the precision with which pairs of quantities 
(momentum and position, and energy and time) can be measured 


uncertainty in energy 
lack of precision or lack of knowledge of precise results in 
measurements of energy 


uncertainty in time 
lack of precision or lack of knowledge of precise results in 
measurements of time 


uncertainty in momentum 
lack of precision or lack of knowledge of precise results in 
measurements of momentum 


uncertainty in position 
lack of precision or lack of knowledge of precise results in 
measurements of position 


probability distribution 
the overall spatial distribution of probabilities to find a particle at a 
given location 


The Particle-Wave Duality Reviewed 
e Explain the concept of particle-wave duality, and its scope. 


Particle-wave duality—the fact that all particles have wave properties—is 
one of the cornerstones of quantum mechanics. We first came across it in 
the treatment of photons, those particles of EM radiation that exhibit both 
particle and wave properties, but not at the same time. Later it was noted 
that particles of matter have wave properties as well. The dual properties of 
particles and waves are found for all particles, whether massless like 
photons, or having a mass like electrons. (See [link].) 
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Ona 
quantum- 
mechanical 
scale (i.e., 
very small), 
particles with 
and without 
mass have 
wave 
properties. 
For example, 
both 
electrons and 
photons have 
wavelengths 
but also 
behave as 
particles. 


There are many submicroscopic particles in nature. Most have mass and are 
expected to act as particles, or the smallest units of matter. All these masses 
have wave properties, with wavelengths given by the de Broglie 
relationship X = h/p. So, too, do combinations of these particles, such as 
nuclei, atoms, and molecules. As a combination of masses becomes large, 
particularly if it is large enough to be called macroscopic, its wave nature 
becomes difficult to observe. This is consistent with our common 
experience with matter. 


Some particles in nature are massless. We have only treated the photon so 
far, but all massless entities travel at the speed of light, have a wavelength, 
and exhibit particle and wave behaviors. They have momentum given by a 
rearrangement of the de Broglie relationship, p = h/A. In large 
combinations of these massless particles (such large combinations are 
common only for photons or EM waves), there is mostly wave behavior 
upon detection, and the particle nature becomes difficult to observe. This is 
also consistent with experience. (See [link].) 


Massive particle Massless wave 


On a classical scale 
(macroscopic), particles 
with mass behave as 
particles and not as 
waves. Particles without 
mass act as waves and not 
as particles. 


The particle-wave duality is a universal attribute. It is another connection 
between matter and energy. Not only has modern physics been able to 


describe nature for high speeds and small sizes, it has also discovered new 
connections and symmetries. There is greater unity and symmetry in nature 
than was known in the classical era—but they were dreamt of. A beautiful 
poem written by the English poet William Blake some two centuries ago 
contains the following four lines: 


To see the World in a Grain of Sand 
And a Heaven in a Wild Flower 
Hold Infinity in the palm of your hand 


And Eternity in an hour 


Integrated Concepts 


The problem set for this section involves concepts from this chapter and 
several others. Physics is most interesting when applied to general 
situations involving more than a narrow set of physical principles. For 
example, photons have momentum, hence the relevance of Linear 
Momentum and Collisions. The following topics are involved in some or all 
of the problems in this section: 


¢ Dynamics: Newton’s Laws of Motion 

e Work, Energy, and Energy Resources 

e Linear Momentum and Collisions 

e Heat and Heat Transfer Methods 

e Electric Potential and Electric Field 

e Electric Current, Resistance, and Ohm’s Law 
e Wave Optics 

e Special Relativity 


Note: 
Problem-Solving Strategy 


1. Identify which physical principles are involved. 


2. Solve the problem using strategies outlined in the text. 


[link] illustrates how these strategies are applied to an integrated-concept 
problem. 


Example: 

Recoil of a Dust Particle after Absorbing a Photon 

The following topics are involved in this integrated concepts worked 
example: 


Photons (quantum mechanics) 
Linear Momentum 
Topics 


A 550-nm photon (visible light) is absorbed by a 1.00-1g particle of dust 
in outer space. (a) Find the momentum of such a photon. (b) What is the 
recoil velocity of the particle of dust, assuming it is initially at rest? 
Strategy Step 1 

To solve an integrated-concept problem, such as those following this 
example, we must first identify the physical principles involved and 
identify the chapters in which they are found. Part (a) of this example asks 
for the momentum of a photon, a topic of the present chapter. Part (b) 
considers recoil following a collision, a topic of Linear Momentum and 
Collisions. 

Strategy Step 2 


The following solutions to each part of the example illustrate how specific 
problem-solving strategies are applied. These involve identifying knowns 
and unknowns, checking to see if the answer is reasonable, and so on. 
Solution for (a) 

The momentum of a photon is related to its wavelength by the equation: 
Equation: 


ia Ne 
Entering the known value for Planck’s constant h and given the 
wavelength A, we obtain 
Equation: 


—  6.63x107*4 J-s 
Di 550x109 m 


= 1.21x10-*"kg- m/s. 


Discussion for (a) 

This momentum is small, as expected from discussions in the text and the 
fact that photons of visible light carry small amounts of energy and 
momentum compared with those carried by macroscopic objects. 
Solution for (b) 

Conservation of momentum in the absorption of this photon by a grain of 
dust can be analyzed using the equation: 

Equation: 


Pit+ p2=ph + plo (Fret = 0). 


The net external force is zero, since the dust is in outer space. Let 1 
represent the photon and 2 the dust particle. Before the collision, the dust is 
at rest (relative to some observer); after the collision, there is no photon (it 
is absorbed). So conservation of momentum can be written 

Equation: 


Pi = plo = MV, 


where pj is the photon momentum before the collision and p/, is the dust 
momentum after the collision. The mass and recoil velocity of the dust are 


m and v, respectively. Solving this for v, the requested quantity, yields 
Equation: 


UU = —-s 
m 


where p is the photon momentum found in part (a). Entering known values 
(noting that a microgram is 10° kg) gives 


Equation: 
Fee 1.21x10~?’ kg-m/s 
1.00x10° kg 
= 1.21x10°8 m/s. 
Discussion 


The recoil velocity of the particle of dust is extremely small. As we have 
noted, however, there are immense numbers of photons in sunlight and 
other macroscopic sources. In time, collisions and absorption of many 
photons could cause a significant recoil of the dust, as observed in comet 
tails. 


Section Summary 


e The particle-wave duality refers to the fact that all particles—those 
with mass and those without mass—have wave characteristics. 
e This is a further connection between mass and energy. 


Conceptual Questions 


Exercise: 


Problem: 
In what ways are matter and energy related that were not known before 
the development of relativity and quantum mechanics? 


Problems & Exercises 


Exercise: 


Problem: Integrated Concepts 


The 54.0-eV electron in [link] has a 0.167-nm wavelength. If such 
electrons are passed through a double slit and have their first 
maximum at an angle of 25.0°, what is the slit separation d? 


Solution: 


0.395 nm 


Exercise: 


Problem: Integrated Concepts 


An electron microscope produces electrons with a 2.00-pm 
wavelength. If these are passed through a 1.00-nm single slit, at what 
angle will the first diffraction minimum be found? 


Exercise: 


Problem: Integrated Concepts 


A certain heat lamp emits 200 W of mostly IR radiation averaging 
1500 nm in wavelength. (a) What is the average photon energy in 
joules? (b) How many of these photons are required to increase the 
temperature of a person’s shoulder by 2.0°C, assuming the affected 
mass is 4.0 kg with a specific heat of 0.83 kcal/kg-°C. Also assume 
no other significant heat transfer. (c) How long does this take? 


Solution: 
Qisxie ds 


(b) 2.1 x 1078 


(c) 1.4 x 107s 


Exercise: 


Problem: Integrated Concepts 


On its high power setting, a microwave oven produces 900 W of 2560 
MHz microwaves. (a) How many photons per second is this? (b) How 
many photons are required to increase the temperature of a 0.500-kg 
mass of pasta by 45.0°C, assuming a specific heat of 

0.900 kcal/kg - °C? Neglect all other heat transfer. (c) How long must 
the microwave operator wait for their pasta to be ready? 


Exercise: 
Problem: Integrated Concepts 
(a) Calculate the amount of microwave energy in joules needed to raise 
the temperature of 1.00 kg of soup from 20.0°C to 100°C. (b) What is 
the total momentum of all the microwave photons it takes to do this? 
(c) Calculate the velocity of a 1.00-kg mass with the same momentum. 
(d) What is the kinetic energy of this mass? 
Solution: 
(a) 3.35 x 10° J 
(b) 1.12 x 10° kg- m/s 
(c) 1.12 x 10° m/s 
(d) 6.23 x 10°77 J 


Exercise: 


Problem: Integrated Concepts 


(a) What is y for an electron emerging from the Stanford Linear 
Accelerator with a total energy of 50.0 GeV? (b) Find its momentum. 
(c) What is the electron’s wavelength? 


Exercise: 
Problem: Integrated Concepts 
(a) What is y for a proton having an energy of 1.00 TeV, produced by 
the Fermilab accelerator? (b) Find its momentum. (c) What is the 
proton’s wavelength? 
Solution: 
(a) 1.06 x 10° 
(b) 5.33 x 10°16 kg - m/s 
(2) 12245610" m 
Exercise: 
Problem: Integrated Concepts 
An electron microscope passes 1.00-pm-wavelength electrons through 


a circular aperture 2.00 tm in diameter. What is the angle between 
two just-resolvable point sources for this microscope? 


Exercise: 
Problem: Integrated Concepts 
(a) Calculate the velocity of electrons that form the same pattern as 
450-nm light when passed through a double slit. (b) Calculate the 
kinetic energy of each and compare them. (c) Would either be easier to 


generate than the other? Explain. 


Solution: 


(a) 1.62 x 10° m/s 


(b) 4.42 x 10°19 J for photon, 1.19 x 10~*4 J for electron, photon 
energy is 3.71 x 10° times greater 


(c) The light is easier to make because 450-nm light is blue light and 
therefore easy to make. Creating electrons with 7.43 peV of energy 
would not be difficult, but would require a vacuum. 


Exercise: 


Problem: Integrated Concepts 


(a) What is the separation between double slits that produces a second- 
order minimum at 45.0° for 650-nm light? (b) What slit separation is 
needed to produce the same pattern for 1.00-keV protons. 


Solution: 
(a) 2.30 x 10 °m 


(b) 3.20 x 10°12 m 


Exercise: 


Problem: Integrated Concepts 


A laser with a power output of 2.00 mW at a wavelength of 400 nm is 
projected onto calcium metal. (a) How many electrons per second are 
ejected? (b) What power is carried away by the electrons, given that 
the binding energy is 2.71 eV? (c) Calculate the current of ejected 
electrons. (d) If the photoelectric material is electrically insulated and 
acts like a 2.00-pF capacitor, how long will current flow before the 
capacitor voltage stops it? 


Exercise: 


Problem: Integrated Concepts 


One problem with x rays is that they are not sensed. Calculate the 
temperature increase of a researcher exposed in a few seconds to a 
nearly fatal accidental dose of x rays under the following conditions. 
The energy of the x-ray photons is 200 keV, and 4.00 x 1018 of them 
are absorbed per kilogram of tissue, the specific heat of which is 
0.830 kcal/kg - °C. (Note that medical diagnostic x-ray machines 
cannot produce an intensity this great.) 


Solution: 


3.69 x 10°*°C 


Exercise: 


Problem: Integrated Concepts 


A 1.00-fm photon has a wavelength short enough to detect some 
information about nuclei. (a) What is the photon momentum? (b) What 
is its energy in joules and MeV? (c) What is the (relativistic) velocity 
of an electron with the same momentum? (d) Calculate the electron’s 
kinetic energy. 


Exercise: 


Problem: Integrated Concepts 


The momentum of light is exactly reversed when reflected straight 
back from a mirror, assuming negligible recoil of the mirror. Thus the 
change in momentum is twice the photon momentum. Suppose light of 
intensity 1.00 kW/ m’” reflects from a mirror of area 2.00 m?. (a) 
Calculate the energy reflected in 1.00 s. (b) What is the momentum 
imparted to the mirror? (c) Using the most general form of Newton’s 
second law, what is the force on the mirror? (d) Does the assumption 
of no mirror recoil seem reasonable? 


Solution: 


(a) 2.00 kJ 


(b) 1.33 x 10-° kg - m/s 
(c).1:33 107° N 


(d) yes 
Exercise: 


Problem: Integrated Concepts 


Sunlight above the Earth’s atmosphere has an intensity of 

1.30 kW/ m2. If this is reflected straight back from a mirror that has 
only a small recoil, the light’s momentum is exactly reversed, giving 
the mirror twice the incident momentum. (a) Calculate the force per 
square meter of mirror. (b) Very low mass mirrors can be constructed 
in the near weightlessness of space, and attached to a spaceship to sail 
it. Once done, the average mass per square meter of the spaceship is 
0.100 kg. Find the acceleration of the spaceship if all other forces are 
balanced. (c) How fast is it moving 24 hours later? 
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This galaxy is 
ejecting huge jets of 
matter, powered by 

an immensely 
massive black hole 
at its center. (credit: 

X-ray: 
NASA/CXC/CfA/R. 
Kraft et al.) 


Frontiers are exciting. There is mystery, surprise, adventure, and discovery. 
The satisfaction of finding the answer to a question is made keener by the 
fact that the answer always leads to a new question. The picture of nature 
becomes more complete, yet nature retains its sense of mystery and never 
loses its ability to awe us. The view of physics is beautiful looking both 
backward and forward in time. What marvelous patterns we have 
discovered. How clever nature seems in its rules and connections. How 
awesome. And we continue looking ever deeper and ever further, probing 


the basic structure of matter, energy, space, and time and wondering about 
the scope of the universe, its beginnings and future. 


You are now in a wonderful position to explore the forefronts of physics, 
both the new discoveries and the unanswered questions. With the concepts, 
qualitative and quantitative, the problem-solving skills, the feeling for 
connections among topics, and all the rest you have mastered, you can more 
deeply appreciate and enjoy the brief treatments that follow. Years from 
now you will still enjoy the quest with an insight all the greater for your 
efforts. 


Cosmology and Particle Physics 


e Discuss the expansion of the universe. 
e Explain the Big Bang. 


Look at the sky on some clear night when you are away from city lights. 
There you will see thousands of individual stars and a faint glowing 
background of millions more. The Milky Way, as it has been called since 
ancient times, is an arm of our galaxy of stars—the word galaxy coming 
from the Greek word galaxias, meaning milky. We know a great deal about 
our Milky Way galaxy and of the billions of other galaxies beyond its 
fringes. But they still provoke wonder and awe (see [link]). And there are 
still many questions to be answered. Most remarkable when we view the 
universe on the large scale is that once again explanations of its character 
and evolution are tied to the very small scale. Particle physics and the 
questions being asked about the very small scales may also have their 
answers in the very large scales. 


Take a moment to contemplate 
these clusters of galaxies, 
photographed by the Hubble 
Space Telescope. Trillions of 
stars linked by gravity in 
fantastic forms, glowing with 
light and showing evidence of 
undiscovered matter. What are 
they like, these myriad stars? 
How did they evolve? What can 


they tell us of matter, energy, 
space, and time? (credit: NASA, 
ESA, K. Sharon (Tel Aviv 
University) and E. Ofek 
(Caltech)) 


As has been noted in numerous Things Great and Small vignettes, this is 
not the first time the large has been explained by the small and vice versa. 
Newton realized that the nature of gravity on Earth that pulls an apple to the 
ground could explain the motion of the moon and planets so much farther 
away. Minute atoms and molecules explain the chemistry of substances on a 
much larger scale. Decays of tiny nuclei explain the hot interior of the 
Earth. Fusion of nuclei likewise explains the energy of stars. Today, the 
patterns in particle physics seem to be explaining the evolution and 
character of the universe. And the nature of the universe has implications 
for unexplored regions of particle physics. 


Cosmology is the study of the character and evolution of the universe. 
What are the major characteristics of the universe as we know them today? 
First, there are approximately 101! galaxies in the observable part of the 
universe. An average galaxy contains more than 101! stars, with our Milky 
Way galaxy being larger than average, both in its number of stars and its 
dimensions. Ours is a spiral-shaped galaxy with a diameter of about 
100,000 light years and a thickness of about 2000 light years in the arms 
with a central bulge about 10,000 light years across. The Sun lies about 
30,000 light years from the center near the galactic plane. There are 
significant clouds of gas, and there is a halo of less-dense regions of stars 
surrounding the main body. (See [link].) Evidence strongly suggests the 
existence of a large amount of additional matter in galaxies that does not 
produce light—the mysterious dark matter we shall later discuss. 


(c) 


The Milky Way galaxy is 
typical of large spiral 
galaxies in its size, its 

shape, and the presence 
of gas and dust. We are 
fortunate to be ina 
location where we can see 
out of the galaxy and 
observe the vastly larger 
and fascinating universe 


around us. (a) Side view. 
(b) View from above. (c) 
The Milky Way as seen 
from Earth. (credits: (a) 
NASA, (b) Nick Risinger, 
(c) Andy) 


Distances are great even within our galaxy and are measured in light years 
(the distance traveled by light in one year). The average distance between 
galaxies is on the order of a million light years, but it varies greatly with 
galaxies forming clusters such as shown in [link]. The Magellanic Clouds, 
for example, are small galaxies close to our own, some 160,000 light years 
from Earth. The Andromeda galaxy is a large spiral galaxy like ours and 
lies 2 million light years away. It is just visible to the naked eye as an 
extended glow in the Andromeda constellation. Andromeda is the closest 
large galaxy in our local group, and we can see some individual stars in it 
with our larger telescopes. The most distant known galaxy is 14 billion light 
years from Earth—a truly incredible distance. (See [link].) 


(a) Andromeda is the 
closest large galaxy, at 2 
million light years 
distance, and is very 
similar to our Milky Way. 
The blue regions harbor 
young and emerging 
stars, while dark streaks 
are vast clouds of gas and 
dust. A smaller satellite 
galaxy is clearly visible. 
(b) The box indicates 
what may be the most 
distant known galaxy, 
estimated to be 13 billion 
light years from us. It 
exists in a much older 
part of the universe. 
(credit: NASA, ESA, G. 


Illingworth (University of 
California, Santa Cruz), 
R. Bouwens (University 
of California, Santa Cruz 
and Leiden University), 
and the HUDF09 Team) 


Consider the fact that the light we receive from these vast distances has 
been on its way to us for a long time. In fact, the time in years is the same 
as the distance in light years. For example, the Andromeda galaxy is 2 
million light years away, so that the light now reaching us left it 2 million 
years ago. If we could be there now, Andromeda would be different. 
Similarly, light from the most distant galaxy left it 14 billion years ago. We 
have an incredible view of the past when looking great distances. We can 
try to see if the universe was different then—if distant galaxies are more 
tightly packed or have younger-looking stars, for example, than closer 
galaxies, in which case there has been an evolution in time. But the problem 
is that the uncertainties in our data are great. Cosmology is almost typified 
by these large uncertainties, so that we must be especially cautious in 
drawing conclusions. One consequence is that there are more questions than 
answers, and so there are many competing theories. Another consequence is 
that any hard data produce a major result. Discoveries of some importance 
are being made on a regular basis, the hallmark of a field in its golden age. 


Perhaps the most important characteristic of the universe is that all galaxies 
except those in our local cluster seem to be moving away from us at speeds 
proportional to their distance from our galaxy. It looks as if a gigantic 
explosion, universally called the Big Bang, threw matter out some billions 
of years ago. This amazing conclusion is based on the pioneering work of 
Edwin Hubble (1889-1953), the American astronomer. In the 1920s, 
Hubble first demonstrated conclusively that other galaxies, many previously 
called nebulae or clouds of stars, were outside our own. He then found that 
all but the closest galaxies have a red shift in their hydrogen spectra that is 
proportional to their distance. The explanation is that there is a 
cosmological red shift due to the expansion of space itself. The photon 


wavelength is stretched in transit from the source to the observer. Double 
the distance, and the red shift is doubled. While this cosmological red shift 
is often called a Doppler shift, it is not—space itself is expanding. There is 
no center of expansion in the universe. All observers see themselves as 
stationary; the other objects in space appear to be moving away from them. 
Hubble was directly responsible for discovering that the universe was much 
larger than had previously been imagined and that it had this amazing 
characteristic of rapid expansion. 


Universal expansion on the scale of galactic clusters (that is, galaxies at 
smaller distances are not uniformly receding from one another) is an 
integral part of modern cosmology. For galaxies farther away than about 50 
Mly (50 million light years), the expansion is uniform with variations due to 
local motions of galaxies within clusters. A representative recession 
velocity v can be obtained from the simple formula 

Equation: 


v= Ad, 


where d is the distance to the galaxy and Ho is the Hubble constant. The 
Hubble constant is a central concept in cosmology. Its value is determined 
by taking the slope of a graph of velocity versus distance, obtained from red 
shift measurements, such as shown in [link]. We shall use an approximate 
value of Hp = 20 km/s- Mly. Thus, v = Hod is an average behavior for 
all but the closest galaxies. For example, a galaxy 100 Mly away (as 
determined by its size and brightness) typically moves away from us at a 
speed of v = (20 km/s- Mly)(100 Mly) = 2000 km/s. There can be 
variations in this speed due to so-called local motions or interactions with 
neighboring galaxies. Conversely, if a galaxy is found to be moving away 
from us at speed of 100,000 km/s based on its red shift, it is at a distance 


d = v/Hp = (10,000 km/s) /(20 km/s- Mly) = 5000 Mly = 5 Gly or 
5 x 10° ly. This last calculation is approximate, because it assumes the 
expansion rate was the same 5 billion years ago as now. A similar 
calculation in Hubble’s measurement changed the notion that the universe is 
in a Steady state. 


Red 
shift 


Me Distance (d) 


This graph of red shift 
versus distance for 
galaxies shows a linear 
relationship, with larger 
red shifts at greater 
distances, implying an 
expanding universe. The 
slope gives an 
approximate value for the 
expansion rate. (credit: 
John Cub). 


One of the most intriguing developments recently has been the discovery 
that the expansion of the universe may be faster now than in the past, rather 
than slowing due to gravity as expected. Various groups have been looking, 
in particular, at supernovas in moderately distant galaxies (less than 1 Gly) 
to get improved distance measurements. Those distances are larger than 
expected for the observed galactic red shifts, implying the expansion was 
slower when that light was emitted. This has cosmological consequences 
that are discussed in Dark Matter and Closure. The first results, published in 
1999, are only the beginning of emerging data, with astronomy now 
entering a data-rich era. 


[link] shows how the recession of galaxies looks like the remnants of a 
gigantic explosion, the famous Big Bang. Extrapolating backward in time, 


the Big Bang would have occurred between 13 and 15 billion years ago 
when all matter would have been at a point. Questions instantly arise. What 
caused the explosion? What happened before the Big Bang? Was there a 
before, or did time start then? Will the universe expand forever, or will 
gravity reverse it into a Big Crunch? And is there other evidence of the Big 
Bang besides the well-documented red shifts? 


Se : Rises 


Galaxies are flying 
apart from one 
another, with the 
more distant 
moving faster as if 
a primordial 
explosion expelled 
the matter from 
which they formed. 
The most distant 
known galaxies 
move nearly at the 
speed of light 
relative to us. 


The Russian-born American physicist George Gamow (1904—1968) was 
among the first to note that, if there was a Big Bang, the remnants of the 
primordial fireball should still be evident and should be blackbody 
radiation. Since the radiation from this fireball has been traveling to us 


since shortly after the Big Bang, its wavelengths should be greatly 
stretched. It will look as if the fireball has cooled in the billions of years 
since the Big Bang. Gamow and collaborators predicted in the late 1940s 
that there should be blackbody radiation from the explosion filling space 
with a characteristic temperature of about 7 K. Such blackbody radiation 
would have its peak intensity in the microwave part of the spectrum. (See 
[link].) In 1964, Arno Penzias and Robert Wilson, two American scientists 
working with Bell Telephone Laboratories on a low-noise radio antenna, 
detected the radiation and eventually recognized it for what it is. 


[link](b) shows the spectrum of this microwave radiation that permeates 
space and is of cosmic origin. It is the most perfect blackbody spectrum 
known, and the temperature of the fireball remnant is determined from it to 
be 2.725 + 0. 002 K. The detection of what is now called the cosmic 
microwave background (CMBR) was so important (generally considered 
as important as Hubble’s detection that the galactic red shift is proportional 
to distance) that virtually every scientist has accepted the expansion of the 
universe as fact. Penzias and Wilson shared the 1978 Nobel Prize in Physics 
for their discovery. 


Blackbody, T = 2.725 K 


0.5 1 2 5 10 2(mm) 


(a) The Big Bang is used to 
explain the present observed 
expansion of the universe. It 
was an incredibly energetic 
explosion some 10 to 20 
billion years ago. After 
expanding and cooling, 
galaxies form inside the 
now-cold remnants of the 
primordial fireball. (b) The 
spectrum of cosmic 
microwave radiation is the 
most perfect blackbody 


spectrum ever detected. It is 
characteristic of a 
temperature of 2.725 K, the 
expansion-cooled 
temperature of the Big 
Bang’s remnant. This 
radiation can be measured 
coming from any direction in 
space not obscured by some 
other source. It is compelling 
evidence of the creation of 
the universe in a gigantic 
explosion, already indicated 
by galactic red shifts. 


Note: 

Making Connections: Cosmology and Particle Physics 

There are many connections of cosmology—by definition involving 
physics on the largest scale—with particle physics—by definition physics 
on the smallest scale. Among these are the dominance of matter over 
antimatter, the nearly perfect uniformity of the cosmic microwave 
background, and the mere existence of galaxies. 


Matter versus antimatter 

We know from direct observation that antimatter is rare. The Earth and the 
solar system are nearly pure matter. Space probes and cosmic rays give 
direct evidence—the landing of the Viking probes on Mars would have 
been spectacular explosions of mutual annihilation energy if Mars were 
antimatter. We also know that most of the universe is dominated by matter. 
This is proven by the lack of annihilation radiation coming to us from 
space, particularly the relative absence of 0.511-MeV y rays created by the 


mutual annihilation of electrons and positrons. It seemed possible that there 
could be entire solar systems or galaxies made of antimatter in perfect 
symmetry with our matter-dominated systems. But the interactions between 
stars and galaxies would sometimes bring matter and antimatter together in 
large amounts. The annihilation radiation they would produce is simply not 
observed. Antimatter in nature is created in particle collisions and in 8* 
decays, but only in small amounts that quickly annihilate, leaving almost 
pure matter surviving. 


Particle physics seems symmetric in matter and antimatter. Why isn’t the 
cosmos? The answer is that particle physics is not quite perfectly symmetric 
in this regard. The decay of one of the neutral K-mesons, for example, 
preferentially creates more matter than antimatter. This is caused by a 
fundamental small asymmetry in the basic forces. This small asymmetry 
produced slightly more matter than antimatter in the early universe. If there 
was only one part in 10? more matter (a small asymmetry), the rest would 
annihilate pair for pair, leaving nearly pure matter to form the stars and 
galaxies we see today. So the vast number of stars we observe may be only 
a tiny remnant of the original matter created in the Big Bang. Here at last 
we see a very real and important asymmetry in nature. Rather than be 
disturbed by an asymmetry, most physicists are impressed by how small it 
is. Furthermore, if the universe were completely symmetric, the mutual 
annihilation would be more complete, leaving far less matter to form us and 
the universe we know. 


How can something so old have so few wrinkles? 

A troubling aspect of cosmic microwave background radiation (CMBR) 
was soon recognized. True, the CMBR verified the Big Bang, had the 
correct temperature, and had a blackbody spectrum as expected. But the 
CMBR was too smooth—it looked identical in every direction. Galaxies 
and other similar entities could not be formed without the existence of 
fluctuations in the primordial stages of the universe and so there should be 
hot and cool spots in the CMBR, nicknamed wrinkles, corresponding to 
dense and sparse regions of gas caused by turbulence or early fluctuations. 
Over time, dense regions would contract under gravity and form stars and 
galaxies. Why aren’t the fluctuations there? (This is a good example of an 
answer producing more questions.) Furthermore, galaxies are observed very 


far from us, so that they formed very long ago. The problem was to explain 
how galaxies could form so early and so quickly after the Big Bang if its 
remnant fingerprint is perfectly smooth. The answer is that if you look very 
closely, the CMBR is not perfectly smooth, only extremely smooth. 


A satellite called the Cosmic Background Explorer (COBE) carried an 
instrument that made very sensitive and accurate measurements of the 
CMBR. In April of 1992, there was extraordinary publicity of COBE’s first 
results—there were small fluctuations in the CMBR. Further measurements 
were carried out by experiments including NASA’s Wilkinson Microwave 
Anisotropy Probe (WMAP), which launched in 2001. Data from WMAP 
provided a much more detailed picture of the CMBR fluctuations. (See 
[link].) These amount to temperature fluctuations of only 200 yk out of 2.7 
K, better than one part in 1000. The WMAP experiment will be followed up 
by the European Space Agency’s Planck Surveyor, which launched in 2009. 


This map of the sky uses 
color to show 
fluctuations, or wrinkles, 
in the cosmic microwave 
background observed 
with the WMAP 
spacecraft. The Milky 
Way has been removed 
for clarity. Red represents 
higher temperature and 
higher density, while blue 
is lower temperature and 
density. The fluctuations 
are small, less than one 
part in 1000, but these are 
still thought to be the 


cause of the eventual 
formation of galaxies. 
(credit: NASA/WMAP 
Science Team) 


Let us now examine the various stages of the overall evolution of the 
universe from the Big Bang to the present, illustrated in [link]. Note that 
scientific notation is used to encompass the many orders of magnitude in 
time, energy, temperature, and size of the universe. Going back in time, the 
two lines approach but do not cross (there is no zero on an exponential 
scale). Rather, they extend indefinitely in ever-smaller time intervals to 
some infinitesimal point. 
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The evolution of the universe from the Big Bang onward is intimately 
tied to the laws of physics, especially those of particle physics at the 
earliest stages. The universe is relativistic throughout its history. 
Theories of the unification of forces at high energies may be verified 
by their shaping of the universe and its evolution. 


Going back in time is equivalent to what would happen if expansion 
stopped and gravity pulled all the galaxies together, compressing and 
heating all matter. At a time long ago, the temperature and density were too 
high for stars and galaxies to exist. Before then, there was a time when the 
temperature was too great for atoms to exist. And farther back yet, there 
was a time when the temperature and density were so great that nuclei could 
not exist. Even farther back in time, the temperature was so high that 
average kinetic energy was great enough to create short-lived particles, and 
the density was high enough to make this likely. When we extrapolate back 
to the point of W* and Z° production (thermal energies reaching 1 TeV, or 
a temperature of about 101° K), we reach the limits of what we know 
directly about particle physics. This is at a time about 10? s after the Big 
Bang. While 10-1? s may seem to be negligibly close to the instant of 
creation, it is not. There are important stages before this time that are tied to 
the unification of forces. At those stages, the universe was at extremely 
high energies and average particle separations were smaller than we can 
achieve with accelerators. What happened in the early stages before 10°!” s 
is crucial to all later stages and is possibly discerned by observing present 
conditions in the universe. One of these is the smoothness of the CMBR. 


Names are given to early stages representing key conditions. The stage 
before 10~*! s back to 10°-** s is called the electroweak epoch, because 
the electromagnetic and weak forces become identical for energies above 
about 100 GeV. As discussed earlier, theorists expect that the strong force 
becomes identical to and thus unified with the electroweak force at energies 
of about 10! GeV. The average particle energy would be this great at 
10 ** s after the Big Bang, if there are no surprises in the unknown physics 
at energies above about 1 TeV. At the immense energy of 10'* GeV 
(corresponding to a temperature of about 107° K), the W* and Z° carrier 
particles would be transformed into massless gauge bosons to accomplish 
the unification. Before 10~** s back to about 10“ s, we have Grand 
Unification in the GUT epoch, in which all forces except gravity are 
identical. At 10~*° s, the average energy reaches the immense 10'° GeV 
needed to unify gravity with the other forces in TOE, the Theory of 
Everything. Before that time is the TOE epoch, but we have almost no idea 


as to the nature of the universe then, since we have no workable theory of 
quantum gravity. We call the hypothetical unified force superforce. 


Now let us imagine starting at TOE and moving forward in time to see what 
type of universe is created from various events along the way. As 
temperatures and average energies decrease with expansion, the universe 
reaches the stage where average particle separations are large enough to see 
differences between the strong and electroweak forces (at about 10-*° s). 
After this time, the forces become distinct in almost all interactions—they 
are no longer unified or symmetric. This transition from GUT to 
electroweak is an example of spontaneous symmetry breaking, in which 
conditions spontaneously evolved to a point where the forces were no 
longer unified, breaking that symmetry. This is analogous to a phase 
transition in the universe, and a clever proposal by American physicist Alan 
Guth in the early 1980s ties it to the smoothness of the CMBR. Guth 
proposed that spontaneous symmetry breaking (like a phase transition 
during cooling of normal matter) released an immense amount of energy 
that caused the universe to expand extremely rapidly for the brief time from 
10~*° s to about 10° *” s. This expansion may have been by an incredible 
factor of 10°° or more in the size of the universe and is thus called the 
inflationary scenario. One result of this inflation is that it would stretch the 
wrinkles in the universe nearly flat, leaving an extremely smooth CMBR. 
While speculative, there is as yet no other plausible explanation for the 
smoothness of the CMBR. Unless the CMBR is not really cosmic but local 
in origin, the distances between regions of similar temperatures are too 
great for any coordination to have caused them, since any coordination 
mechanism must travel at the speed of light. Again, particle physics and 
cosmology are intimately entwined. There is little hope that we may be able 
to test the inflationary scenario directly, since it occurs at energies near 
1014 GeV, vastly greater than the limits of modern accelerators. But the 
idea is so attractive that it is incorporated into most cosmological theories. 


Characteristics of the present universe may help us determine the validity of 
this intriguing idea. Additionally, the recent indications that the universe’s 
expansion rate may be increasing (see Dark Matter and Closure) could even 
imply that we are in another inflationary epoch. 


It is important to note that, if conditions such as those found in the early 
universe could be created in the laboratory, we would see the unification of 
forces directly today. The forces have not changed in time, but the average 
energy and separation of particles in the universe have. As discussed in The 
Four Basic Forces, the four basic forces in nature are distinct under most 
circumstances found today. The early universe and its remnants provide 
evidence from times when they were unified under most circumstances. 


Section Summary 


¢ Cosmology is the study of the character and evolution of the universe. 

e The two most important features of the universe are the cosmological 
red shifts of its galaxies being proportional to distance and its cosmic 
microwave background (CMBR). Both support the notion that there 
was a gigantic explosion, known as the Big Bang that created the 
universe. 

e Galaxies farther away than our local group have, on an average, a 
recessional velocity given by 
Equation: 


v= Aod, 


where d is the distance to the galaxy and Ho is the Hubble constant, 
taken to have the average value Hp = 20 km/s -Mly. 

e Explanations of the large-scale characteristics of the universe are 
intimately tied to particle physics. 

¢ The dominance of matter over antimatter and the smoothness of the 
CMBR are two characteristics that are tied to particle physics. 

e The epochs of the universe are known back to very shortly after the 
Big Bang, based on known laws of physics. 

e The earliest epochs are tied to the unification of forces, with the 
electroweak epoch being partially understood, the GUT epoch being 
speculative, and the TOE epoch being highly speculative since it 
involves an unknown single superforce. 

e The transition from GUT to electroweak is called spontaneous 
symmetry breaking. It released energy that caused the inflationary 


scenario, which in turn explains the smoothness of the CMBR. 


Conceptual Questions 


Exercise: 
Problem: 
Explain why it only appears that we are at the center of expansion of 


the universe and why an observer in another galaxy would see the 
same relative motion of all but the closest galaxies away from her. 


Exercise: 
Problem: 
If there is no observable edge to the universe, can we determine where 
its center of expansion is? Explain. 


Exercise: 


Problem: If the universe is infinite, does it have a center? Discuss. 
Exercise: 

Problem: 

Another known cause of red shift in light is the source being in a high 

gravitational field. Discuss how this can be eliminated as the source of 


galactic red shifts, given that the shifts are proportional to distance and 
not to the size of the galaxy. 


Exercise: 
Problem: 
If some unknown cause of red shift—such as light becoming “tired” 


from traveling long distances through empty space—is discovered, 
what effect would there be on cosmology? 


Exercise: 


Problem: 


Olbers’s paradox poses an interesting question: If the universe is 
infinite, then any line of sight should eventually fall on a star’s surface. 
Why then is the sky dark at night? Discuss the commonly accepted 
evolution of the universe as a solution to this paradox. 


Exercise: 


Problem: 


If the cosmic microwave background radiation (CMBR) is the remnant 
of the Big Bang’s fireball, we expect to see hot and cold regions in it. 
What are two causes of these wrinkles in the CMBR? Are the observed 
temperature variations greater or less than originally expected? 


Exercise: 


Problem: 


The decay of one type of K-meson is cited as evidence that nature 
favors matter over antimatter. Since mesons are composed of a quark 
and an antiquark, is it surprising that they would preferentially decay 
to one type over another? Is this an asymmetry in nature? Is the 
predominance of matter over antimatter an asymmetry? 


Exercise: 
Problem: 
Distances to local galaxies are determined by measuring the brightness 
of stars, called Cepheid variables, that can be observed individually 
and that have absolute brightnesses at a standard distance that are well 


known. Explain how the measured brightness would vary with 
distance as compared with the absolute brightness. 


Exercise: 


Problem: 


Distances to very remote galaxies are estimated based on their 
apparent type, which indicate the number of stars in the galaxy, and 
their measured brightness. Explain how the measured brightness would 
vary with distance. Would there be any correction necessary to 
compensate for the red shift of the galaxy (all distant galaxies have 
significant red shifts)? Discuss possible causes of uncertainties in these 
measurements. 


Exercise: 


Problem: 
If the smallest meaningful time interval is greater than zero, will the 
lines in [link] ever meet? 

Problems & Exercises 


Exercise: 


Problem: 


Find the approximate mass of the luminous matter in the Milky Way 
galaxy, given it has approximately 101! stars of average mass 1.5 times 
that of our Sun. 


Solution: 


3 x 107! kg 


Exercise: 


Problem: 


Find the approximate mass of the dark and luminous matter in the 
Milky Way galaxy. Assume the luminous matter is due to 
approximately 10" stars of average mass 1.5 times that of our Sun, 
and take the dark matter to be 10 times as massive as the luminous 
matter. 


Exercise: 
Problem: 
(a) Estimate the mass of the luminous matter in the known universe, 
given there are 101! galaxies, each containing 10" stars of average 
mass 1.5 times that of our Sun. (b) How many protons (the most 
abundant nuclide) are there in this mass? (c) Estimate the total number 
of particles in the observable universe by multiplying the answer to (b) 
by two, since there is an electron for each proton, and then by 10°, 


since there are far more particles (such as photons and neutrinos) in 
space than in luminous matter. 


Solution: 
(a) 3 x 10°? kg 
(b)2 x10” 


(c) 4 x 1088 
Exercise: 
Problem: 
If a galaxy is 500 Mly away from us, how fast do we expect it to be 
moving and in what direction? 


Exercise: 


Problem: 


On average, how far away are galaxies that are moving away from us 
at 2.0% of the speed of light? 


Solution: 


0.30 Gly 

Exercise: 
Problem: 
Our solar system orbits the center of the Milky Way galaxy. Assuming 
a circular orbit 30,000 ly in radius and an orbital speed of 250 km/s, 
how many years does it take for one revolution? Note that this is 
approximate, assuming constant speed and circular orbit, but it is 


representative of the time for our system and local stars to make one 
revolution around the galaxy. 


Exercise: 


Problem: 


(a) What is the approximate speed relative to us of a galaxy near the 
edge of the known universe, some 10 Gly away? (b) What fraction of 
the speed of light is this? Note that we have observed galaxies moving 
away from us at greater than 0.9c. 


Solution: 
(a) 2.0 x 10° km/s 


(b) 0.67c 


Exercise: 


Problem: 


(a) Calculate the approximate age of the universe from the average 
value of the Hubble constant, Hy = 20 km/s -Mly. To do this, 
calculate the time it would take to travel 1 Mly at a constant expansion 
rate of 20 km/s. (b) If deceleration is taken into account, would the 
actual age of the universe be greater or less than that found here? 
Explain. 


Exercise: 
Problem: 
Assuming a circular orbit for the Sun about the center of the Milky 
Way galaxy, calculate its orbital speed using the following 
information: The mass of the galaxy is equivalent to a single mass 


1.5 x 10" times that of the Sun (or 3 x 10*! kg), located 30,000 ly 
away. 


Solution: 


2.7 x 10° m/s 

Exercise: 
Problem: 
(a) What is the approximate force of gravity on a 70-kg person due to 
the Andromeda galaxy, assuming its total mass is 10’° that of our Sun 
and acts like a single mass 2 Mly away? (b) What is the ratio of this 


force to the person’s weight? Note that Andromeda is the closest large 
galaxy. 


Exercise: 
Problem: 
Andromeda galaxy is the closest large galaxy and is visible to the 


naked eye. Estimate its brightness relative to the Sun, assuming it has 
luminosity 10’? times that of the Sun and lies 2 Mly away. 


Solution: 
6 x 10-1! (an overestimate, since some of the light from Andromeda 
is blocked by gas and dust within that galaxy) 

Exercise: 


Problem: 


(a) A particle and its antiparticle are at rest relative to an observer and 
annihilate (completely destroying both masses), creating two ¥ rays of 
equal energy. What is the characteristic y-ray energy you would look 
for if searching for evidence of proton-antiproton annihilation? (The 
fact that such radiation is rarely observed is evidence that there is very 
little antimatter in the universe.) (b) How does this compare with the 
0.511-MeV energy associated with electron-positron annihilation? 


Exercise: 
Problem: 
The average particle energy needed to observe unification of forces is 
estimated to be 10!9 GeV. (a) What is the rest mass in kilograms of a 


particle that has a rest mass of 10'° GeV /c?? (b) How many times the 
mass of a hydrogen atom is this? 


Solution: 
(a2 10> ke 


(b) 1 x 10 


Exercise: 


Problem: 


The peak intensity of the CMBR occurs at a wavelength of 1.1 mm. (a) 
What is the energy in eV of a 1.1-mm photon? (b) There are 
approximately 10° photons for each massive particle in deep space. 
Calculate the energy of 10° such photons. (c) If the average massive 
particle in space has a mass half that of a proton, what energy would 
be created by converting its mass to energy? (d) Does this imply that 
space is “matter dominated”? Explain briefly. 


Exercise: 
Problem: 
(a) What Hubble constant corresponds to an approximate age of the 
universe of 102° y? To get an approximate value, assume the 
expansion rate is constant and calculate the speed at which two 
galaxies must move apart to be separated by 1 Mly (present average 


galactic separation) in a time of 10° y. (b) Similarly, what Hubble 
constant corresponds to a universe approximately 2 x 10°-y old? 


Solution: 
(a) 30 km/s - Mly 


(b) 15 km/s - Mly 
Exercise: 


Problem: 


Show that the velocity of a star orbiting its galaxy in a circular orbit is 
inversely proportional to the square root of its orbital radius, assuming 
the mass of the stars inside its orbit acts like a single mass at the center 
of the galaxy. You may use an equation from a previous chapter to 
support your conclusion, but you must justify its use and define all 
terms used. 


Exercise: 


Problem: 


The core of a star collapses during a supernova, forming a neutron star. 
Angular momentum of the core is conserved, and so the neutron star 
spins rapidly. If the initial core radius is 5.0 x 10° km and it collapses 
to 10.0 km, find the neutron star’s angular velocity in revolutions per 
second, given the core’s angular velocity was originally 1 revolution 
per 30.0 days. 


Solution: 


960 rev/s 
Exercise: 
Problem: 
Using data from the previous problem, find the increase in rotational 


kinetic energy, given the core’s mass is 1.3 times that of our Sun. 
Where does this increase in kinetic energy come from? 


Exercise: 
Problem: 
Distances to the nearest stars (up to 500 ly away) can be measured by a 
technique called parallax, as shown in [link]. What are the angles 0, 


and @» relative to the plane of the Earth’s orbit for a star 4.0 ly directly 
above the Sun? 


Solution: 


89.999773° (many digits are used to show the difference between 90°) 


Exercise: 


Problem: 


(a) Use the Heisenberg uncertainty principle to calculate the 
uncertainty in energy for a corresponding time interval of 10~*° s. (b) 
Compare this energy with the 101° GeV unification-of-forces energy 
and discuss why they are similar. 


Exercise: 


Problem: Construct Your Own Problem 


Consider a star moving in a circular orbit at the edge of a galaxy. 
Construct a problem in which you calculate the mass of that galaxy in 
kg and in multiples of the solar mass based on the velocity of the star 


and its distance from the center of the galaxy. 
/, Star 


| «——— Diameter of orbit ———» | 


Distances to nearby 

Stars are measured 

using triangulation, 
also called the parallax 
method. The angle of 


line of sight to the star 
is measured at 
intervals six months 
apart, and the distance 
is calculated by using 
the known diameter of 
the Earth’s orbit. This 
can be done for stars 


up to about 500 ly 
away. 
Glossary 
Big Bang 


a gigantic explosion that threw out matter a few billion years ago 


cosmic microwave background 
the spectrum of microwave radiation of cosmic origin 


cosmological red shift 
the photon wavelength is stretched in transit from the source to the 
observer because of the expansion of space itself 


cosmology 
the study of the character and evolution of the universe 


electroweak epoch 
the stage before 10"! back to 10 ** after the Big Bang 


GUT epoch 
the time period from 10°*° to 10 *4 after the Big Bang, when Grand 
Unification Theory, in which all forces except gravity are identical, 
governed the universe 


Hubble constant 


a central concept in cosmology whose value is determined by taking 
the slope of a graph of velocity versus distance, obtained from red shift 
measurements 


inflationary scenario 
the rapid expansion of the universe by an incredible factor of 10°°° for 
the brief time from 10~*° to about 10° *2s 


spontaneous symmetry breaking 
the transition from GUT to electroweak where the forces were no 
longer unified 


superforce 
hypothetical unified force in TOE epoch 


TOE epoch 
before 10°*° after the Big Bang 


General Relativity and Quantum Gravity 


e Explain the effect of gravity on light. 
e Discuss black hole. 
e Explain quantum gravity. 


When we talk of black holes or the unification of forces, we are actually 
discussing aspects of general relativity and quantum gravity. We know from 
Special Relativity that relativity is the study of how different observers 
measure the same event, particularly if they move relative to one another. 
Einstein’s theory of general relativity describes all types of relative motion 
including accelerated motion and the effects of gravity. General relativity 
encompasses special relativity and classical relativity in situations where 
acceleration is zero and relative velocity is small compared with the speed 
of light. Many aspects of general relativity have been verified 
experimentally, some of which are better than science fiction in that they 
are bizarre but true. Quantum gravity is the theory that deals with particle 
exchange of gravitons as the mechanism for the force, and with extreme 
conditions where quantum mechanics and general relativity must both be 
used. A good theory of quantum gravity does not yet exist, but one will be 
needed to understand how all four forces may be unified. If we are 
successful, the theory of quantum gravity will encompass all others, from 
classical physics to relativity to quantum mechanics—truly a Theory of 
Everything (TOE). 


General Relativity 


Einstein first considered the case of no observer acceleration when he 
developed the revolutionary special theory of relativity, publishing his first 
work on it in 1905. By 1916, he had laid the foundation of general 
relativity, again almost on his own. Much of what Einstein did to develop 
his ideas was to mentally analyze certain carefully and clearly defined 
situations—doing this is to perform a thought experiment. [link] illustrates 
a thought experiment like the ones that convinced Einstein that light must 
fall in a gravitational field. Think about what a person feels in an elevator 
that is accelerated upward. It is identical to being in a stationary elevator in 
a gravitational field. The feet of a person are pressed against the floor, and 


objects released from hand fall with identical accelerations. In fact, it is not 
possible, without looking outside, to know what is happening—acceleration 
upward or gravity. This led Einstein to correctly postulate that acceleration 
and gravity will produce identical effects in all situations. So, if acceleration 
affects light, then gravity will, too. [link] shows the effect of acceleration on 
a beam of light shone horizontally at one wall. Since the accelerated 
elevator moves up during the time light travels across the elevator, the beam 
of light strikes low, seeming to the person to bend down. (Normally a tiny 
effect, since the speed of light is so great.) The same effect must occur due 
to gravity, Einstein reasoned, since there is no way to tell the effects of 
gravity acting downward from acceleration of the elevator upward. Thus 
gravity affects the path of light, even though we think of gravity as acting 
between masses and photons are massless. 


Accelerated up 


(a) A beam of light 
emerges from a flashlight 
in an upward-accelerating 


elevator. Since the 
elevator moves up during 
the time the light takes to 
reach the wall, the beam 
strikes lower than it 
would if the elevator were 
not accelerated. (b) 
Gravity has the same 
effect on light, since it is 
not possible to tell 
whether the elevator is 
accelerating upward or 
acted upon by gravity. 


Einstein’s theory of general relativity got its first verification in 1919 when 
starlight passing near the Sun was observed during a solar eclipse. (See 
[link].) During an eclipse, the sky is darkened and we can briefly see stars. 
Those in a line of sight nearest the Sun should have a shift in their apparent 
positions. Not only was this shift observed, but it agreed with Einstein’s 
predictions well within experimental uncertainties. This discovery created a 
scientific and public sensation. Einstein was now a folk hero as well as a 
very great scientist. The bending of light by matter is equivalent to a 
bending of space itself, with light following the curve. This is another 
radical change in our concept of space and time. It is also another 
connection that any particle with mass or energy (massless photons) is 
affected by gravity. 


There are several current forefront efforts related to general relativity. One 
is the observation and analysis of gravitational lensing of light. Another is 
analysis of the definitive proof of the existence of black holes. Direct 
observation of gravitational waves or moving wrinkles in space is being 
searched for. Theoretical efforts are also being aimed at the possibility of 
time travel and wormholes into other parts of space due to black holes. 


Gravitational lensing 


As you can see in [link], light is bent toward a mass, producing an effect 
much like a converging lens (large masses are needed to produce observable 
effects). On a galactic scale, the light from a distant galaxy could be 
“lensed” into several images when passing close by another galaxy on its 
way to Earth. Einstein predicted this effect, but he considered it unlikely 
that we would ever observe it. A number of cases of this effect have now 
been observed; one is shown in [link]. This effect is a much larger scale 
verification of general relativity. But such gravitational lensing is also 
useful in verifying that the red shift is proportional to distance. The red shift 
of the intervening galaxy is always less than that of the one being lensed, 
and each image of the lensed galaxy has the same red shift. This 
verification supplies more evidence that red shift is proportional to distance. 
Confidence that the multiple images are not different objects is bolstered by 
the observations that if one image varies in brightness over time, the others 
also vary in the same manner. 


Apparent 
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Earth-bound 
observer 


This schematic shows how light passing 
near a massive body like the Sun is curved 
toward it. The light that reaches the Earth 

then seems to be coming from different 
locations than the known positions of the 
originating stars. Not only was this effect 

observed, the amount of bending was 
precisely what Einstein predicted in his 
general theory of relativity. 


Distant galaxy oe a Din 
and two images ~*~ 
( a) Earth-bound 


observer 


(b) 


(a) Light from a distant galaxy can travel different paths 
to the Earth because it is bent around an intermediary 
galaxy by gravity. This produces several images of the 
more distant galaxy. (b) The images around the central 

galaxy are produced by gravitational lensing. Each image 
has the same spectrum and a larger red shift than the 
intermediary. (credit: NASA, ESA, and STScI) 


Black holes 

Black holes are objects having such large gravitational fields that things 
can fall in, but nothing, not even light, can escape. Bodies, like the Earth or 
the Sun, have what is called an escape velocity. If an object moves straight 
up from the body, starting at the escape velocity, it will just be able to 
escape the gravity of the body. The greater the acceleration of gravity on the 
body, the greater is the escape velocity. As long ago as the late 1700s, it was 
proposed that if the escape velocity is greater than the speed of light, then 


light cannot escape. Simon Laplace (1749-1827), the French astronomer 
and mathematician, even incorporated this idea of a dark star into his 
writings. But the idea was dropped after Young’s double slit experiment 
showed light to be a wave. For some time, light was thought not to have 
particle characteristics and, thus, could not be acted upon by gravity. The 
idea of a black hole was very quickly reincarnated in 1916 after Einstein’s 
theory of general relativity was published. It is now thought that black holes 
can form in the supernova collapse of a massive star, forming an object 
perhaps 10 km across and having a mass greater than that of our Sun. It is 
interesting that several prominent physicists who worked on the concept, 
including Einstein, firmly believed that nature would find a way to prohibit 
such objects. 


Black holes are difficult to observe directly, because they are small and no 
light comes directly from them. In fact, no light comes from inside the 
event horizon, which is defined to be at a distance from the object at which 
the escape velocity is exactly the speed of light. The radius of the event 
horizon is known as the Schwarzschild radius Fg and is given by 
Equation: 


2GM 
Bese, 


where G is the universal gravitational constant, M is the mass of the body, 
and c is the speed of light. The event horizon is the edge of the black hole 
and Rg is its radius (that is, the size of a black hole is twice Rg). Since G is 
small and c? is large, you can see that black holes are extremely small, only 
a few kilometers for masses a little greater than the Sun’s. The object itself 
is inside the event horizon. 


Physics near a black hole is fascinating. Gravity increases so rapidly that, as 
you approach a black hole, the tidal effects tear matter apart, with matter 
closer to the hole being pulled in with much more force than that only 
slightly farther away. This can pull a companion star apart and heat 
inflowing gases to the point of producing X rays. (See [link].) We have 
observed X rays from certain binary star systems that are consistent with 
such a picture. This is not quite proof of black holes, because the X rays 


could also be caused by matter falling onto a neutron star. These objects 
were first discovered in 1967 by the British astrophysicists, Jocelyn Bell 
and Anthony Hewish. Neutron stars are literally a star composed of 
neutrons. They are formed by the collapse of a star’s core in a supernova, 
during which electrons and protons are forced together to form neutrons 
(the reverse of neutron @ decay). Neutron stars are slightly larger than a 
black hole of the same mass and will not collapse further because of 
resistance by the strong force. However, neutron stars cannot have a mass 
greater than about eight solar masses or they must collapse to a black hole. 
With recent improvements in our ability to resolve small details, such as 
with the orbiting Chandra X-ray Observatory, it has become possible to 
measure the masses of X-ray-emitting objects by observing the motion of 
companion stars and other matter in their vicinity. What has emerged is a 
plethora of X-ray-emitting objects too massive to be neutron stars. This 
evidence is considered conclusive and the existence of black holes is widely 
accepted. These black holes are concentrated near galactic centers. 


We also have evidence that supermassive black holes may exist at the cores 
of many galaxies, including the Milky Way. Such a black hole might have a 
mass millions or even billions of times that of the Sun, and it would 
probably have formed when matter first coalesced into a galaxy billions of 
years ago. Supporting this is the fact that very distant galaxies are more 
likely to have abnormally energetic cores. Some of the moderately distant 
galaxies, and hence among the younger, are known as quasars and emit as 
much or more energy than a normal galaxy but from a region less than a 
light year across. Quasar energy outputs may vary in times less than a year, 
so that the energy-emitting region must be less than a light year across. The 
best explanation of quasars is that they are young galaxies with a 
supermassive black hole forming at their core, and that they become less 
energetic over billions of years. In closer superactive galaxies, we observe 
tremendous amounts of energy being emitted from very small regions of 
space, consistent with stars falling into a black hole at the rate of one or 
more a month. The Hubble Space Telescope (1994) observed an accretion 
disk in the galaxy M87 rotating rapidly around a region of extreme energy 
emission. (See [link].) A jet of material being ejected perpendicular to the 
plane of rotation gives further evidence of a supermassive black hole as the 
engine. 


A black hole is shown 
pulling matter away from 
a companion star, 
forming a superheated 
accretion disk where X 
rays are emitted before 
the matter disappears 
forever into the hole. The 
in-fall energy also ejects 
some material, forming 
the two vertical spikes. 
(See also the photograph 
in Introduction to 
Frontiers of Physics.) 
There are several X-ray- 
emitting objects in space 
that are consistent with 
this picture and are likely 
to be black holes. 


Gravitational waves 

If a massive object distorts the space around it, like the foot of a water bug 
on the surface of a pond, then movement of the massive object should 
create waves in space like those on a pond. Gravitational waves are mass- 
created distortions in space that propagate at the speed of light and are 
predicted by general relativity. Since gravity is by far the weakest force, 
extreme conditions are needed to generate significant gravitational waves. 
Gravity near binary neutron star systems is so great that significant 
gravitational wave energy is radiated as the two neutron stars orbit one 


another. American astronomers, Joseph Taylor and Russell Hulse, measured 
changes in the orbit of such a binary neutron star system. They found its 
orbit to change precisely as predicted by general relativity, a strong 
indication of gravitational waves, and were awarded the 1993 Nobel Prize. 
But direct detection of gravitational waves on Earth would be conclusive. 
For many years, various attempts have been made to detect gravitational 
waves by observing vibrations induced in matter distorted by these waves. 
American physicist Joseph Weber pioneered this field in the 1960s, but no 
conclusive events have been observed. (No gravity wave detectors were in 
operation at the time of the 1987A supernova, unfortunately.) There are 
now several ambitious systems of gravitational wave detectors in use 
around the world. These include the LIGO (Laser Interferometer 
Gravitational Wave Observatory) system with two laser interferometer 
detectors, one in the state of Washington and another in Louisiana (See 
[link]) and the VIRGO (Variability of Irradiance and Gravitational 
Oscillations) facility in Italy with a single detector. 


Quantum Gravity 


Black holes radiate 

Quantum gravity is important in those situations where gravity is so 
extremely strong that it has effects on the quantum scale, where the other 
forces are ordinarily much stronger. The early universe was such a place, 
but black holes are another. The first significant connection between gravity 
and quantum effects was made by the Russian physicist Yakov Zel’dovich 
in 1971, and other significant advances followed from the British physicist 
Stephen Hawking. (See [link].) These two showed that black holes could 
radiate away energy by quantum effects just outside the event horizon 
(nothing can escape from inside the event horizon). Black holes are, thus, 
expected to radiate energy and shrink to nothing, although extremely slowly 
for most black holes. The mechanism is the creation of a particle- 
antiparticle pair from energy in the extremely strong gravitational field near 
the event horizon. One member of the pair falls into the hole and the other 
escapes, conserving momentum. (See [link].) When a black hole loses 
energy and, hence, rest mass, its event horizon shrinks, creating an even 
greater gravitational field. This increases the rate of pair production so that 
the process grows exponentially until the black hole is nuclear in size. A 


final burst of particles and 7 rays ensues. This is an extremely slow process 
for black holes about the mass of the Sun (produced by supernovas) or 
larger ones (like those thought to be at galactic centers), taking on the order 
of 10°” years or longer! Smaller black holes would evaporate faster, but 
they are only speculated to exist as remnants of the Big Bang. Searches for 
characteristic ‘y-ray bursts have produced events attributable to more 
mundane objects like neutron stars accreting matter. 


Core of Galaxy NGC 426l 
Hubble Space Telescope 


Wide Field / Planetary Camera 


Ground-Based Optical/Radio Image HST Image of a Gas and Dust Disk 


———— 
380 Arc Seconds 17 Arc Seconds 
88,000 LIGHT-YEARS 400 LIGHT-YEARS 


This Hubble Space Telescope 
photograph shows the extremely 
energetic core of the NGC 4261 

galaxy. With the superior resolution of 
the orbiting telescope, it has been 
possible to observe the rotation of an 
accretion disk around the energy- 
producing object as well as to map 
jets of material being ejected from the 
object. A supermassive black hole is 
consistent with these observations, but 
other possibilities are not quite 
eliminated. (credit: NASA and ESA) 


The control room of the LIGO 
gravitational wave detector. 
Gravitational waves will cause 
extremely small vibrations in a 
mass in this detector, which will 
be detected by laser 
interferometer techniques. Such 
detection in coincidence with 
other detectors and with 
astronomical events, such as 
supernovas, would provide 
direct evidence of gravitational 
waves. (credit: Tobin Fricke) 


Stephen Hawking (b. 1942) has 
made many contributions to the 
theory of quantum gravity. 
Hawking is a long-time survivor 
of ALS and has produced 
popular books on general 
relativity, cosmology, and 
quantum gravity. (credit: Lwp 
Kommunikacio) 
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Gravity and quantum mechanics 


come into play when a black 
hole creates a particle- 
antiparticle pair from the energy 
in its gravitational field. One 
member of the pair falls into the 
hole while the other escapes, 
removing energy and shrinking 
the black hole. The search is on 
for the characteristic energy. 


Wormholes and time travel 

The subject of time travel captures the imagination. Theoretical physicists, 
such as the American Kip Thorne, have treated the subject seriously, 
looking into the possibility that falling into a black hole could result in 
popping up in another time and place—a trip through a so-called wormhole. 
Time travel and wormholes appear in innumerable science fiction 
dramatizations, but the consensus is that time travel is not possible in 
theory. While still debated, it appears that quantum gravity effects inside a 
black hole prevent time travel due to the creation of particle pairs. Direct 
evidence is elusive. 


The shortest time 

Theoretical studies indicate that, at extremely high energies and 
correspondingly early in the universe, quantum fluctuations may make time 
intervals meaningful only down to some finite time limit. Early work 
indicated that this might be the case for times as long as 10 *° s, the time at 
which all forces were unified. If so, then it would be meaningless to 
consider the universe at times earlier than this. Subsequent studies indicate 
that the crucial time may be as short as 10-°° s. But the point remains— 
quantum gravity seems to imply that there is no such thing as a vanishingly 
short time. Time may, in fact, be grainy with no meaning to time intervals 
shorter than some tiny but finite size. 


The future of quantum gravity 


Not only is quantum gravity in its infancy, no one knows how to get started 
on a theory of gravitons and unification of forces. The energies at which 
TOE should be valid may be so high (at least 1019 GeV) and the necessary 
particle separation so small (less than 10 °° m) that only indirect evidence 
can provide clues. For some time, the common lament of theoretical 
physicists was one so familiar to struggling students—how do you even get 
started? But Hawking and others have made a start, and the approach many 
theorists have taken is called Superstring theory, the topic of the 
Superstrings. 


Section Summary 


Einstein’s theory of general relativity includes accelerated frames and, 
thus, encompasses special relativity and gravity. Created by use of 
careful thought experiments, it has been repeatedly verified by real 
experiments. 

One direct result of this behavior of nature is the gravitational lensing 
of light by massive objects, such as galaxies, also seen in the 
microlensing of light by smaller bodies in our galaxy. 

Another prediction is the existence of black holes, objects for which 
the escape velocity is greater than the speed of light and from which 
nothing can escape. 

The event horizon is the distance from the object at which the escape 
velocity equals the speed of light c. It is called the Schwarzschild 
radius Rg and is given by 

Equation: 


where G is the universal gravitational constant, and M is the mass of 
the body. 

Physics is unknown inside the event horizon, and the possibility of 
wormholes and time travel are being studied. 

Candidates for black holes may power the extremely energetic 
emissions of quasars, distant objects that seem to be early stages of 


galactic evolution. 

e Neutron stars are stellar remnants, having the density of a nucleus, that 
hint that black holes could form from supernovas, too. 

e Gravitational waves are wrinkles in space, predicted by general 
relativity but not yet observed, caused by changes in very massive 
objects. 

¢ Quantum gravity is an incompletely developed theory that strives to 
include general relativity, quantum mechanics, and unification of 
forces (thus, a TOE). 

e One unconfirmed connection between general relativity and quantum 
mechanics is the prediction of characteristic radiation from just outside 
black holes. 


Conceptual Questions 


Exercise: 
Problem: 
Quantum gravity, if developed, would be an improvement on both 
general relativity and quantum mechanics, but more mathematically 
difficult. Under what circumstances would it be necessary to use 
quantum gravity? Similarly, under what circumstances could general 


relativity be used? When could special relativity, quantum mechanics, 
or classical physics be used? 


Exercise: 
Problem: 
Does observed gravitational lensing correspond to a converging or 
diverging lens? Explain briefly. 


Exercise: 


Problem: 


Suppose you measure the red shifts of all the images produced by 
gravitational lensing, such as in [link]. You find that the central image 
has a red shift less than the outer images, and those all have the same 
red shift. Discuss how this not only shows that the images are of the 
same object, but also implies that the red shift is not affected by taking 
different paths through space. Does it imply that cosmological red 
shifts are not caused by traveling through space (light getting tired, 
perhaps)? 


Exercise: 
Problem: 
What are gravitational waves, and have they yet been observed either 
directly or indirectly? 

Exercise: 
Problem: 
Is the event horizon of a black hole the actual physical surface of the 
object? 

Exercise: 
Problem: 
Suppose black holes radiate their mass away and the lifetime of a 
black hole created by a supernova is about 10°’ years. How does this 


lifetime compare with the accepted age of the universe? Is it surprising 
that we do not observe the predicted characteristic radiation? 


Problems & Exercises 


Exercise: 


Problem: 


What is the Schwarzschild radius of a black hole that has a mass eight 
times that of our Sun? Note that stars must be more massive than the 
Sun to form black holes as a result of a supernova. 


Solution: 


23.6 km 
Exercise: 


Problem: 


Black holes with masses smaller than those formed in supernovas may 
have been created in the Big Bang. Calculate the radius of one that has 
a mass equal to the Earth’s. 


Exercise: 


Problem: 


Supermassive black holes are thought to exist at the center of many 
galaxies. 


(a) What is the radius of such an object if it has a mass of 10° Suns? 
(b) What is this radius in light years? 

Solution: 

(a) 2.95 x 102 m 

(b) 3.12 x 10-4 ly 


Exercise: 


Problem: Construct Your Own Problem 


Consider a supermassive black hole near the center of a galaxy. 
Calculate the radius of such an object based on its mass. You must 
consider how much mass is reasonable for these large objects, and 
which is now nearly directly observed. (Information on black holes 
posted on the Web by NASA and other agencies is reliable, for 
example.) 


Glossary 


black holes 
objects having such large gravitational fields that things can fall in, but 
nothing, not even light, can escape 


general relativity 
Einstein’s theory thatdescribes all types of relative motion including 
accelerated motion and the effects of gravity 


gravitational waves 
mass-created distortions in space that propagate at the speed of light 
and that are predicted by general relativity 


escape velocity 
takeoff velocity when kinetic energy just cancels gravitational 
potential energy 


event horizon 
the distance from the object at which the escape velocity is exactly the 
speed of light 


neutron stars 
literally a star composed of neutrons 


Schwarzschild radius 
the radius of the event horizon 


thought experiment 


mental analysis of certain carefully and clearly defined situations to 
develop an idea 


quasars 
the moderately distant galaxies that emit as much or more energy than 
a normal galaxy 


Quantum gravity 
the theory that deals with particle exchange of gravitons as the 
mechanism for the force 


Superstrings 


e Define Superstring theory. 
e Explain the relationship between Superstring theory and the Big Bang. 


Introduced earlier in GUTS: The Unification of Forces Superstring theory 
is an attempt to unify gravity with the other three forces and, thus, must 
contain quantum gravity. The main tenet of Superstring theory is that 
fundamental particles, including the graviton that carries the gravitational 
force, act like one-dimensional vibrating strings. Since gravity affects the 
time and space in which all else exists, Superstring theory is an attempt at a 
Theory of Everything (TOE). Each independent quantum number is thought 
of as a separate dimension in some super space (analogous to the fact that 
the familiar dimensions of space are independent of one another) and is 
represented by a different type of Superstring. As the universe evolved after 
the Big Bang and forces became distinct (spontaneous symmetry breaking), 
some of the dimensions of superspace are imagined to have curled up and 
become unnoticed. 


Forces are expected to be unified only at extremely high energies and at 
particle separations on the order of 10-*° m. This could mean that 
Superstrings must have dimensions or wavelengths of this size or smaller. 
Just as quantum gravity may imply that there are no time intervals shorter 
than some finite value, it also implies that there may be no sizes smaller 
than some tiny but finite value. That may be about 10-*° m. If so, and if 
Superstring theory can explain all it strives to, then the structures of 
Superstrings are at the lower limit of the smallest possible size and can have 
no further substructure. This would be the ultimate answer to the question 
the ancient Greeks considered. There is a finite lower limit to space. 


Not only is Superstring theory in its infancy, it deals with dimensions about 
17 orders of magnitude smaller than the 10~ 1% m details that we have been 
able to observe directly. It is thus relatively unconstrained by experiment, 
and there are a host of theoretical possibilities to choose from. This has led 
theorists to make choices subjectively (as always) on what is the most 
elegant theory, with less hope than usual that experiment will guide them. It 
has also led to speculation of alternate universes, with their Big Bangs 


creating each new universe with a random set of rules. These speculations 
may not be tested even in principle, since an alternate universe is by 
definition unattainable. It is something like exploring a self-consistent field 
of mathematics, with its axioms and rules of logic that are not consistent 
with nature. Such endeavors have often given insight to mathematicians and 
scientists alike and occasionally have been directly related to the 
description of new discoveries. 


Section Summary 
e Superstring theory holds that fundamental particles are one- 


dimensional vibrations analogous to those on strings and is an attempt 
at a theory of quantum gravity. 


Problems & Exercises 


Exercise: 


Problem: 


The characteristic length of entities in Superstring theory is 
approximately 10° °° m. 


(a) Find the energy in GeV of a photon of this wavelength. 


(b) Compare this with the average particle energy of 10'° GeV needed 
for unification of forces. 


Solution: 
(a) 1 x 107° 


(b) 10 times greater 


Glossary 


Superstring theory 


a theory to unify gravity with the other three forces in which the 
fundamental particles are considered to act like one-dimensional 
vibrating strings 


Dark Matter and Closure 


e Discuss the existence of dark matter. 
e Explain neutrino oscillations and their consequences. 


One of the most exciting problems in physics today is the fact that there is 
far more matter in the universe than we can see. The motion of stars in 
galaxies and the motion of galaxies in clusters imply that there is about 10 
times as much mass as in the luminous objects we can see. The indirectly 
observed non-luminous matter is called dark matter. Why is dark matter a 
problem? For one thing, we do not know what it is. It may well be 90% of 
all matter in the universe, yet there is a possibility that it is of a completely 
unknown form—a stunning discovery if verified. Dark matter has 
implications for particle physics. It may be possible that neutrinos actually 
have small masses or that there are completely unknown types of particles. 
Dark matter also has implications for cosmology, since there may be 
enough dark matter to stop the expansion of the universe. That is another 
problem related to dark matter—we do not know how much there is. We 
keep finding evidence for more matter in the universe, and we have an idea 
of how much it would take to eventually stop the expansion of the universe, 
but whether there is enough is still unknown. 


Evidence 


The first clues that there is more matter than meets the eye came from the 
Swiss-born American astronomer Fritz Zwicky in the 1930s; some initial 
work was also done by the American astronomer Vera Rubin. Zwicky 
measured the velocities of stars orbiting the galaxy, using the relativistic 
Doppler shift of their spectra (see [link](a)). He found that velocity varied 
with distance from the center of the galaxy, as graphed in [link](b). If the 
mass of the galaxy was concentrated in its center, as are its luminous stars, 
the velocities should decrease as the square root of the distance from the 
center. Instead, the velocity curve is almost flat, implying that there is a 
tremendous amount of matter in the galactic halo. Although not 
immediately recognized for its significance, such measurements have now 
been made for many galaxies, with similar results. Further, studies of 
galactic clusters have also indicated that galaxies have a mass distribution 


greater than that obtained from their brightness (proportional to the number 
of stars), which also extends into large halos surrounding the luminous parts 
of galaxies. Observations of other EM wavelengths, such as radio waves 
and X rays, have similarly confirmed the existence of dark matter. Take, for 
example, X rays in the relatively dark space between galaxies, which 
indicates the presence of previously unobserved hot, ionized gas (see [link] 


(c)). 


Theoretical Yearnings for Closure 


Is the universe open or closed? That is, will the universe expand forever or 
will it stop, perhaps to contract? This, until recently, was a question of 
whether there is enough gravitation to stop the expansion of the universe. In 
the past few years, it has become a question of the combination of 
gravitation and what is called the cosmological constant. The cosmological 
constant was invented by Einstein to prohibit the expansion or contraction 
of the universe. At the time he developed general relativity, Einstein 
considered that an illogical possibility. The cosmological constant was 
discarded after Hubble discovered the expansion, but has been re-invoked 
in recent years. 


Gravitational attraction between galaxies is slowing the expansion of the 
universe, but the amount of slowing down is not known directly. In fact, the 
cosmological constant can counteract gravity’s effect. As recent 
measurements indicate, the universe is expanding faster now than in the 
past—perhaps a “modern inflationary era” in which the dark energy is 
thought to be causing the expansion of the present-day universe to 
accelerate. If the expansion rate were affected by gravity alone, we should 
be able to see that the expansion rate between distant galaxies was once 
greater than it is now. However, measurements show it was less than now. 
We can, however, calculate the amount of slowing based on the average 
density of matter we observe directly. Here we have a definite answer— 
there is far less visible matter than needed to stop expansion. The critical 
density p, is defined to be the density needed to just halt universal 
expansion in a universe with no cosmological constant. It is estimated to be 
about 

Equation: 


pe © 10-8 kg/m. 


However, this estimate of p, is only good to about a factor of two, due to 
uncertainties in the expansion rate of the universe. The critical density is 
equivalent to an average of only a few nucleons per cubic meter, 
remarkably small and indicative of how truly empty intergalactic space is. 
Luminous matter seems to account for roughly 0.5% to 2% of the critical 
density, far less than that needed for closure. Taking into account the 
amount of dark matter we detect indirectly and all other types of indirectly 
observed normal matter, there is only 10% to 40% of what is needed for 
closure. If we are able to refine the measurements of expansion rates now 
and in the past, we will have our answer regarding the curvature of space 
and we will determine a value for the cosmological constant to justify this 
observation. Finally, the most recent measurements of the CMBR have 
implications for the cosmological constant, so it is not simply a device 
concocted for a single purpose. 


After the recent experimental discovery of the cosmological constant, most 
researchers feel that the universe should be just barely open. Since matter 
can be thought to curve the space around it, we call an open universe 
negatively curved. This means that you can in principle travel an unlimited 
distance in any direction. A universe that is closed is called positively 
curved. This means that if you travel far enough in any direction, you will 
return to your starting point, analogous to circumnavigating the Earth. In 
between these two is a flat (zero curvature) universe. The recent 
discovery of the cosmological constant has shown the universe is very close 
to flat, and will expand forever. Why do theorists feel the universe is flat? 
Flatness is a part of the inflationary scenario that helps explain the flatness 
of the microwave background. In fact, since general relativity implies that 
matter creates the space in which it exists, there is a special symmetry to a 
flat universe. 
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Evidence for dark matter: (a) 
We can measure the 
velocities of stars relative to 
their galaxies by observing 
the Doppler shift in emitted 
light, usually using the 
hydrogen spectrum. These 
measurements indicate the 


rotation of a spiral galaxy. 
(b) A graph of velocity 
versus distance from the 
galactic center shows that 
the velocity does not 
decrease as it would if the 
matter were concentrated in 
luminous stars. The flatness 
of the curve implies a 
massive galactic halo of dark 
matter extending beyond the 
visible stars. (c) This is a 
computer-generated image 
of X rays from a galactic 
cluster. The X rays indicate 
the presence of otherwise 
unseen hot clouds of ionized 
gas in the regions of space 
previously considered more 
empty. (credit: NASA, ESA, 
CXC, M. Bradac (University 
of California, Santa 
Barbara), and S. Allen 
(Stanford University)) 


What Is the Dark Matter We See Indirectly? 


There is no doubt that dark matter exists, but its form and the amount in 
existence are two facts that are still being studied vigorously. As always, we 
seek to explain new observations in terms of known principles. However, as 
more discoveries are made, it is becoming more and more difficult to 
explain dark matter as a known type of matter. 


One of the possibilities for normal matter is being explored using the 
Hubble Space Telescope and employing the lensing effect of gravity on 


light (see [link]). Stars glow because of nuclear fusion in them, but planets 
are visible primarily by reflected light. Jupiter, for example, is too small to 
ignite fusion in its core and become a star, but we can see sunlight reflected 
from it, since we are relatively close. If Jupiter orbited another star, we 
would not be able to see it directly. The question is open as to how many 
planets or other bodies smaller than about 1/1000 the mass of the Sun are 
there. If such bodies pass between us and a star, they will not block the 
star’s light, being too small, but they will form a gravitational lens, as 
discussed in General Relativity and Quantum Gravity. 


In a process called microlensing, light from the star is focused and the star 
appears to brighten in a characteristic manner. Searches for dark matter in 
this form are particularly interested in galactic halos because of the huge 
amount of mass that seems to be there. Such microlensing objects are thus 
called massive compact halo objects, or MACHOs. To date, a few 
MACHOs have been observed, but not predominantly in galactic halos, nor 
in the numbers needed to explain dark matter. 


MACHOs are among the most conventional of unseen objects proposed to 
explain dark matter. Others being actively pursued are red dwarfs, which 
are small dim stars, but too few have been seen so far, even with the Hubble 
Telescope, to be of significance. Old remnants of stars called white dwarfs 
are also under consideration, since they contain about a solar mass, but are 
small as the Earth and may dim to the point that we ordinarily do not 
observe them. While white dwarfs are known, old dim ones are not. Yet 
another possibility is the existence of large numbers of smaller than stellar 
mass black holes left from the Big Bang—here evidence is entirely absent. 


There is a very real possibility that dark matter is composed of the known 
neutrinos, which may have small, but finite, masses. As discussed earlier, 
neutrinos are thought to be massless, but we only have upper limits on their 
masses, rather than knowing they are exactly zero. So far, these upper limits 
come from difficult measurements of total energy emitted in the decays and 
reactions in which neutrinos are involved. There is an amusing possibility 
of proving that neutrinos have mass in a completely different way. 


We have noted in Particles, Patterns, and Conservation Laws that there are 
three flavors of neutrinos (v, v,, and v,) and that the weak interaction 
could change quark flavor. It should also change neutrino flavor—that is, 
any type of neutrino could change spontaneously into any other, a process 
called neutrino oscillations. However, this can occur only if neutrinos have 
a mass. Why? Crudely, because if neutrinos are massless, they must travel 
at the speed of light and time will not pass for them, so that they cannot 
change without an interaction. In 1999, results began to be published 
containing convincing evidence that neutrino oscillations do occur. Using 
the Super-Kamiokande detector in Japan, the oscillations have been 
observed and are being verified and further explored at present at the same 
facility and others. 


Neutrino oscillations may also explain the low number of observed solar 
neutrinos. Detectors for observing solar neutrinos are specifically designed 
to detect electron neutrinos v, produced in huge numbers by fusion in the 
Sun. A large fraction of electron neutrinos 1, may be changing flavor to 
muon neutrinos v,, on their way out of the Sun, possibly enhanced by 
specific interactions, reducing the flux of electron neutrinos to observed 
levels. There is also a discrepancy in observations of neutrinos produced in 
cosmic ray showers. While these showers of radiation produced by 
extremely energetic cosmic rays should contain twice as many v,, § aS l% S, 
their numbers are nearly equal. This may be explained by neutrino 
oscillations from muon flavor to electron flavor. Massive neutrinos are a 
particularly appealing possibility for explaining dark matter, since their 
existence is consistent with a large body of known information and explains 
more than dark matter. The question is not settled at this writing. 


The most radical proposal to explain dark matter is that it consists of 
previously unknown leptons (sometimes obtusely referred to as non- 
baryonic matter). These are called weakly interacting massive particles, 
or WIMPs, and would also be chargeless, thus interacting negligibly with 
normal matter, except through gravitation. One proposed group of WIMPs 
would have masses several orders of magnitude greater than nucleons and 
are sometimes called neutralinos. Others are called axions and would have 
masses about 107° that of an electron mass. Both neutralinos and axions 
would be gravitationally attached to galaxies, but because they are 


chargeless and only feel the weak force, they would be in a halo rather than 
interact and coalesce into spirals, and so on, like normal matter (see [link]). 


The Hubble Space Telescope 
is producing exciting data 
with its corrected optics and 
with the absence of 
atmospheric distortion. It has 
observed some MACHOs, 
disks of material around 
stars thought to precede 
planet formation, black hole 
candidates, and collisions of 
comets with Jupiter. (credit: 
NASA (crew of STS-125)) 


Dark matter may shepherd 
normal matter gravitationally 
in space, as this stream 
moves the leaves. Dark 
matter may be invisible and 
even move through the 
normal matter, as neutrinos 
penetrate us without small- 
scale effect. (credit: Shinichi 
Sugiyama) 


Some particle theorists have built WIMPs into their unified force theories 
and into the inflationary scenario of the evolution of the universe so popular 
today. These particles would have been produced in just the correct 
numbers to make the universe flat, shortly after the Big Bang. The proposal 
is radical in the sense that it invokes entirely new forms of matter, in fact 
two entirely new forms, in order to explain dark matter and other 
phenomena. WIMPs have the extra burden of automatically being very 
difficult to observe directly. This is somewhat analogous to quark 
confinement, which guarantees that quarks are there, but they can never be 
seen directly. One of the primary goals of the LHC at CERN, however, is to 
produce and detect WIMPs. At any rate, before WIMPs are accepted as the 
best explanation, all other possibilities utilizing known phenomena will 
have to be shown inferior. Should that occur, we will be in the unanticipated 
position of admitting that, to date, all we know is only 10% of what exists. 


A far cry from the days when people firmly believed themselves to be not 
only the center of the universe, but also the reason for its existence. 


Section Summary 


e Dark matter is non-luminous matter detected in and around galaxies 
and galactic clusters. 

e It may be 10 times the mass of the luminous matter in the universe, 
and its amount may determine whether the universe is open or closed 
(expands forever or eventually stops). 

e The determining factor is the critical density of the universe and the 
cosmological constant, a theoretical construct intimately related to the 
expansion and closure of the universe. 

e The critical density p, is the density needed to just halt universal 
expansion. It is estimated to be approximately 10-76 kg/m?. 

e An open universe is negatively curved, a closed universe is positively 
curved, whereas a universe with exactly the critical density is flat. 

e Dark matter’s composition is a major mystery, but it may be due to the 
suspected mass of neutrinos or a completely unknown type of leptonic 
matter. 

e If neutrinos have mass, they will change families, a process known as 
neutrino oscillations, for which there is growing evidence. 


Conceptual Questions 


Exercise: 


Problem: 


Discuss the possibility that star velocities at the edges of galaxies 
being greater than expected is due to unknown properties of gravity 
rather than to the existence of dark matter. Would this mean, for 
example, that gravity is greater or smaller than expected at large 
distances? Are there other tests that could be made of gravity at large 
distances, such as observing the motions of neighboring galaxies? 


Exercise: 


Problem: 


How does relativistic time dilation prohibit neutrino oscillations if they 
are massless? 


Exercise: 


Problem: 


If neutrino oscillations do occur, will they violate conservation of the 
various lepton family numbers (Le, L,,, and L-)? Will neutrino 
oscillations violate conservation of the total number of leptons? 


Exercise: 


Problem: 


Lacking direct evidence of WIMPs as dark matter, why must we 
eliminate all other possible explanations based on the known forms of 
matter before we invoke their existence? 


Problems Exercises 


Exercise: 


Problem: 


If the dark matter in the Milky Way were composed entirely of 
MACHOs (evidence shows it is not), approximately how many would 
there have to be? Assume the average mass of a MACHO is 1/1000 
that of the Sun, and that dark matter has a mass 10 times that of the 
luminous Milky Way galaxy with its 10" stars of average mass 1.5 
times the Sun’s mass. 


Solution: 
Equation: 


1.5 x 10% 


Exercise: 
Problem: 


The critical mass density needed to just halt the expansion of the 
universe is approximately ii;= kg / m?, 


(a) Convert this to eV/c? - m°. 


(b) Find the number of neutrinos per cubic meter needed to close the 
universe if their average mass is 7 eV /c” and they have negligible 
kinetic energies. 


Exercise: 
Problem: 
Assume the average density of the universe is 0.1 of the critical density 


needed for closure. What is the average number of protons per cubic 
meter, assuming the universe is composed mostly of hydrogen? 


Solution: 
Equation: 


0.6m? 


Exercise: 


Problem: 


To get an idea of how empty deep space is on the average, perform the 
following calculations: 


(a) Find the volume our Sun would occupy if it had an average density 
equal to the critical density of 10°7° kg /m? thought necessary to halt 
the expansion of the universe. 


(b) Find the radius of a sphere of this volume in light years. 


(c) What would this radius be if the density were that of luminous 
matter, which is approximately 5% that of the critical density? 


(d) Compare the radius found in part (c) with the 4-ly average 
separation of stars in the arms of the Milky Way. 


Glossary 


axions 
a type of WIMPs having masses about 10°! of an electron mass 


cosmological constant 
a theoretical construct intimately related to the expansion and closure 
of the universe 


critical density 
the density of matter needed to just halt universal expansion 


dark matter 
indirectly observed non-luminous matter 


flat (zero curvature) universe 
a universe that is infinite but not curved 


microlensing 
a process in which light from a distant star is focused and the star 
appears to brighten in a characteristic manner, when a small body 
(smaller than about 1/1000 the mass of the Sun) passes between us and 
the star 


MACHOs 
massive compact halo objects; microlensing objects of huge mass 


neutrino oscillations 
a process in which any type of neutrino could change spontaneously 
into any other 


neutralinos 
a type of WIMPs having masses several orders of magnitude greater 
than nucleon masses 


negatively curved 
an open universe that expands forever 


positively curved 
a universe that is closed and eventually contracts 


WIMPs 
weakly interacting massive particles; chargeless leptons (non-baryonic 
matter) interacting negligibly with normal matter 


Complexity and Chaos 


e Explain complex systems. 
e Discuss chaotic behavior of different systems. 


Much of what impresses us about physics is related to the underlying 
connections and basic simplicity of the laws we have discovered. The 
language of physics is precise and well defined because many basic systems 
we study are simple enough that we can perform controlled experiments 
and discover unambiguous relationships. Our most spectacular successes, 
such as the prediction of previously unobserved particles, come from the 
simple underlying patterns we have been able to recognize. But there are 
systems of interest to physicists that are inherently complex. The simple 
laws of physics apply, of course, but complex systems may reveal patterns 
that simple systems do not. The emerging field of complexity is devoted to 
the study of complex systems, including those outside the traditional 
bounds of physics. Of particular interest is the ability of complex systems to 
adapt and evolve. 


What are some examples of complex adaptive systems? One is the 
primordial ocean. When the oceans first formed, they were a random mix of 
elements and compounds that obeyed the laws of physics and chemistry. In 
a relatively short geological time (about 500 million years), life had 
emerged. Laboratory simulations indicate that the emergence of life was far 
too fast to have come from random combinations of compounds, even if 
driven by lightning and heat. There must be an underlying ability of the 
complex system to organize itself, resulting in the self-replication we 
recognize as life. Living entities, even at the unicellular level, are highly 
organized and systematic. Systems of living organisms are themselves 
complex adaptive systems. The grandest of these evolved into the biological 
system we have today, leaving traces in the geological record of steps taken 
along the way. 


Complexity as a discipline examines complex systems, how they adapt and 
evolve, looking for similarities with other complex adaptive systems. Can, 
for example, parallels be drawn between biological evolution and the 
evolution of economic systems? Economic systems do emerge quickly, they 
show tendencies for self-organization, they are complex (in the number and 


types of transactions), and they adapt and evolve. Biological systems do all 
the same types of things. There are other examples of complex adaptive 
systems being studied for fundamental similarities. Cultures show signs of 
adaptation and evolution. The comparison of different cultural evolutions 
may bear fruit as well as comparisons to biological evolution. Science also 
is acomplex system of human interactions, like culture and economics, that 
adapts to new information and political pressure, and evolves, usually 
becoming more organized rather than less. Those who study creative 
thinking also see parallels with complex systems. Humans sometimes 
organize almost random pieces of information, often subconsciously while 
doing other things, and come up with brilliant creative insights. The 
development of language is another complex adaptive system that may 
show similar tendencies. Artificial intelligence is an overt attempt to devise 
an adaptive system that will self-organize and evolve in the same manner as 
an intelligent living being learns. These are a few of the broad range of 
topics being studied by those who investigate complexity. There are now 
institutes, journals, and meetings, as well as popularizations of the emerging 
topic of complexity. 


In traditional physics, the discipline of complexity may yield insights in 
certain areas. Thermodynamics treats systems on the average, while 
statistical mechanics deals in some detail with complex systems of atoms 
and molecules in random thermal motion. Yet there is organization, 
adaptation, and evolution in those complex systems. Non-equilibrium 
phenomena, such as heat transfer and phase changes, are characteristically 
complex in detail, and new approaches to them may evolve from 
complexity as a discipline. Crystal growth is another example of self- 
organization spontaneously emerging in a complex system. Alloys are also 
inherently complex mixtures that show certain simple characteristics 
implying some self-organization. The organization of iron atoms into 
magnetic domains as they cool is another. Perhaps insights into these 
difficult areas will emerge from complexity. But at the minimum, the 
discipline of complexity is another example of human effort to understand 
and organize the universe around us, partly rooted in the discipline of 
physics. 


A predecessor to complexity is the topic of chaos, which has been widely 
publicized and has become a discipline of its own. It is also based partly in 
physics and treats broad classes of phenomena from many disciplines. 
Chaos is a word used to describe systems whose outcomes are extremely 
sensitive to initial conditions. The orbit of the planet Pluto, for example, 
may be chaotic in that it can change tremendously due to small interactions 
with other planets. This makes its long-term behavior impossible to predict 
with precision, just as we cannot tell precisely where a decaying Earth 
satellite will land or how many pieces it will break into. But the discipline 
of chaos has found ways to deal with such systems and has been applied to 
apparently unrelated systems. For example, the heartbeat of people with 
certain types of potentially lethal arrhythmias seems to be chaotic, and this 
knowledge may allow more sophisticated monitoring and recognition of the 
need for intervention. 


Chaos is related to complexity. Some chaotic systems are also inherently 
complex; for example, vortices in a fluid as opposed to a double pendulum. 
Both are chaotic and not predictable in the same sense as other systems. But 
there can be organization in chaos and it can also be quantified. Examples 
of chaotic systems are beautiful fractal patterns such as in [link]. Some 
chaotic systems exhibit self-organization, a type of stable chaos. The orbits 
of the planets in our solar system, for example, may be chaotic (we are not 
certain yet). But they are definitely organized and systematic, with a simple 
formula describing the orbital radii of the first eight planets and the asteroid 
belt. Large-scale vortices in Jupiter’s atmosphere are chaotic, but the Great 
Red Spot is a stable self-organization of rotational energy. (See [link].) The 
Great Red Spot has been in existence for at least 400 years and is a complex 
self-adaptive system. 


The emerging field of complexity, like the now almost traditional field of 
chaos, is partly rooted in physics. Both attempt to see similar systematics in 
a very broad range of phenomena and, hence, generate a better 
understanding of them. Time will tell what impact these fields have on 
more traditional areas of physics as well as on the other disciplines they 
relate to. 


This image is related to 
the Mandelbrot set, a 
complex mathematical 
form that is chaotic. The 
patterns are infinitely fine 
as you look closer and 
closer, and they indicate 
order in the presence of 
chaos. (credit: Gilberto 
Santa Rosa) 


The Great Red Spot on 
Jupiter is an example of 
self-organization in a 
complex and chaotic 
system. Smaller vortices 
in Jupiter’s atmosphere 


behave chaotically, but 
the triple-Earth-size spot 
is self-organized and 
stable for at least 
hundreds of years. (credit: 
NASA) 


Section Summary 


¢ Complexity is an emerging field, rooted primarily in physics, that 
considers complex adaptive systems and their evolution, including 
self-organization. 

e¢ Complexity has applications in physics and many other disciplines, 
such as biological evolution. 

e Chaos is a field that studies systems whose properties depend 
extremely sensitively on some variables and whose evolution is 
impossible to predict. 

e Chaotic systems may be simple or complex. 

e Studies of chaos have led to methods for understanding and predicting 
certain chaotic behaviors. 


Conceptual Questions 


Exercise: 


Problem: 


Must a complex system be adaptive to be of interest in the field of 
complexity? Give an example to support your answer. 


Exercise: 


Problem: State a necessary condition for a system to be chaotic. 


Glossary 


complexity 
an emerging field devoted to the study of complex systems 


chaos 
word used to describe systems the outcomes of which are extremely 
sensitive to initial conditions 


High-temperature Superconductors 


e Identify superconductors and their uses. 
e Discuss the need for a high-T, superconductor. 


Superconductors are materials with a resistivity of zero. They are familiar 
to the general public because of their practical applications and have been 
mentioned at a number of points in the text. Because the resistance of a 
piece of superconductor is zero, there are no heat losses for currents through 
them; they are used in magnets needing high currents, such as in MRI 
machines, and could cut energy losses in power transmission. But most 
superconductors must be cooled to temperatures only a few kelvin above 
absolute zero, a costly procedure limiting their practical applications. In the 
past decade, tremendous advances have been made in producing materials 
that become superconductors at relatively high temperatures. There is hope 
that room temperature superconductors may someday be manufactured. 


Superconductivity was discovered accidentally in 1911 by the Dutch 
physicist H. Kamerlingh Onnes (1853-1926) when he used liquid helium to 
cool mercury. Onnes had been the first person to liquefy helium a few years 
earlier and was surprised to observe the resistivity of a mediocre conductor 
like mercury drop to zero at a temperature of 4.2 K. We define the 
temperature at which and below which a material becomes a 
superconductor to be its critical temperature, denoted by 7T¢. (See [link].) 
Progress in understanding how and why a material became a 
superconductor was relatively slow, with the first workable theory coming 
in 1957. Certain other elements were also found to become 
superconductors, but all had 7. s less than 10 K, which are expensive to 
maintain. Although Onnes received a Nobel prize in 1913, it was primarily 
for his work with liquid helium. 


In 1986, a breakthrough was announced—a ceramic compound was found 
to have an unprecedented J; of 35 K. It looked as if much higher critical 
temperatures could be possible, and by early 1988 another ceramic (this of 
thallium, calcium, barium, copper, and oxygen) had been found to have 

T. = 125 K (see [link].) The economic potential of perfect conductors 
saving electric energy is immense for T; s above 77 K, since that is the 
temperature of liquid nitrogen. Although liquid helium has a boiling point 


of 4 K and can be used to make materials superconducting, it costs about $5 
per liter. Liquid nitrogen boils at 77 K, but only costs about $0.30 per liter. 
There was general euphoria at the discovery of these complex ceramic 
superconductors, but this soon subsided with the sobering difficulty of 
forming them into usable wires. The first commercial use of a high 
temperature superconductor is in an electronic filter for cellular phones. 
High-temperature superconductors are used in experimental apparatus, and 
they are actively being researched, particularly in thin film applications. 
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A graph of resistivity 
versus temperature for a 
superconductor shows a 
sharp transition to zero at 

the critical temperature 

T,. High temperature 
superconductors have 
verifiable T, s greater 
than 125 K, well above 
the easily achieved 77-K 
temperature of liquid 
nitrogen. 


= 


One characteristic of a 
superconductor is that it 
excludes magnetic flux 
and, thus, repels other 
magnets. The small 
magnet levitated above a 
high-temperature 
superconductor, which is 
cooled by liquid nitrogen, 
gives evidence that the 
material is 
superconducting. When 
the material warms and 
becomes conducting, 
magnetic flux can 
penetrate it, and the 
magnet will rest upon it. 
(credit: Saperaud) 


The search is on for even higher T, superconductors, many of complex and 
exotic copper oxide ceramics, sometimes including strontium, mercury, or 
yttrium as well as barium, calcium, and other elements. Room temperature 
(about 293 K) would be ideal, but any temperature close to room 
temperature is relatively cheap to produce and maintain. There are 
persistent reports of TJ, s over 200 K and some in the vicinity of 270 K. 
Unfortunately, these observations are not routinely reproducible, with 


samples losing their superconducting nature once heated and recooled 
(cycled) a few times (see [link].) They are now called USOs or unidentified 
superconducting objects, out of frustration and the refusal of some samples 
to show high 7’, even though produced in the same manner as others. 
Reproducibility is crucial to discovery, and researchers are justifiably 
reluctant to claim the breakthrough they all seek. Time will tell whether 
USOs are real or an experimental quirk. 


The theory of ordinary superconductors is difficult, involving quantum 
effects for widely separated electrons traveling through a material. 
Electrons couple in a manner that allows them to get through the material 
without losing energy to it, making it a superconductor. High- 7; 
superconductors are more difficult to understand theoretically, but theorists 
seem to be closing in on a workable theory. The difficulty of understanding 
how electrons can sneak through materials without losing energy in 
collisions is even greater at higher temperatures, where vibrating atoms 
should get in the way. Discoverers of high 7, may feel something analogous 
to what a politician once said upon an unexpected election victory—“I 
wonder what we did right?” 
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(a) This graph, adapted from an article in 
Physics Today, shows the behavior of a 
single sample of a high-temperature 
superconductor in three different trials. In 
one case the sample exhibited a TJ, of 
about 230 K, whereas in the others it did 
not become superconducting at all. The 
lack of reproducibility is typical of 
forefront experiments and prohibits 
definitive conclusions. (b) This colorful 
diagram shows the complex but 
systematic nature of the lattice structure 
of a high-temperature superconducting 


ceramic. (credit: en:Cadmium, 
Wikimedia Commons) 


Section Summary 


e High-temperature superconductors are materials that become 
superconducting at temperatures well above a few kelvin. 

e The critical temperature 7’, is the temperature below which a material 
is superconducting. 

e Some high-temperature superconductors have verified T,,s above 125 
K, and there are reports of TJ. s as high as 250 K. 


Conceptual Questions 


Exercise: 
Problem: 
What is critical temperature 7’? Do all materials have a critical 
temperature? Explain why or why not. 
Exercise: 
Problem: 
Explain how good thermal contact with liquid nitrogen can keep 


objects at a temperature of 77 K (liquid nitrogen’s boiling point at 
atmospheric pressure). 


Exercise: 


Problem: 


Not only is liquid nitrogen a cheaper coolant than liquid helium, its 
boiling point is higher (77 K vs. 4.2 K). How does higher temperature 
help lower the cost of cooling a material? Explain in terms of the rate 
of heat transfer being related to the temperature difference between the 
sample and its surroundings. 


Problem Exercises 


Exercise: 


Problem: 


A section of superconducting wire carries a current of 100 A and 
requires 1.00 L of liquid nitrogen per hour to keep it below its critical 
temperature. For it to be economically advantageous to use a 
superconducting wire, the cost of cooling the wire must be less than 
the cost of energy lost to heat in the wire. Assume that the cost of 
liquid nitrogen is $0.30 per liter, and that electric energy costs $0.10 
per kW-h. What is the resistance of a normal wire that costs as much in 
wasted electric energy as the cost of liquid nitrogen for the 
superconductor? 


Solution: 
Equation: 


0.30 


Glossary 


Superconductors 
materials with resistivity of zero 


critical temperature 
the temperature at which and below which a material becomes a 
superconductor 


Some Questions We Know to Ask 


e Identify sample questions to be asked on the largest scales. 
e Identify sample questions to be asked on the intermediate scale. 
¢ Identify sample questions to be asked on the smallest scales. 


Throughout the text we have noted how essential it is to be curious and to 
ask questions in order to first understand what is known, and then to goa 
little farther. Some questions may go unanswered for centuries; others may 
not have answers, but some bear delicious fruit. Part of discovery is 
knowing which questions to ask. You have to know something before you 
can even phrase a decent question. As you may have noticed, the mere act 
of asking a question can give you the answer. The following questions are a 
sample of those physicists now know to ask and are representative of the 
forefronts of physics. Although these questions are important, they will be 
replaced by others if answers are found to them. The fun continues. 


On the Largest Scale 


1. Is the universe open or closed? Theorists would like it to be just barely 
closed and evidence is building toward that conclusion. Recent 
measurements in the expansion rate of the universe and in CMBR 
support a flat universe. There is a connection to small-scale physics in 
the type and number of particles that may contribute to closing the 
universe. 

2. What is dark matter? It is definitely there, but we really do not know 
what it is. Conventional possibilities are being ruled out, but one of 
them still may explain it. The answer could reveal whole new realms 
of physics and the disturbing possibility that most of what is out there 
is unknown to us, a completely different form of matter. 

3. How do galaxies form? They exist since very early in the evolution of 
the universe and it remains difficult to understand how they evolved so 
quickly. The recent finer measurements of fluctuations in the CMBR 
may yet allow us to explain galaxy formation. 

4. What is the nature of various-mass black holes? Only recently have we 
become confident that many black hole candidates cannot be explained 
by other, less exotic possibilities. But we still do not know much about 


how they form, what their role in the history of galactic evolution has 
been, and the nature of space in their vicinity. However, so many black 
holes are now known that correlations between black hole mass and 
galactic nuclei characteristics are being studied. 

5. What is the mechanism for the energy output of quasars? These distant 
and extraordinarily energetic objects now seem to be early stages of 
galactic evolution with a supermassive black-hole-devouring material. 
Connections are now being made with galaxies having energetic cores, 
and there is evidence consistent with less consuming, supermassive 
black holes at the center of older galaxies. New instruments are 
allowing us to see deeper into our own galaxy for evidence of our own 
massive black hole. 

6. Where do the y bursts come from? We see bursts of y rays coming 
from all directions in space, indicating the sources are very distant 
objects rather than something associated with our own galaxy. Some 
bursts finally are being correlated with known sources so that the 
possibility they may originate in binary neutron star interactions or 
black holes eating a companion neutron star can be explored. 


On the Intermediate Scale 


1. How do phase transitions take place on the microscopic scale? We 
know a lot about phase transitions, such as water freezing, but the 
details of how they occur molecule by molecule are not well 
understood. Similar questions about specific heat a century ago led to 
early quantum mechanics. It is also an example of a complex adaptive 
system that may yield insights into other self-organizing systems. 

2. Is there a way to deal with nonlinear phenomena that reveals 
underlying connections? Nonlinear phenomena lack a direct or linear 
proportionality that makes analysis and understanding a little easier. 
There are implications for nonlinear optics and broader topics such as 
chaos. 

3. How do high- T.. superconductors become resistanceless at such high 
temperatures? Understanding how they work may help make them 
more practical or may result in surprises as unexpected as the 
discovery of superconductivity itself. 


. There are magnetic effects in materials we do not understand—how do 


they work? Although beyond the scope of this text, there is a great deal 
to learn in condensed matter physics (the physics of solids and 
liquids). We may find surprises analogous to lasing, the quantum Hall 
effect, and the quantization of magnetic flux. Complexity may play a 
role here, too. 


On the Smallest Scale 


1. Are quarks and leptons fundamental, or do they have a substructure? 


Ds 


The higher energy accelerators that are just completed or being 
constructed may supply some answers, but there will also be input 
from cosmology and other systematics. 


. Why do leptons have integral charge while quarks have fractional 


charge? If both are fundamental and analogous as thought, this 
question deserves an answer. It is obviously related to the previous 
question. 

Why are there three families of quarks and leptons? First, does this 
imply some relationship? Second, why three and only three families? 


4. Are all forces truly equal (unified) under certain circumstances? They 


don’t have to be equal just because we want them to be. The answer 
may have to be indirectly obtained because of the extreme energy at 
which we think they are unified. 


5. Are there other fundamental forces? There was a flurry of activity with 


claims of a fifth and even a sixth force a few years ago. Interest has 
subsided, since those forces have not been detected consistently. 
Moreover, the proposed forces have strengths similar to gravity, 
making them extraordinarily difficult to detect in the presence of 
stronger forces. But the question remains; and if there are no other 
forces, we need to ask why only four and why these four. 


. Is the proton stable? We have discussed this in some detail, but the 


question is related to fundamental aspects of the unification of forces. 
We may never know from experiment that the proton is stable, only 
that it is very long lived. 


7. Are there magnetic monopoles? Many particle theories call for very 


massive individual north- and south-pole particles—magnetic 


monopoles. If they exist, why are they so different in mass and 
elusiveness from electric charges, and if they do not exist, why not? 

8. Do neutrinos have mass? Definitive evidence has emerged for 
neutrinos having mass. The implications are significant, as discussed 
in this chapter. There are effects on the closure of the universe and on 
the patterns in particle physics. 

9. What are the systematic characteristics of high- Z nuclei? All 
elements with Z = 118 or less (with the exception of 115 and 117) 
have now been discovered. It has long been conjectured that there may 
be an island of relative stability near Z = 114, and the study of the 
most recently discovered nuclei will contribute to our understanding of 
nuclear forces. 


These lists of questions are not meant to be complete or consistently 
important—you can no doubt add to it yourself. There are also important 
questions in topics not broached in this text, such as certain particle 
symmetries, that are of current interest to physicists. Hopefully, the point is 
clear that no matter how much we learn, there always seems to be more to 
know. Although we are fortunate to have the hard-won wisdom of those 
who preceded us, we can look forward to new enlightenment, undoubtedly 
sprinkled with surprise. 


Section Summary 


e On the largest scale, the questions which can be asked may be about 
dark matter, dark energy, black holes, quasars, and other aspects of the 
universe. 

e On the intermediate scale, we can query about gravity, phase 
transitions, nonlinear phenomena, high- 7, superconductors, and 
magnetic effects on materials. 

e On the smallest scale, questions may be about quarks and leptons, 
fundamental forces, stability of protons, and existence of monopoles. 


Conceptual Questions 


Exercise: 


Problem: 


For experimental evidence, particularly of previously unobserved 
phenomena, to be taken seriously it must be reproducible or of 
sufficiently high quality that a single observation is meaningful. 
Supernova 1987A is not reproducible. How do we know observations 
of it were valid? The fifth force is not broadly accepted. Is this due to 
lack of reproducibility or poor-quality experiments (or both)? Discuss 
why forefront experiments are more subject to observational problems 
than those involving established phenomena. 


Exercise: 
Problem: 


Discuss whether you think there are limits to what humans can 
understand about the laws of physics. Support your arguments. 


Useful Information 
This appendix is broken into several tables. 


[link], Important Constants 

[link], Submicroscopic Masses 

[link], Solar System Data 

[link], Metric Prefixes for Powers of Ten and Their Symbols 
[link], The Greek Alphabet 

[link], SI units 

[link], Selected British Units 

[link], Other Units 

[link], Useful Formulae 


e e e e e e e e e 


Symbol Meaning Best Value Approximate Value 
Speed of 

c light in 2.99792458 x 10°m/s 3.00 x 10°m/s 
vacuum 
Gravitational -UNQ . 2 /bo2 lV . 2 /ke2 

G pene 6.67408(31) x 10°" N- m?/kg 6.67 x 10°" N- m?/kg 

N Avogadro’s 23 23 

‘A oie as 6.02214129(27) x 10 6.02 x 10 

k Bolan 1.3806488(13) x 10-J/K 1.38 x 10-33 /K 
constant 

R Gas constant 8.3144621(75) J/mol - K 8.31 J/mol- K = 1.99 cal/mol - K = 
Stefan- 

oO Boltzmann 5.670373(21) x 10°°W/m?-K 5.67 x 10 °W/m?-K 
constant 
Coulomb 

k force 8.987551788... x 10°N-m?/C? 8.99 x 10°9N-m?/C? 
constant 

de a —1.602176565(35) x 10°C ~1.60 x 10°C 

Ep “ee 8.854187817... x 10-2C?/N-m2 8.85 x 107!2C?/N- m? 
Permeability -7m, 6m, 

Lo Gf free space 4n x 10°'T-m/A 1.26 x 10°T-m/A 
Planck’s —34 —34 

h 6.62606957(29) x 10° *J-s 6.63 x 10°*4J-s 
constant 


Important Constants! footnote! 


Stated values are according to the National Institute of Standards and Technology Reference on Constants, Units, an 


www.physics.nist.gov/cuu (accessed May 18, 2012). Values in parentheses are the uncertainties in the last digits. Nu 
are exact as defined. 


Symbol Meaning Best Value Approximate Value 
Me Electron mass 9.10938291(40) x 10-*/ke 9.11 x 10 “kg 
Mp Proton mass 1.672621777(74) x 10 *"kg 1.6726 x 10 *"ke 
Mn Neutron mass 1.674927351(74) x 10 ?"kg 1.6749 x 10° *"kg 
u Atomic mass unit 1.660538921(73) x 10° "kg 1.6605 x 10°-?"kg 


Submicroscopic Masses/footnote] 

Stated values are according to the National Institute of Standards and Technology Reference on Constants, Units, 
and Uncertainty, www.physics.nist.gov/cuu (accessed May 18, 2012). Values in parentheses are the uncertainties in 
the last digits. Numbers without uncertainties are exact as defined. 


Sun mass 1.99 x 10*°*kg 


Earth 


average radius 


Earth-sun distance (average) 


mass 


average radius 


orbital period 


6.96 x 108m 


1.496 x 10'!m 


5.9736 x 10%kg 


6.376 x 10°m 


3.16 x 10’s 


Moon 


Solar System Data 


Metric Prefixes for Powers of Ten and Their Symbols 


Alpha 
Beta 
Gamma 


Delta 


ia 


> 


mass 


average radius 


orbital period (average) 


Earth-moon distance (average) 


Symbol 


da 


Value 
10” 
10° 


10° 
10° 


102 
10! 


10°( = 1) 


Prefix 
deci 
centi 


milli 


micro 


nano 
pico 


femto 


Nu 
Xi 
Omicron 


Pi 


UH] 


7.35 x 10"kg 


1.74 x 10m 
2.36 x 108s 
3.84 x 10°m 
Symbol Value 
d 10-1 
Cc 10~? 
m 10-3 
iy 10-6 
n 10-° 
p 19712 
f 10-16 
Vv Tau T 
é Upsilon YT 
fo) Phi ® 
T Chi xX 


Epsilon E € 


Zeta Z ¢ 


The Greek Alphabet 


Fundamental units 


Supplementary unit 


Derived units 


Lambda A A 


Mu 


M Le 


Entity 
Length 
Mass 
Time 
Current 


Angle 


Force 


Energy 


Power 


Pressure 


Frequency 


Electronic potential 


Capacitance 


Charge 


Resistance 


P Pp 
>», oO 
Abbreviation 
m 
kg 
S 
A 
rad 
N = kg- m/s? 
J =kg-m?/s? 
W=4J/s 
Pa = N/m? 
Hz = 1/s 
V=J/C 
F=C/V 
C=s-A 


Q=V/A 


Name 
meter 
kilogram 
second 
ampere 


radian 


newton 


joule 


watt 


pascal 


hertz 


volt 


farad 


coulomb 


ohm 


SI Units 


Length 


Force 
Energy 
Power 
Pressure 


Selected British Units 


Length 


Area 


Volume 


Entity Abbreviation Name 


Magnetic field tesla 
T=N/(A-m) 


Nuclear decay rate Bq = 1/s becquerel 


linch (in.) = 2.54 cm (exactly) 

1 foot (ft) = 0.3048 m 

1 mile (mi) = 1.609 km 

1 pound (Ib) = 4.448 N 

1 British thermal unit (Btu) = 1.055 x 10° J 
1 horsepower (hp) = 746 W 


1 lb/in? = 6.895 x 10? Pa 


1 light year (ly) = 9.46 x 10'°m 

1 astronomical unit (au) = 1.50 x 10/4m 
1 nautical mile = 1.852 km 

1 angstrom(A) =107!°m 

1 acre (ac) = 4.05 x 10° m? 

1 square foot (ft?) = 9.29 x 10-? m? 

1 barn (b) = 10-78 m? 


1 liter (L) = 10° m? 


Mass 


Time 


Speed 


Angle 


Energy 


Pressure 


Nuclear decay rate 


1 USS. gallon (gal) = 3.785 x 10-3 m? 

1solar mass = 1.99 x 10° kg 

1 metric ton = 10° kg 

1 atomic mass unit (u) = 1.6605 x 10°?" kg 

1 year (y) = 3.16 x 10’s 

1 day (d) = 86,400 s 

1 mile per hour (mph) = 1.609 km/h 

1 nautical mile per hour (naut) = 1.852 km/h 
1 degree (°) = 1.745 x 10-* rad 

1 minute of arc () = 1/60 degree 

1 second of arc (’) = 1/60 minute of arc 

1 grad = 1.571 x 10-* rad 

1 kiloton TNT (kT) = 4.2 x 10” J 

1 kilowatt hour (kW - h) = 3.60 x 10°J 

1 food calorie (kcal) = 4186 J 

1 calorie (cal) = 4.186 J 

1 electron volt (eV) = 1.60 x 10-9 J 

1 atmosphere (atm) = 1.013 x 10° Pa 

1 millimeter of mercury (mm Hg) = 133.3 Pa 
1 torricelli (torr) = 1 mm Hg = 133.3 Pa 


1 curie (Ci) = 3.70 x 10'° Bq 


Other Units 
Circumference of a circle with radius r or diameter d C = 2nr = 7d 
Area of a circle with radius r or diameter d A=rr =nd /4 


Area of a sphere with radius r 


A=A4nr? 


Volume of a sphere with radius r V = (4/3) ar 


Useful Formulae 


